Jedidiah Morse, The American Gazetteer (Boston 1797) [plain text, TEI/XML format]

This dataset contains the digitised text of Jedidiah Morse's The American Gazetteer (Boston 1797). The text was digitised from scans provided by the John Adams Library at the Boston Public Library (Internet Archive), using the handwritten text recognition (HTR) software Transkribus.

The text is provided in several formats (txt, TEI/XML, ALTO line, ALTO word, Transkribus PAGE/XML), stored in separate folders. We have also included two extra plain text files: one ('§') containing only the place name lemmas (without the introduction and appendices), and one that was manually edited to correct a limited number of incorrect line breaks (e.g. 'New¬York' instead of 'New-York').
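For readers who want to inspect the remaining line-break markers in the unedited plain text themselves, the following is a minimal Python sketch. It is not the procedure used to produce the manually edited file. It assumes that '¬' marks a position where a word was joined across a line break, and the function names `find_marked_words` and `restore_hyphen` are hypothetical helpers introduced only for illustration.

```python
import re

# Assumption: '¬' is the marker left where a word was joined across a line break.
MARKER = "¬"
PATTERN = re.compile(rf"\w+{MARKER}\w+")


def find_marked_words(text: str) -> list[str]:
    """Return every token in which the marker sits between word characters."""
    return PATTERN.findall(text)


def restore_hyphen(text: str, words: set[str]) -> str:
    """Replace the marker with a hyphen, but only in the explicitly listed tokens."""
    def fix(match: re.Match) -> str:
        token = match.group(0)
        return token.replace(MARKER, "-") if token in words else token

    return PATTERN.sub(fix, text)


sample = "NEW¬YORK, a city; Phila¬delphia."
candidates = find_marked_words(sample)
fixed = restore_hyphen(sample, {"NEW¬YORK"})
```

Listing the candidates first and restoring hyphens only for reviewed tokens keeps the decision manual, since most markers indicate ordinary line-end word breaks rather than genuine hyphenated names.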

The dataset 'Ground Truth' contains the transcriptions that were manually created to train the Transkribus HTR model.

Transkribus, 1.4.2

Identifier
DOI https://doi.org/10.34894/WYCC7G
Metadata Access https://dataverse.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34894/WYCC7G
Provenance
Creator Stapel, Rombert (ORCID: 0000-0001-6394-260X); Ashkpour, Ashkan; Reynaert, Martin
Publisher DataverseNL
Contributor Stapel, Rombert
Publication Year 2026
Rights CC0-1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess true
Contact Stapel, Rombert (International Institute of Social History)
Representation
Resource Type Dataset
Format text/plain; application/xml
Size 2604938; 2693614; 12626193; 2696591
Version 1.0
Discipline Humanities
Spatial Coverage (-135.000W, -90.000S, 20.000E, 90.000N); Boston