- Database for The Power of Networks and the Networks of Power PhD Thesis, Meli...
  This dataset of correspondence, gift exchange, and appointment to office was created in 2020-2024 to form the basis of a social network analysis of the life and reign of Mary I...
- Scrambled text: training Language Models to correct OCR errors using syntheti...
  This data repository contains the key datasets required to reproduce the paper "Scrambled text: training Language Models to correct OCR errors using synthetic data". In addition...
- NCSE v2.0: A Dataset of OCR-Processed 19th Century English Newspapers
  NCSE v2.0 Dataset Repository. This repository contains the NCSE v2.0 dataset and associated supporting data used in the paper "Reading the unreadable: Creating a dataset of 19th...
