Dataset - B2FIND

Database for The Power of Networks and the Networks of Power PhD Thesis, Meli...

This dataset of correspondence, gift exchange, and appointment to office was created in 2020-2024 to form the basis of a social network analysis of the life and reign of Mary I...

Scrambled text: training Language Models to correct OCR errors using syntheti...

This data repository contains the key datasets required to reproduce the paper "Scrambled text: training Language Models to correct OCR errors using synthetic data". In addition...

NCSE v2.0: A Dataset of OCR-Processed 19th Century English Newspapers

NCSE v2.0 Dataset Repository. This repository contains the NCSE v2.0 dataset and associated supporting data used in the paper "Reading the unreadable: Creating a dataset of 19th...

3 datasets found