-
The corpus of older Slovenian narrative prose PriLit 1.0
The PriLit corpus contains 37 texts of older Slovenian narrative prose by 12 authors. One text, Sreča v nesreči (Fortune in Misfortune) by Janez Cigler (first published in... -
Concordance of Trubar's Gospel of St. Matthew (1555) (ELEXIS)
Konkordance Trubarjevega Evangelija sv. Matevža (1555). The 23603 concordances represent a transcription of the book "Ta evangeli sv. Matevža" (1555) by Primož Trubar. See also:... -
Post-OCR correction training dataset sPeriodika-postOCR
The post-OCR correction dataset consists of paragraphs of text, at least 100 characters in length, extracted from documents randomly sampled from the sPeriodika dataset...