Digital library and corpus of historical Slovene IMP 1.1
The IMP digital library contains historical Slovene books and other publications, 658 texts in total with over 45,000 pages, from the period 1584-1919. Each text contains...
Corpus of Slovenian historical legal texts SI-IUS 1.0
The SI-IUS collection of older law texts is meant to be used both as a digital library and as a language corpus. For the former, each text has been carefully annotated in TEI...
The corpus of older Slovenian narrative prose PriLit 1.0
The PriLit corpus contains 37 texts of older Slovenian narrative prose by 12 authors. One text, Sreča v nesreči (Fortune in Misfortune) by Janez Cigler (first published in...
Concordance of Trubar's Gospel of St. Matthew (1555) (ELEXIS)
Konkordance Trubarjevega Evangelija sv. Matevža (1555). The 23,603 concordances represent a transcription of the book "Ta evangeli sv. Matevža" (1555) by Primož Trubar. See also:...
Post-OCR correction training dataset sPeriodika-postOCR
The post-OCR correction dataset consists of paragraphs of text, at least 100 characters in length, extracted from documents randomly sampled from the sPeriodika dataset...
