-
The corpus of older Slovenian narrative prose PriLit 1.0
The PriLit corpus contains 37 texts of older Slovenian narrative prose by 12 authors. One text, Sreča v nesreči (Fortune in Misfortune) by Janez Cigler (first published in... -
Corpus of Slovenian periodicals (1771-1914) sPeriodika 1.0
The corpus of Slovenian periodicals sPeriodika contains linguistically annotated periodicals published during the 18th, 19th, and early 20th centuries (1771-1914). The... -
Reference corpus of historical Slovene goo300k 1.2
goo300k is a manually annotated reference corpus of historical Slovene. It contains 1,100 pages (about 300,000 tokens) sampled from 89 texts from the period 1584-1899. Each text... -
Corpus of texts by Hijacint Repič in "Cvetje z vertov sv. Frančiška" CVET 1.0
The CVET corpus contains 230 texts (around 175 thousand words) of varying length, published in the religious journal "Cvetje z vertov sv. Frančiška" between 1887 and 1916, when... -
Dataset of normalised Slovene text KonvNormSl 1.0
Data used in the experiments described in: Nikola Ljubešić, Katja Zupan, Darja Fišer and Tomaž Erjavec: Normalising Slovene data: historical texts vs. user-generated content.... -
Concordances of Primož Trubar's "Ta evangeli sv. Matevža" (1555)
The 23,603 concordances represent a transcription of the book "Ta evangeli sv. Matevža" (1555) by Primož Trubar. -
Dictionary of the Slovenian Language in the Works of Janez Svetokriški
The Dictionary of the Slovenian Language in the Works of Janez Svetokriški (Slovar jezika Janeza Svetokriškega) presents and explains the lexis, including proper nouns, from 233... -
Lexicon of historical Slovene imp25k 1.1
The imp25k lexicon of historical Slovene was created automatically from the goo300k and foo3M annotated corpora and contains attested and manually verified word forms and their... -
IMP corpus n-grams 1.0
This is a collection of n-grams extracted from the IMP corpus of historical Slovene (http://hdl.handle.net/11356/1031). In addition to the separate lists of n-grams for tokens... -
Digital library and corpus of historical Slovene IMP 1.1
The IMP digital library contains historical Slovene books and other publications: 658 texts in total, with over 45,000 pages, from the period 1584-1919. Each text contains... -
Slovenian-German Dictionary of Maks Pleteršnik (1894-1895)
The Slovenian-German Dictionary of Maks Pleteršnik was first published in 1894-1895. It contains 103,185 dictionary entries. Besides standard and dialect lexis of the 19th... -
Words of the 16th-Century Slovenian Literary Language
This dictionary provides comprehensive information on the vocabulary used in the Slovenian literary language during the period of the Reformation. It was written based on...