-
Pan-Latin Photovoltaic Systems Lexicon
The Pan-Latin Photovoltaic Systems Lexicon (Lessico panlatino dei sistemi fotovoltaici), developed within the Realiter network, contains the basic terms related to photovoltaic... -
Pan-Latin Geothermal Energy Lexicon
The Pan-Latin Geothermal Energy Lexicon (Lessico panlatino dell’energia geotermica), developed within the Realiter network, contains the basic terms related to geothermal energy... -
Pan-Latin Smart City Lexicon
The Pan-Latin Smart City Lexicon (Lessico panlatino della Smart City), developed within the Realiter network, contains the basic terms related to the Smart City concept in seven... -
Word combination lexicons
Ten lexicons comprising several hundred word combinations exhibiting various stages of lexicalisation (from free word combinations to fixed idioms) manually annotated by many... -
Bilingual terminology extraction dataset KAS-biterm 1.0
The KAS-biterm bilingual term extraction dataset contains complete sentences selected from PhD theses from the KAS corpus of Slovene academic writing. Only sentences that have a... -
TermFrame: Terms, definitions and semantic annotations for karstology
The resource comprises several datasets of domain-specific data in three languages (English, Slovenian and Croatian), which can be used for various knowledge extraction or... -
English-Slovene term candidates KAS-biterm 1.0
KAS-biterm is an automatically generated glossary of English terms with their translations into Slovene. The pairs, possibly with their English and Slovene acronyms, were... -
English-Slovenian chess terminology database 1.0
The English-Slovenian chess terminology database (termbase) is a bilingual database containing 82 English and 109 Slovenian chess terms, which make up 77 entries. The terms are... -
Slovenian-English glossary of education
The glossary, which contains 1028 dictionary entries, is a terminological collection of Slovenian terms with English equivalents in the field of education. The collection was... -
Geographical Terminological Dictionary
The Geographical Terminological Dictionary contains 8,922 geographical terms and covers a total of nineteen geographical disciplines. It provides detailed definitions of terms,... -
Corpus of term-annotated texts RSDO5 1.1
The RSDO5 corpus was compiled in order to serve as a training set for automatic term identification. It consists of 12 texts with 250,000 words and almost 38,000 manually... -
Corpus of academic Slovene KAS 1.0
The KAS corpus of Slovene academic writing consists of almost 65,000 BSc/BA, 16,000 MSc/MA and 1,600 PhD theses (82 thousand texts, 5 million pages or 1.7 billion tokens)... -
Terminology identification dataset KAS-term 1.0
The dataset contains 22,950 term candidates extracted from 15 Slovenian PhD theses. The term candidates are of length 1 to 4, extracted via morphosyntactic patterns and the... -
Lexical database of Slovene PR terminology TERMIS
TERMIS is a terminology database with 2,000 entries from the field of public relations. The terms in Slovene are explained and translated into English, with typical context... -
Glossary for academic integrity
The Glossary for academic integrity comprises 212 terms related to academic integrity presented in alphabetical order. Each term is accompanied by a definition, and if relevant,... -
Corpus of Academic Slovene (PhD theses) KAS-dr 1.0
The KAS-dr corpus of Slovene PhD theses consists of almost 1,600 texts (266 thousand pages or 100 million tokens) written 2000–2018 and gathered from the digital libraries of... -
DigiLing e-Learning Hub: e-Courses for Digital Linguistics
The files represent exported e-learning resources created within the DigiLing project, www.digiling.eu. We have identified seven core subjects in Digital Linguistics and built... -
Corpus of Academic Slovene (MSc/MA theses) KAS-mag 1.0
The KAS-mag corpus of Slovene MSc/MA theses consists of almost 16,000 texts (1,360 thousand pages or 500 million tokens) written 2000–2018 and gathered from the digital... -
Corpus of term-annotated texts RSDO5 1.0
The RSDO5 corpus was compiled in order to serve as a training set for automatic term identification. It consists of 12 texts with 250,000 words and almost 38,000 manually... -
Corpus of Informatics DSI 5.0
The DSI corpus is intended as a terminological resource for the field of informatics, especially for the development of the online terminological dictionary of informatics, Islovar...