Morphological lexicon Sloleks 1.2

Sloleks is the reference morphological lexicon for the Slovenian language, developed for use in NLP applications and language manuals. Encoded in LMF XML, the lexicon contains approximately 100,000 of the most frequent Slovenian lemmas, their inflected or derived word forms, and the corresponding grammatical descriptions. Lemmatization rules, part-of-speech categorization and the set of feature-value pairs follow the JOS morphosyntactic specifications. In addition to grammatical information, each word form is annotated with its absolute corpus frequency and its compliance with the reference language standard.
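
As a rough illustration of how an LMF-style lexicon of this kind might be read, the sketch below parses a small inline XML fragment using only the Python standard library. The element names (LexicalEntry, Lemma, WordForm, FormRepresentation, feat) follow general LMF conventions, but the attribute names (written_form, msd, frequency, part_of_speech), the sample entry and the frequency values are assumptions made up for the example and are not taken from the actual Sloleks files; check the distributed XML for the real names before adapting it.

```python
import xml.etree.ElementTree as ET

# Hypothetical LMF-style fragment. Attribute names and all values
# (including the frequencies) are placeholders, not real Sloleks data.
SAMPLE = """\
<Lexicon>
  <LexicalEntry>
    <feat att="part_of_speech" val="noun"/>
    <Lemma>
      <feat att="written_form" val="miza"/>
    </Lemma>
    <WordForm>
      <feat att="msd" val="Ncfsn"/>
      <FormRepresentation>
        <feat att="written_form" val="miza"/>
        <feat att="frequency" val="100"/>
      </FormRepresentation>
    </WordForm>
    <WordForm>
      <feat att="msd" val="Ncfsg"/>
      <FormRepresentation>
        <feat att="written_form" val="mize"/>
        <feat att="frequency" val="40"/>
      </FormRepresentation>
    </WordForm>
  </LexicalEntry>
</Lexicon>
"""


def feats(elem):
    """Collect the direct <feat att="..." val="..."/> children of elem into a dict."""
    if elem is None:
        return {}
    return {f.get("att"): f.get("val") for f in elem.findall("feat")}


def read_entries(xml_text):
    """Return a list of dicts, one per LexicalEntry, with its lemma and word forms."""
    root = ET.fromstring(xml_text)
    entries = []
    for entry in root.iter("LexicalEntry"):
        forms = []
        for word_form in entry.findall("WordForm"):
            msd = feats(word_form).get("msd")
            for rep in word_form.findall("FormRepresentation"):
                rep_feats = feats(rep)
                forms.append({
                    "form": rep_feats.get("written_form"),
                    "msd": msd,
                    "frequency": int(rep_feats.get("frequency", "0")),
                })
        entries.append({
            "lemma": feats(entry.find("Lemma")).get("written_form"),
            "pos": feats(entry).get("part_of_speech"),
            "forms": forms,
        })
    return entries


if __name__ == "__main__":
    for e in read_entries(SAMPLE):
        print(e["lemma"], e["pos"], [(f["form"], f["msd"]) for f in e["forms"]])
```

For a lexicon with around 100,000 entries, a streaming parser such as `xml.etree.ElementTree.iterparse` would likely be preferable to loading the whole file into memory at once.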

Note that this entry updates Sloleks 1.0 by fixing various encoding and content errors.

The resource is further described in:

Kaja Dobrovoljc, Simon Krek and Tomaž Erjavec, 2017: The Sloleks Morphological Lexicon and its Future Development. In: Vojko Gorjanc, Polona Gantar, Iztok Kosem and Simon Krek (eds.): Dictionary of Modern Slovene: Problems and Solutions. Ljubljana University Press, Faculty of Arts. https://ebooks.uni-lj.si/ZalozbaUL/catalog/view/2/1/47

Identifier
PID http://hdl.handle.net/11356/1039
Related Identifier https://ebooks.uni-lj.si/ZalozbaUL/catalog/view/2/1/47
Related Identifier http://hdl.handle.net/11356/1230
Related Identifier http://hdl.handle.net/11356/1033
Related Identifier http://eng.slovenscina.eu/sloleks/opis
Metadata Access http://www.clarin.si/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:www.clarin.si:11356/1039
Provenance
Creator Dobrovoljc, Kaja; Krek, Simon; Holozan, Peter; Erjavec, Tomaž; Romih, Miro
Publisher Centre for Language Resources and Technologies, University of Ljubljana
Publication Year 2015
Rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0); PUB; https://creativecommons.org/licenses/by-nc-sa/4.0/
OpenAccess true
Contact info(at)clarin.si
Representation
Language Slovenian; Slovene
Resource Type lexicalConceptualResource
Format application/zip; text/plain; charset=utf-8; downloadable_files_count: 5
Discipline Linguistics
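
The Metadata Access URL listed in this record has the shape of an OAI-PMH GetRecord request (verb, metadataPrefix and identifier parameters). The sketch below only shows how such a URL can be assembled from its parts; the function name `oai_get_record_url` is a hypothetical helper, and the code does not contact the repository or assume anything about the response.

```python
from urllib.parse import urlencode

# Base endpoint copied from the Metadata Access field of this record.
BASE = "http://www.clarin.si/repository/oai/request"


def oai_get_record_url(identifier, metadata_prefix="oai_dc", base=BASE):
    """Build an OAI-PMH GetRecord URL (hypothetical helper, no network access)."""
    query = urlencode(
        {
            "verb": "GetRecord",
            "metadataPrefix": metadata_prefix,
            "identifier": identifier,
        },
        safe=":/",  # keep ':' and '/' unescaped, as in the URL given in the record
    )
    return f"{base}?{query}"


if __name__ == "__main__":
    print(oai_get_record_url("oai:www.clarin.si:11356/1039"))
```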