Frequency list of words from the Trendi corpus 2019

PID

This frequency list of words was prepared by extracting words (i.e. lemmas with their lexical features) from the Trendi Monitor Corpus of Slovene (see e.g. http://hdl.handle.net/11356/1590) covering the period between 1 January 2019 and 31 December 2019 using the LIST corpus extraction tool (http://hdl.handle.net/11356/1227). The Trendi frequency list was then compared to the frequency list of words from the Gigafida 2.0 corpus of Slovene, which covers the period between 1991 and 2018. The words were compared using the simple maths formula implemented by SketchEngine (see https://www.sketchengine.eu/documentation/simple-maths/).

The final list contains lemmas, their lexical features, their absolute and relative frequencies from the first (1991–2018) and second periods (2019), and the simple maths value indicating if the word is more frequent in 2019 (simple maths > 1.00) or in 1991–2018 (simple maths < 1.00).

Identifier
PID http://hdl.handle.net/11356/1701
Related Identifier http://hdl.handle.net/11356/1705
Related Identifier https://sled.ijs.si/
Metadata Access http://www.clarin.si/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:www.clarin.si:11356/1701
Provenance
Creator Čibej, Jaka; Kosem, Iztok
Publisher Jožef Stefan Institute
Publication Year 2022
Rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0); https://creativecommons.org/licenses/by-sa/4.0/; PUB
OpenAccess true
Contact info(at)clarin.si
Representation
Language Slovenian; Slovene
Resource Type lexicalConceptualResource
Format text/plain; charset=utf-8; application/zip; downloadable_files_count: 1
Discipline Linguistics