Lemma list of the SYN-series corpora (ELEXIS)

PID

Lemma list derived from the representative synchronic written corpora of the SYN series. The format is quite straightforward, it is a simple tsv file with the columns in the following order: lemma POS SYN2000 SYN2005 SYN2010 SYN2015

where every corpus is in fact represented by two columns, with frequency and i.p.m., so the total number of columns in the file is 10. The lemma list is filtered and includes only alphabetical lemmas with non-zero frequency in all four corpora.

Identifier
PID http://hdl.handle.net/11356/1554
Related Identifier http://www.lrec-conf.org/proceedings/lrec2014/pdf/294_Paper.pdf
Related Identifier https://wiki.korpus.cz/doku.php/en:cnk:syn
Metadata Access http://www.clarin.si/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:www.clarin.si:11356/1554
Provenance
Creator Křen, Michal
Publisher Institute of the Czech National Corpus
Publication Year 2020
OpenAccess true
Contact info(at)clarin.si
Representation
Language Czech
Resource Type lexicalConceptualResource
Format downloadable_files_count: 0
Discipline Linguistics