-
INEL Enets Corpus
Corpus Citation Shluinsky, Andrey; Khanina, Olesya; Wagner-Nagy, Beáta. 2024. INEL Enets Corpus. Version 1.0. Publication date 2024-11-30.... -
tweeDe
A German UD Twitter treebank, with >12,000 tokens from 519 tweets, annotated in the Universal Dependencies framework -
Swedish Academy Wordlist - SAOL (ELEXIS)
Svenska Akademiens ordlista. This is the standard wordlist for spelling and inflection for modern Swedish. Edition 14 (2015). -
Dictionary of Lesser Used Slovenian Words (ELEXIS)
Besedišče slovenskega jezika z oblikoslovnimi podatki (po gradivu za slovar sodobnega knjižnega jezika zbrane besede, ki niso bile sprejete v Slovar slovenskega knjižnega... -
Lemma list of the SYN-series corpora (ELEXIS)
Lemma list derived from the representative synchronic written corpora of the SYN series. The format is quite straightforward, it is a simple tsv file with the columns in the... -
Lemma list of the Dictionary of the Danish Language - ODS (ELEXIS)
Ordbog over det Danske Sprog (ODS), lemma list. Contents and format: This list contains the headwords of the online version of ODS (and ODS-S) (ordnet.dk/ods). ODS describes the... -
Swedish Academy Dictionary - SAOB (ELEXIS)
Svenska Akademiens ordbok. The Swedish Academy Dictionary, a historical Dictionary. The first part published in 1883, and in 2019 Words up to and including "VÄVNAD" were... -
Lemma list of the Danish Dictionary - DDO (ELEXIS)
Den Danske Ordbog (DDO), Lemma list. Contents and format: This list contains the headwords of the online version of DDO (ordnet.dk/ddo). DDO describes Danish lemmas from 1950... -
Lemma list of the Beseda Corpus Lemmatisation Lexicon (ELEXIS)
Lematizacijski slovar (leksikon besednih oblik za Besedo). Beseda Corpus Lemmatisation Lexicon for Slovenian language was generated at the Fran Ramovš Institute of Slovenian... -
Croatian Language Resources for NooJ (ELEXIS)
Croatian Language Resources for NooJ is a set of files holding a list of Croatian words marked for POS, and depending on the POS with type, form, gender, category, case, number,... -
Lemma list of the Dictionary of Contemporary Portuguese - DLPC (ELEXIS)
Dicionário da Língua Portuguesa Contemporânea (DLPC) is a monolingual Portuguese dictionary published by Academia das Ciências de Lisboa (2001). This dictionary also represents... -
Lemma list of the German Dictionary elexiko (ELEXIS)
elexiko is an online information system ("dictionary") on contemporary German language (mainly post World War II), which documents, explains and scientifically comments on the... -
Deltacorpus
Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger... -
The Diorisis Ancient Greek Corpus
An annotated corpus of literary Ancient Greek sourced from the Perseus Canonical Greek Lit repository (https://github.com/PerseusDL/canonical-greekLit), “The Little Sailing”... -
Lingua::Interset 2.026
Lingua::Interset is a universal morphosyntactic feature set to which all tagsets of all corpora/languages can be mapped. Version 2.026 covers 37 different tagsets of 21... -
Deltacorpus 1.1
Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger...