-
Inflectional lexicon srLex 1.0
hrLex is an large inflectional lexicon of Serbian language where each entry consists of a (wordform, lemma, MSD) triple. The MSD tagset follows the revised MULTEXT-East V4... -
Beseda Corpus Lemmatisation Lexicon
Beseda Corpus Lemmatisation Lexicon for Slovenian language was generated at the Fran Ramovš Institute of Slovenian Language, primarily through inflection of open class words... -
Morphological lexicon Sloleks 1.0
Sloleks is the reference morphological lexicon for Slovenian language, developed to be used in NLP applications and language manuals. Encoded in LMF XML, the lexicon contains... -
Corpus extraction tool LIST 1.3
The LIST corpus extraction tool is a Java program for extracting lists from text corpora on the levels of characters, word parts, words, and word sets. It supports VERT and TEI... -
Morphological patterns from the Sloleks 2.0 lexicon 1.0
This entry consists of XML files with 96,290 lexical units (nouns, verbs, adjectives, and adverbs) from the Sloleks Morphological Lexicon of Slovene 2.0... -
Inflectional lexicon hrLex 1.1
hrLex is a large inflectional lexicon of Croatian language where each entry consists of a (wordform, lemma, MSD, frequency, per-million frequency) 5-tuple. The (wordform, lemma,... -
Macedonian linguistic training corpus SETimes.MK 0.1
The SETimes.MK corpus is a sample of 570 sentences from the now unavailable setimes.com website of news articles on topics of South-Eastern Europe. The sentences were manually... -
Inflectional lexicon hrLex 1.2
hrLex is a large inflectional lexicon of Croatian language where each entry consists of a (wordform, lemma, MSD, frequency, per-million frequency) 5-tuple. The (wordform, lemma,... -
Irish National Morphology Database (ELEXIS)
Bunachar Gramadaí is a large collection of Irish words which records their inflected forms and linguistic properties. The database contains some 43,000 entries and covers nouns,... -
Inflectional lexicon hrLex 1.3
hrLex is a large inflectional lexicon of Croatian language where each entry consists of a (wordform, lemma, MSD, MSD features, UPOS, morphological features, frequency,... -
Frequency lists of word parts from the GOS 1.0 corpus
Frequency lists of words split into word parts were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle.net/11356/1040) using the LIST corpus extraction tool... -
Corpus extraction tool LIST 1.0
The LIST corpus extraction tool is a Java program for extracting lists from text corpora on the levels of characters, word parts, words, and word sets. It supports VERT and TEI... -
Frequency lists of word parts from the Gigafida 2.0 corpus
Frequency lists of words split into word parts were extracted from the Gigafida 2.0 Corpus of Written Standard Slovene (https://viri.cjvt.si/gigafida/) using the LIST corpus... -
ILSP Conceptual Dictionary of Modern Greek (ELEXIS)
ConceptNet-el (Εννοιολογικό Λεξικό της Νέας Ελληνικής ΙΕΛ). ConceptNet-el is a conceptual dictionary of Modern Greek that assumes the form of a linguistic ontology. It... -
Inflectional lexicon srLex 1.1
srLex is a large inflectional lexicon of Serbian language where each entry consists of a (wordform, lemma, MSD, frequency, per-million frequency) 5-tuple. The (wordform, lemma,... -
Dataset of Slovene word formation trees ArboSloleks 1.0
ArboSloleks is a dataset containing Slovene word formation trees that have been automatically constructed from word relations (http://hdl.handle.net/11356/1986) extracted from... -
Corpus extraction tool LIST 1.2
The LIST corpus extraction tool is a Java program for extracting lists from text corpora on the levels of characters, word parts, words, and word sets. It supports VERT and TEI... -
Lithuanian morphologically annotated corpus - MATAS v3.0
MATAS corpus (version 3.0) DESCRIPTION Updated, manually checked, morphologically annotated corpus MATAS LANGUAGE Lithuanian PREVIOUS VERSIONS 1. MATAS v0.2... -
Size measurements of cryogenic gypsum at Ice Station PS106_32-2, under sea ic...
This dataset has no description
-
Size measurements of cryogenic gypsum at Ice Station PS106_45-1, under sea ic...
This dataset has no description
