-
Universal Dependencies 2.6 models for UDPipe 2 (2020-08-31)
Tokenizer, POS Tagger, Lemmatizer and Parser models for 99 treebanks of 63 languages of Universal Dependencies 2.6 Treebanks, created solely using UD 2.6 data... -
Czech Models (MorfFlex CZ 2.0 + PDT-C 1.0) for MorphoDiTa 220710
Czech models for MorphoDiTa, providing morphological analysis, morphological generation and part-of-speech tagging. The morphological dictionary is created from MorfFlex CZ 2.0,... -
Aging effects in an evolving phonological network
Phonological networks are representations of word forms and their phonological relationships with other words in a given language lexicon. A principle underlying the growth (or... -
On-line Dictionary of medieval latin in the Czech lands
The Dictionary of Medieval Latin in the Czech Lands registers and explains the vocabulary of Medieval Latin as used in the Czech lands since the beginnings of Latin writing in... -
HinDialect: 26 Hindi-related languages and dialects of the Indic Continuum in...
HinDialect: 26 Hindi-related languages and dialects of the Indic Continuum in North India. Languages: This is a collection of folksongs for 26 languages that form a dialect... -
Nottinghamer Korpus Deutscher YouTube-Sprache (The NottDeuYTSch Corpus)
The NottDeuYTSch corpus contains over 33 million words taken from approximately 3 million YouTube comments from videos published between 2008 and 2018 targeted at a young,... -
SnakeCLEF 2021
The dataset with 409,679 images belonging to 772 snake species from 188 countries and all continents (386,006 images with labels targeted for development and 23,673 images... -
CoCzeFLA Chroma 1.2.7.0
Transcripts of longitudinal audio recordings of 7 typical monolingual Czech children between the ages of 1;7 and 3;9. Files are in plain text with UTF-8 encoding. Each file represents one... -
Semantic Features and Their Role In Conceptual Representation In School Age C...
Language acquisition is one of the currently much discussed topics in the field of psycholinguistics. Considerable space for future research can be seen in the development of... -
Arabic Phonetic Rules
Description: This XML file describes the Arabic phonetic constraints (rules) resulting from the analysis of the lexicons (Taj Alarous, Al ain, Lisan Al arab, Alwassit and... -
Universal Dependencies 2.10
Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual... -
VALLEX 4.5
VALLEX 4.5 provides information on the valency structure (combinatorial potential) of Czech verbs in their particular senses (almost 4 700 verbs in more than 11 080 lexical... -
Hausa Visual Genome 1.0
Data: Hausa Visual Genome 1.0, a multimodal dataset consisting of text and images suitable for English-to-Hausa multimodal machine translation tasks and multimodal research. We... -
Individual Textual Profiles of Hillary Clinton and Donald Trump
This corpus consists of full transcriptions of both Democratic and Republican 2016 presidential candidate debates, with a special focus on the idiolects of Hillary Clinton and... -
Memorial Day of Heroes at the German Opera in Prague
Segment from Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel) 1942, issue no. 12, captures the Memorial Day of Heroes events held as part of the... -
The German economic societies meeting in Prague
Segment from Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel) 1941, issue no. 52, reports on a meeting of the Southeast European Economic Society... -
Funeral of journalist Karel Lažnovský in Prague
Segment from UFA Praha 1941 no. 2 depicts the enormous funeral of pro-German journalist Karel Lažnovský held in the Small Hall of the Crematorium of the City of Prague in... -
Objects from the Scene of Reinhard Heydrich's Assassination
Segment consisting of footage showing objects from the scene of the assassination of acting Reich Protector Reinhard Heydrich, which was screened in all cinemas throughout the... -
Farmers' Holiday at Jarov Heath Resort
Segment from Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel) 1943, issue no. 11B, reports on a workers' holiday organized by the Reinhard... -
A Gift of an Ambulance Train to the German Army
Segment from Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel) 1942, issue no. 17, captures the presentation of a gift, Ambulance Train no. 751,...