39 datasets found

Creator: Mírovský, Jiří

  • Uniform Meaning Representation 2.1 (Czech and Latin)

    Czech and Latin UMR data, both manually annotated and programmatically converted from manually annotated tectogrammatical data.
  • Prague Dependency Treebank of Spoken Czech 2.0 (PDTSC 2.0)

    The Prague Dependency Treebank of Spoken Czech 2.0 (PDTSC 2.0) is a corpus of spoken language, consisting of 742,316 tokens and 73,835 sentences, representing 7,324 minutes...
  • Prague Discourse Treebank 3.0

    The Prague Discourse Treebank 3.0 (PDiT 3.0) is a new version of annotation of discourse relations marked by primary and secondary discourse connectives in the data of the...
  • Prague Discourse Treebank 2.0

    PDiT 2.0 is a new version of the Prague Discourse Treebank. It contains a complex annotation of discourse phenomena enriched by the annotation of secondary connectives.
  • EVALD 3.0 – Evaluator of Discourse

    EVALD 3.0 serves for automatic evaluation of surface coherence (cohesion) in Czech texts written by native speakers of Czech.
  • Lexicon of Czech and German Anaphoric Connectives

    GeCzLex 1.0 is an online electronic resource for translation equivalents of Czech and German discourse connectives. It contains anaphoric connectives for both languages and...
  • SiR 1.0

    SiR 1.0 is a corpus of Czech articles published on iRozhlas, a news server of the Czech public radio (https://www.irozhlas.cz/). It is a collection of 1,718 articles (42,890...
  • EVALD 1.0

    EVALD 1.0 serves for automatic evaluation of surface coherence (cohesion) in Czech texts written by native speakers of Czech.
  • EVALD 1.0 for Foreigners

    EVALD 1.0 for Foreigners is software for automatic evaluation of surface coherence (cohesion) in Czech texts written by non-native speakers of Czech.
  • EVALD 4.0 – Evaluator of Discourse

    EVALD 4.0 serves for automatic evaluation of surface coherence (cohesion) in Czech texts written by native speakers of Czech.
  • Extended Textual Coreference and Bridging Relations in PDT 2.0

    Annotation of extended textual coreference and bridging relations in the Prague Dependency Treebank 2.0.
  • EVALD 3.0 for Foreigners – Evaluator of Discourse

    EVALD 3.0 for Foreigners is software for automatic evaluation of surface coherence (cohesion) in Czech texts written by non-native speakers of Czech.
  • Preamble 1.0

    Preamble 1.0 is a multilingual annotated corpus of the preamble of the EU REGULATION 2020/2092 OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL. The corpus consists of four...
  • Enriched Discourse Annotation of PDiT Subset 1.0 (PDiT-EDA 1.0)

    Enriched discourse annotation of a subset of the Prague Discourse Treebank, adding implicit relations, entity based relations, question-answer relations and other discourse...
  • NomVallex 2.5

    NomVallex is a manually annotated valency lexicon of Czech nouns and adjectives, adopting the theoretical framework of Functional Generative Description as its theoretical...
  • KUK 1.0

    KUK 1.0 is a corpus of Czech legal and administrative texts accompanied by extensive metadata information for automatic assessment of accessibility (comprehensibility or...
  • Czech RST Discourse Treebank 1.0

    The Czech RST Discourse Treebank 1.0 (CzRST-DT 1.0) is a dataset of 54 Czech journalistic texts manually annotated using the Rhetorical Structure Theory (RST). Each text...
  • CzeDLex 0.7

    CzeDLex 0.7 is the third development version of the Lexicon of Czech discourse connectives. The lexicon contains connectives partially automatically extracted from the Prague...
  • Netgraph

    Netgraph is a graphically oriented client-server application for searching in linguistically annotated treebanks. The query language of Netgraph is simple and intuitive, yet...
  • CzeDLex 0.5

    CzeDLex 0.5 is a pilot version of a lexicon of Czech discourse connectives. The lexicon contains connectives partially automatically extracted from the Prague Discourse Treebank...
You can also access this registry using the API (see API Docs).