70 datasets found

Keywords: treebank

Filter Results
  • Universal Dependencies 1.1

    Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual...
  • IWPT 2020 Shared Task Data and System Outputs

    This package contains data used in the IWPT 2020 shared task. It contains training, development and test (evaluation) datasets. The data is based on a subset of Universal...
  • Czech Legal Text Treebank 2.0

    The Czech Legal Text Treebank 2.0 (CLTT 2.0) annotates the same texts as the CLTT 1.0. These texts come from the legal domain and they are manually syntactically annotated. The...
  • Poliqarp2

    Poliqarp2 is a linguistic search engine, capable of searching through large corpora annotated on multiple levels. It is not an upgraded version of Poliqarp, it is a...
  • POLFIE Bank, an LFG structure bank of Polish: pol-składnica-pargram

    The pol-składnica-pargram structure bank was created using POLFIE: an LFG grammar of Polish. This structure bank contains FULL type sentences from Składnica, which were in turn...
  • POLFIE Bank, an LFG structure bank of Polish: pol-nkjp1m-pargram-dev

    The pol-nkjp1m-pargram-dev structure bank was created using POLFIE: an LFG grammar of Polish. This structure bank contains sentences from the NKJP1M subcorpus of NKJP which were...
  • Składnica frazowa — a constituency treebank of Polish

    Składnica frazowa is a constituency treebank of Polish. The treebank is a result of parsing Polish sentences with the syntactic parser Świgra. For every sentence, the parser...
  • Lithuanian Treebank ALKSNIS

    ALKSNIS v2.1 ALKSNIS v2.1 consists of 2,355 syntactically annotated sentences in the PML (Prague Mark-up Language) format. The format allows researchers to visualise and edit...
  • Lithuanian Treebank ALKSNIS (2019-10-24)

    ALKSNIS v3.0. ALKSNIS v3,0 consists of 3,643 syntactically annotated sentences in the PML (Prague Mark-up Language) format. The format allows researchers to visualise and edit...
  • Prague Dependency Treebank 2.0 Sample Data

    This is a small sample dataset from PDT 2.0. As such it can be released under a very permissive CC-BY license.
You can also access this registry using the API (see API Docs).