-
Prague Dependency Treebank - Consolidated 2.0 (PDT-C 2.0)
A manually annotated and genre-diversified language resource with rich linguistic information from morphology and syntax to semantics, the Prague Dependency Treebank –... -
Prague Dependency Treebank - Consolidated 1.0 (PDT-C 1.0)
A richly annotated and genre-diversified language resource, the Prague Dependency Treebank – Consolidated 1.0 (PDT-C 1.0, hereafter PDT-C) is a consolidated... -
WUT Relations Between Sentences Corpus
WUT Relations Between Sentences Corpus contains 2827 pairs of related sentences. Relationships are derived from Cross-document Structure Theory (CST), which enables... -
Slovene corpus for general relation extraction SloREL 1.1
The SloREL corpus contains annotations for training relation extraction models on Slovene documents. It contains documents from Slovene Wikipedia with annotated entities and... -
Slovene corpus for general relation extraction SloREL 1.0
The SloREL corpus contains annotations for training relation extraction models on Slovene documents. It contains documents from Slovene Wikipedia with annotated entities and... -
ILSP Conceptual Dictionary of Modern Greek (ELEXIS)
ConceptNet-el (Εννοιολογικό Λεξικό της Νέας Ελληνικής ΙΕΛ). ConceptNet-el is a conceptual dictionary of Modern Greek that assumes the form of a linguistic ontology. It... -
TermFrame: Terms, definitions and semantic annotations for karstology
The resource comprises several datasets of domain-specific data in three languages (English, Slovenian and Croatian), which can be used for various knowledge extraction or... -
Czech Legal Text Treebank 2.0
The Czech Legal Text Treebank 2.0 (CLTT 2.0) annotates the same texts as CLTT 1.0. These texts come from the legal domain and are manually annotated for syntax. The... -
COSTRA 1.0: A Dataset of Complex Sentence Transformations
COSTRA 1.0 is a dataset of Czech complex sentence transformations. The dataset is intended for the study of sentence-level embeddings beyond simple word alternations or standard... -
Prague Dependency Treebank 3.5
The Prague Dependency Treebank 3.5 is the 2018 edition of the core Prague Dependency Treebank (PDT). It contains all PDT annotation made at the Institute of Formal and Applied...