Liner2.5 model Minos
A model for Liner2.5 to recognize verbs without an explicit subject. -
Lithuanian Coreference Corpus
The corpus consists of 100 articles from news portals focusing on political news, as such texts are rich in quotations and named entity... -
KPWr annotation guidelines - coreference
Coreference annotation guidelines describing the process of manually annotating documents in the Polish Corpus of Wrocław University of Technology (KPWr) -
Prague Czech-English Dependency Treebank 2.0 - Russian translation
Prague Czech-English Dependency Treebank - Russian translation (PCEDT-R) is a project of translating a subset of Prague Czech-English Dependency Treebank 2.0 (PCEDT 2.0) to... -
Coreference in Universal Dependencies 1.0 (CorefUD 1.0)
CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version... -
DiscoMT 2017 Shared Task on Cross-lingual Pronoun Prediction
Data used in the 2017 shared task on cross-lingual pronoun prediction. -
PAWS
PAWS is a multi-lingual parallel treebank with coreference annotation. It consists of English texts from the Wall Street Journal translated into Czech, Russian and Polish. In... -
Prague Dependency Treebank 3.5
The Prague Dependency Treebank 3.5 is the 2018 edition of the core Prague Dependency Treebank (PDT). It contains all PDT annotation made at the Institute of Formal and Applied... -
Prague Czech-English Dependency Treebank 2.0 Coref
The Prague Czech-English Dependency Treebank 2.0 Coref (PCEDT 2.0 Coref) is a parallel treebank building upon the original PCEDT 2.0 release and enriching it with the extended... -
DiscoMT 2016 Shared Task on Cross-lingual Pronoun Prediction
Files for the DiscoMT 2016 shared task on cross-lingual pronoun prediction -
Prague Dependency Treebank - Consolidated 1.0 (PDT-C 1.0)
A richly annotated and genre-diversified language resource, The Prague Dependency Treebank – Consolidated 1.0 (PDT-C 1.0, or PDT-C in short in the sequel) is a consolidated... -
Prague Discourse Treebank 2.0
PDiT 2.0 is a new version of the Prague Discourse Treebank. It contains a complex annotation of discourse phenomena enriched by the annotation of secondary connectives. -
Coreference in Universal Dependencies 1.1 (CorefUD 1.1)
CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version... -
Prague Dependency Treebank of Spoken Czech 2.0 (PDTSC 2.0)
The Prague Dependency Treebank of Spoken Czech 2.0 (PDTSC 2.0) is a corpus of spoken language, consisting of 742,316 tokens and 73,835 sentences, representing 7,324 minutes... -
Coreference in Universal Dependencies 0.2 (CorefUD 0.2)
CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version... -
Coreference in Universal Dependencies 0.1 (CorefUD 0.1)
CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version... -
ParCorFull: A Parallel Corpus Annotated with Full Coreference
ParCorFull is a parallel corpus annotated with full coreference chains that has been created to address an important problem that machine translation and other multilingual...