Gold Standard Reference Data for Multiword Expression Extraction: Czech Dependency Bigrams from the Prague Dependency Treebank

An annotated list of dependency bigrams that occur more than five times in the Prague Dependency Treebank (PDT) and whose part-of-speech patterns can potentially form a collocation. Each bigram was assigned to one of six multiword expression (MWE) categories by three annotators.

Identifier
PID http://hdl.handle.net/11234/1-1457
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11234/1-1457
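
The Metadata Access URL above is an OAI-PMH GetRecord request that asks for the oai_dc (Dublin Core) format. As a minimal sketch, the record could be fetched and read with the Python standard library as shown below. The helper names (parse_dc_fields, fetch_record) and the abbreviated SAMPLE response are illustrative assumptions, not part of the repository's documentation; the sample only mirrors a few values listed in this record, and the real response will contain more fields.

```python
import urllib.request
import xml.etree.ElementTree as ET

# URL copied from the "Metadata Access" line of this record.
OAI_URL = (
    "http://lindat.mff.cuni.cz/repository/oai/request"
    "?verb=GetRecord&metadataPrefix=oai_dc"
    "&identifier=oai:lindat.mff.cuni.cz:11234/1-1457"
)

# Standard Dublin Core element namespace used by the oai_dc format.
DC_NS = "http://purl.org/dc/elements/1.1/"


def parse_dc_fields(xml_data):
    """Collect Dublin Core elements from an OAI-PMH response.

    Accepts str or bytes and returns a dict mapping each element name
    (e.g. "creator") to the list of its text values.
    """
    root = ET.fromstring(xml_data)
    prefix = "{" + DC_NS + "}"
    fields = {}
    for elem in root.iter():
        if elem.tag.startswith(prefix):
            name = elem.tag[len(prefix):]
            fields.setdefault(name, []).append((elem.text or "").strip())
    return fields


def fetch_record(url=OAI_URL, timeout=30):
    """Download the record (needs network access) and parse it."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return parse_dc_fields(resp.read())


# Hand-written, abbreviated response for offline use (an assumption about
# the general shape; values are taken from this record).
SAMPLE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <GetRecord><record><metadata>
    <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
               xmlns:dc="http://purl.org/dc/elements/1.1/">
      <dc:creator>Pecina, Pavel</dc:creator>
      <dc:date>2008</dc:date>
      <dc:identifier>http://hdl.handle.net/11234/1-1457</dc:identifier>
    </oai_dc:dc>
  </metadata></record></GetRecord>
</OAI-PMH>
""".strip()

if __name__ == "__main__":
    for name, values in parse_dc_fields(SAMPLE).items():
        print(name, values)
```

Calling fetch_record() instead of parsing SAMPLE would query the live endpoint; the parsing step is the same in both cases.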
Provenance
Creator Pecina, Pavel
Publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Publication Year 2008
Rights Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0); http://creativecommons.org/licenses/by-nc/3.0/; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language Czech
Resource Type lexicalConceptualResource
Format application/x-gzip; text/plain; charset=utf-8; downloadable_files_count: 1
Discipline Linguistics