HamleDT 3.0

PID

HamleDT (HArmonized Multi-LanguagE Dependency Treebank) is a compilation of existing dependency treebanks (or dependency conversions of other treebanks), transformed so that they all conform to the same annotation style. This version uses Universal Dependencies as the common annotation style.

Update (November 1017): for a current collection of harmonized dependency treebanks, we recommend using the Universal Dependencies (UD). All of the corpora that are distributed in HamleDT in full are also part of the UD project; only some corpora from the Patch group (where HamleDT provides only the harmonizing scripts but not the full corpus data) are available in HamleDT but not in UD.

Identifier
PID http://hdl.handle.net/11234/1-1508
Related Identifier http://hdl.handle.net/11858/00-097C-0000-0023-9551-4
Related Identifier http://ufal.mff.cuni.cz/hamledt
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11234/1-1508
Provenance
Creator Zeman, Daniel; Mareček, David; Mašek, Jan; Popel, Martin; Ramasamy, Loganathan; Rosa, Rudolf; Štěpánek, Jan; Žabokrtský, Zdeněk
Publisher Charles University
Publication Year 2015
Rights HamleDT 3.0 License Terms; https://lindat.mff.cuni.cz/repository/xmlui/page/licence-hamledt-3.0; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language Arabic; Basque; Bengali; Bangla; Bulgarian; Catalan; Valencian; Croatian; Czech; Danish; Dutch; Flemish; English; Estonian; Finnish; French; German; Greek, Modern (1453-); Greek; Greek, Ancient (to 1453); Hebrew; Hindi; Hungarian; Indonesian; Irish; Italian; Japanese; Latin; Persian; Farsi; Polish; Portuguese; Romanian; Moldavian; Moldovan; Russian; Slovak; Slovenian; Slovene; Spanish; Castilian; Swedish; Tamil; Telugu; Turkish
Resource Type corpus
Format application/x-tar; text/plain; charset=utf-8; downloadable_files_count: 1
Discipline Linguistics