WMT16 APE Shared Task Data


Training, development and test data (the same used for the Sentence-level Quality Estimation task) consist of English-German triplets (source, target and post-edit) belonging to the IT domain and already tokenized. The training and development sets contain 12,000 and 1,000 triplets respectively, while the test set contains 2,000 instances. All data is provided by the EU project QT21 (http://www.qt21.eu/).
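
As a rough illustration of how such triplets might be read, the sketch below assumes that each part of a triplet (source, target, post-edit) is stored in its own UTF-8 plain-text file, with one segment per line and the three files aligned line by line. That layout, the `Triplet` type and the `read_triplets` function are assumptions made for this example and are not stated in the record; the paths passed in are placeholders for whatever files the downloaded archive contains. Because the data is described as already tokenized, tokens are recovered here by splitting on whitespace, which is also an assumption.

```python
from itertools import zip_longest
from typing import Iterator, List, NamedTuple


class Triplet(NamedTuple):
    """One APE instance: source sentence, MT output, human post-edit."""
    source: List[str]
    target: List[str]
    post_edit: List[str]


def read_triplets(src_path: str, mt_path: str, pe_path: str) -> Iterator[Triplet]:
    """Yield triplets from three line-aligned files (assumed layout)."""
    with open(src_path, encoding="utf-8") as src, \
         open(mt_path, encoding="utf-8") as mt, \
         open(pe_path, encoding="utf-8") as pe:
        # zip_longest pads the shorter files with None, so a length
        # mismatch is detected instead of being silently truncated.
        for s, t, p in zip_longest(src, mt, pe, fillvalue=None):
            if s is None or t is None or p is None:
                raise ValueError("files do not have the same number of lines")
            # Assumption: tokens are separated by whitespace.
            yield Triplet(s.split(), t.split(), p.split())
```

Under these assumptions, calling `list(read_triplets(...))` on the training files would be expected to give 12,000 items, matching the count stated above.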

Identifier
PID http://hdl.handle.net/11372/LRT-1632
Related Identifier http://www.statmt.org/wmt16/ape-task.html
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11372/LRT-1632
Provenance
Creator Turchi, Marco; Chatterjee, Rajen; Negri, Matteo
Publisher Fondazione Bruno Kessler, Trento, Italy
Publication Year 2016
Funding Reference info:eu-repo/grantAgreement/EC/H2020/645452
Rights AGREEMENT ON THE USE OF DATA IN QT21 APE Task; https://lindat.mff.cuni.cz/repository/xmlui/page/licence-TAUS_QT21; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language English; German
Resource Type corpus
Format application/zip; text/plain; charset=utf-8; downloadable_files_count: 3
Discipline Linguistics