WMT17 En-De APE Shared Task Data

PID

Training data for the WMT 2017 Automatic post-editing task (the same used for the Sentence-level Quality Estimation task). They consist in 11,000 English-German triplets (source, target and post-edit) belonging to the IT domain and already tokenized. All data is provided by the EU project QT21 (http://www.qt21.eu/).

Identifier
PID http://hdl.handle.net/11234/1-1966
Related Identifier http://www.statmt.org/wmt17/ape-task.html
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11234/1-1966
Provenance
Creator Turchi, Marco; Chatterjee, Rajen; Negri, Matteo
Publisher Fondazione Bruno Kessler, Trento, Italy
Publication Year 2017
Funding Reference info:eu-repo/grantAgreement/EC/H2020/645452
Rights AGREEMENT ON THE USE OF DATA IN QT21 APE Task; https://lindat.mff.cuni.cz/repository/xmlui/page/licence-TAUS_QT21; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language English; German
Resource Type corpus
Format text/plain; charset=utf-8; application/zip; downloadable_files_count: 1
Discipline Linguistics