Test Data EN-DE MT_PBSMT APE Shared Task WMT18

Test data for the WMT 2018 Automatic Post-Editing (APE) shared task. The data consists of English-German pairs (source and target) from the information technology domain, already tokenized. The test set contains 2,000 pairs. The target segments were generated by a phrase-based machine translation system. The test set is sampled from the same dataset used for the 2016 and 2017 editions of the APE shared task. All data is provided by the EU project QT21 (http://www.qt21.eu/).
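
A minimal Python sketch of how the tokenized pairs might be loaded. It assumes the archive unpacks into two line-aligned UTF-8 text files, one segment per line, for the English source and the German machine-translated target. This record does not describe the file layout, so that assumption, the file names in the usage comment, and the helper names `read_lines` and `read_pairs` are all hypothetical.

```python
def read_lines(path, encoding="utf-8"):
    """Return the lines of a text file without their trailing newline."""
    with open(path, encoding=encoding) as handle:
        return [line.rstrip("\n") for line in handle]


def read_pairs(src_path, mt_path, encoding="utf-8"):
    """Read two line-aligned files into (source_tokens, target_tokens) pairs.

    Assumption: line i of the source file corresponds to line i of the
    target file. Raises ValueError if the files differ in length.
    """
    src_lines = read_lines(src_path, encoding)
    mt_lines = read_lines(mt_path, encoding)
    if len(src_lines) != len(mt_lines):
        raise ValueError(
            f"line count mismatch: {len(src_lines)} source vs {len(mt_lines)} target"
        )
    # The data is already tokenized, so splitting on whitespace yields the tokens.
    return [(s.split(), t.split()) for s, t in zip(src_lines, mt_lines)]


# Usage (hypothetical file names, not taken from this record):
# pairs = read_pairs("test.src", "test.mt")
# print(len(pairs))  # the description above states 2,000 pairs
```

Checking the number of loaded pairs against the 2,000 stated in the description is a quick way to catch a truncated download or misaligned files.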

Identifier
PID http://hdl.handle.net/11372/LRT-2725
Related Identifier http://www.statmt.org/wmt18/ape-task.html
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11372/LRT-2725
Provenance
Creator Turchi, Marco; Negri, Matteo; Chatterjee, Rajen
Publisher Fondazione Bruno Kessler, Trento, Italy
Publication Year 2018
Funding Reference info:eu-repo/grantAgreement/EC/H2020/645452
Rights AGREEMENT ON THE USE OF DATA IN QT21 APE Task; https://lindat.mff.cuni.cz/repository/xmlui/page/licence-TAUS_QT21; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language English; German
Resource Type corpus
Format text/plain; charset=utf-8; application/zip; downloadable_files_count: 1
Discipline Linguistics