WMT16 APE Shared Task Data - Reference sentences

PID

Training, development and test data consist in German sentences belonging to the IT domain and already tokenized. These sentences are the references of the data released for the 2016 edition of the WMT APE shared task. Differently from the data previously released, these sentences are obtained by manually translating the source sentence without leveraging the raw mt outputs. Training and development respectively contain 12,000 and 1,000 segments, while the test set 2,000 items. All data is provided by the EU project QT21 (http://www.qt21.eu/).

Identifier
PID http://hdl.handle.net/11234/1-2334
Related Identifier http://www.statmt.org/wmt16/ape-task.html
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11234/1-2334
Provenance
Creator Turchi, Marco; Negri, Matteo; Chatterjee, Rajen
Publisher Fondazione Bruno Kessler, Trento, Italy
Publication Year 2017
Rights AGREEMENT ON THE USE OF DATA IN QT21 APE Task; https://lindat.mff.cuni.cz/repository/xmlui/page/licence-TAUS_QT21; PUB
OpenAccess true
Contact Fondazione Bruno Kessler, Trento, Italy
Representation
Language German
Resource Type corpus
Format application/octet-stream; text/plain; charset=utf-8; downloadable_files_count: 3
Discipline Linguistics