-
Test Data EN-DE APE Shared Task WMT17
Test data for the WMT 2017 Automatic post-editing task (the same used for the Sentence-level Quality Estimation task). They consist in 2,000 English-German pairs (source and... -
APE Shared Task WMT17: Human Post-edits Test Data EN-DE
Human post-edited test sentences for the WMT 2017 Automatic post-editing task. This consists in 2,000 German sentences belonging to the IT domain and already tokenized. Source... -
IWPT 2020 Shared Task Data and System Outputs
This package contains data used in the IWPT 2020 shared task. It contains training, development and test (evaluation) datasets. The data is based on a subset of Universal... -
Test Data DE-EN APE Shared Task WMT17
Test data for the WMT 2017 Automatic post-editing task (the same used for the Sentence-level Quality Estimation task). They consist in German-English triplets (source and... -
WMT17 En-De APE Shared Task Data
Training data for the WMT 2017 Automatic post-editing task (the same used for the Sentence-level Quality Estimation task). They consist in 11,000 English-German triplets... -
WMT16 Tuning Shared Task Models (English-to-Czech)
This item contains models to tune for the WMT16 Tuning shared task for English-to-Czech. CzEng 1.6pre (http://ufal.mff.cuni.cz/czeng/czeng16pre) corpus is used for the training... -
WMT16 Tuning Shared Task Models (Czech-to-English)
The item contains models to tune for the WMT16 Tuning shared task for Czech-to-English. CzEng 1.6pre (http://ufal.mff.cuni.cz/czeng/czeng16pre) corpus is used for the training... -
WMT18 APE Shared Task: En-DE NMT Train and Dev Data
Training and development data for the WMT 2018 Automatic post-editing task. They consist in English-German triplets (source, target and post-edit) belonging to the information... -
WMT17 De-En APE Shared Task Data
Training and development data for the WMT 2017 Automatic post-editing task (the same used for the Sentence-level Quality Estimation task). They consist in German-English... -
Hindi Visual Genome 1.0
Data Hindi Visual Genome 1.0, a multimodal dataset consisting of text and images suitable for English-to-Hindi multimodal machine translation task and multimodal research. We... -
IWPT 2021 Shared Task Data and System Outputs
This package contains data used in the IWPT 2021 shared task. It contains training, development and test (evaluation) datasets. The data is based on a subset of Universal... -
WMT16 APE Shared Task Data - Reference sentences
Training, development and test data consist in German sentences belonging to the IT domain and already tokenized. These sentences are the references of the data released for the... -
Test Data EN-DE MT_NMT APE Shared Task WMT18
Test data for the WMT 2018 Automatic post-editing task. They consist in English-German pairs (source and target) belonging to the information technology domain and already... -
APE Shared Task WMT17: Human Post-edits Test Data DE-EN
Human post-edited test sentences for the WMT 2017 Automatic post-editing task. This consists in 2,000 English sentences belonging to the IT domain and already tokenized. Source... -
Test Data EN-DE MT_PBSMT APE Shared Task WMT18
Test data for the WMT 2018 Automatic post-editing task. They consist in English-German pairs (source and target) belonging to the information technology domain and already... -
WMT16 APE Shared Task Data
Training, development and text data (the same used for the Sentence-level Quality Estimation task) consist in English-German triplets (source, target and post-edit) belonging to...