This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

Test Data EN-DE MT_PBSMT APE Shared Task WMT18

Please use the following text to cite this item or export to a predefined format:
Turchi, Marco; Negri, Matteo and Chatterjee, Rajen, 2018, Test Data EN-DE MT_PBSMT APE Shared Task WMT18, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11372/LRT-2725.
Date issued
2018-05-04
Size
2000 items
Language(s)
Description
Test data for the WMT 2018 Automatic post-editing task. They consist in English-German pairs (source and target) belonging to the information technology domain and already tokenized. Test set contains 2,000 pairs. A phrase-based machine translation system has been used to generate the target segments. This test set is sampled from the same dataset used for the 2016 and 2017 APE shared task editions. All data is provided by the EU project QT21 (http://www.qt21.eu/).
Acknowledgement
This item isPublicly Available
and licensed under:
 Files in this item
Name
En-De_PBSMT_Test_2018.zip
Size
128.76 KB
Format
application/zip
Description
Zip
MD5
212cb488f31ac9d6e5c7ac3b9f2fcb25
Preview
  File Preview
  • __MACOSX
    • ._en-de.src.test.2018355 B
    • ._en-de.mt.test.2018355 B
    • en-de.mt.test.2018214 kB
    • en-de.src.test.2018171 kB