
WMT17 En-De APE Shared Task Data

Please use the following text to cite this item:
Turchi, Marco; Chatterjee, Rajen and Negri, Matteo, 2017, WMT17 En-De APE Shared Task Data, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-1966.
Date issued
2017-02-15
Size
11000 items
Language(s)
Description
Training data for the WMT 2017 Automatic Post-Editing (APE) task (the same data used for the Sentence-level Quality Estimation task). The set consists of 11,000 English-German triplets (source, target, and post-edit) from the IT domain, already tokenized. All data is provided by the EU project QT21 (http://www.qt21.eu/).
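As a rough illustration of how such triplets might be loaded, the sketch below assumes the archive unpacks into three line-aligned plain-text files, one each for source, target, and post-edit. The file names in the usage note are hypothetical; this record does not list the archive contents. Because the data is already tokenized, a whitespace split is assumed to be enough to recover the tokens.

```python
from typing import List, NamedTuple


class Triplet(NamedTuple):
    source: List[str]     # English source tokens
    target: List[str]     # German target tokens
    post_edit: List[str]  # German post-edit tokens


def read_triplets(src_path, tgt_path, pe_path):
    """Read three line-aligned, pre-tokenized files into a list of Triplets."""
    columns = []
    for path in (src_path, tgt_path, pe_path):
        with open(path, encoding="utf-8") as handle:
            columns.append([line.rstrip("\n") for line in handle])
    # Line i of each file must describe the same segment.
    if len({len(column) for column in columns}) != 1:
        raise ValueError("files are not line-aligned")
    # The data is already tokenized, so splitting on whitespace yields tokens.
    return [Triplet(s.split(), t.split(), p.split())
            for s, t, p in zip(*columns)]
```

A call might look like `read_triplets("train.src", "train.mt", "train.pe")`, where all three names are placeholders to be replaced with whatever files the archive actually contains. For the full training set the returned list would be expected to hold 11,000 entries.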
Acknowledgement
This item is Publicly Available
and licensed under:
Files in this item
Name
en-de_train.zip
Size
1.05 MB
Format
application/zip
Description
APE 2017 En-De Train
MD5
4716581f74493979d476d3f6a3d5574f
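The MD5 value above can be used to check that a download arrived intact. The following is a minimal sketch using Python's standard `hashlib` module; it assumes the archive was saved locally under the listed name `en-de_train.zip`, and the helper names `md5_of_file` and `verify_download` are illustrative only.

```python
import hashlib

# MD5 checksum listed for en-de_train.zip in this record.
EXPECTED_MD5 = "4716581f74493979d476d3f6a3d5574f"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True when the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()
```

Calling `verify_download("en-de_train.zip")` returns `True` only when the local file's digest matches the listed checksum. A mismatch usually points to an incomplete or corrupted download. MD5 is suitable here as an integrity check against transfer errors, not as protection against deliberate tampering.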