This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.

WMT17 De-En APE Shared Task Data

Please use the following text to cite this item or export to a predefined format:
Turchi, Marco; Chatterjee, Rajen and Negri, Matteo, 2017, WMT17 De-En APE Shared Task Data, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11372/LRT-1967.
Date issued
2017-02-15
Size
25000 items
Language(s)
Description
Training and development data for the WMT 2017 Automatic post-editing task (the same used for the Sentence-level Quality Estimation task). They consist in German-English triplets (source, target and post-edit) belonging to the pharmacological domain and already tokenized. Training and development respectively contain 25,000 and 1,000 triplets. All data is provided by the EU project QT21 (http://www.qt21.eu/).
Acknowledgement
This item isPublicly Available
and licensed under:
 Files in this item
Name
de-en_train_dev.zip
Size
2.88 MB
Format
application/zip
Description
APE 2017 De-En
MD5
5a7c5bc8b22d13001b3e330608739da6
Preview
  File Preview