This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

WMT18 APE Shared Task: En-DE NMT Train and Dev Data

Please use the following text to cite this item or export to a predefined format:
Turchi, Marco; Negri, Matteo and Chatterjee, Rajen, 2018, WMT18 APE Shared Task: En-DE NMT Train and Dev Data, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11372/LRT-2613.
Date issued
2018-02-12
Size
15000 items
Language(s)
Description
Training and development data for the WMT 2018 Automatic post-editing task. They consist in English-German triplets (source, target and post-edit) belonging to the information technology domain and already tokenized. Training and development respectively contain 13,442 and 1,000 triplets. A neural machine translation system has been used to generate the target segments. All data is provided by the EU project QT21 (http://www.qt21.eu/).
Acknowledgement
This item isPublicly Available
and licensed under:
 Files in this item
Name
en_de_NMT_train_dev.tgz
Size
1.47 MB
Format
application/x-gzip
Description
gzip Archive
MD5
c34ad252e4f23f85e851e0a0cc7fb308
Preview
  File Preview