This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

WMT17 En-De APE Shared Task Data

Please use the following text to cite this item or export to a predefined format:
Turchi, Marco; Chatterjee, Rajen and Negri, Matteo, 2017, WMT17 En-De APE Shared Task Data, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-1966.
Date issued
2017-02-15
Size
11000 items
Language(s)
Description
Training data for the WMT 2017 Automatic post-editing task (the same used for the Sentence-level Quality Estimation task). They consist in 11,000 English-German triplets (source, target and post-edit) belonging to the IT domain and already tokenized. All data is provided by the EU project QT21 (http://www.qt21.eu/).
Acknowledgement
This item isPublicly Available
and licensed under:
 Files in this item
Name
en-de_train.zip
Size
1.05 MB
Format
application/zip
Description
Zip
MD5
4716581f74493979d476d3f6a3d5574f
Preview
  File Preview
  • __MACOSX
    • ._en-de_train180 B
    • en-de_train
      • ._README180 B
      • ._en-de.train.pe180 B
      • ._en-de.train.mt180 B
      • ._en-de.train.src180 B
  • en-de_train
    • en-de.train.mt1 MB
    • README174 B
    • en-de.train.src951 kB
    • en-de.train.pe1 MB