WMT16 APE Shared Task Data - Reference sentences

Please use the following text to cite this item or export to a predefined format:
Turchi, Marco; Negri, Matteo and Chatterjee, Rajen, 2017, WMT16 APE Shared Task Data - Reference sentences, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-2334.
Date issued
2017-07-12
Size
12000 items,
2000 items,
1000 items
Language(s)
German
Description
The training, development and test data consist of German sentences from the IT domain, already tokenized. These sentences are the references for the data released for the 2016 edition of the WMT APE shared task. Unlike the data previously released, these sentences were obtained by manually translating the source sentences without leveraging the raw MT outputs. The training and development sets contain 12,000 and 1,000 segments respectively, and the test set contains 2,000. All data is provided by the EU project QT21 (http://www.qt21.eu/).
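The segment counts given in the description can be checked after download with a short script. This is a minimal sketch, not part of the original release: it assumes each file is UTF-8 text with one segment per line and tokens separated by whitespace (the page only states that the data is tokenized), and the helper names `read_segments` and `check_counts` are hypothetical.

```python
from pathlib import Path

# Segment counts as stated in the item description.
EXPECTED_SEGMENTS = {
    "train.2016.ref": 12000,
    "dev.2016.ref": 1000,
    "test.2016.ref": 2000,
}


def read_segments(path):
    """Return one list of tokens per line.

    Assumption: one segment per line, UTF-8, whitespace-separated tokens.
    """
    with open(path, encoding="utf-8") as handle:
        return [line.split() for line in handle]


def check_counts(data_dir, expected=EXPECTED_SEGMENTS):
    """Return {filename: (found, expected)} for files whose count differs."""
    mismatches = {}
    for name, want in expected.items():
        found = len(read_segments(Path(data_dir) / name))
        if found != want:
            mismatches[name] = (found, want)
    return mismatches
```

Calling `check_counts(".")` in the directory holding the three files should return an empty dictionary if the one-segment-per-line assumption holds.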
Acknowledgement
This item is Publicly Available
and licensed under:
 Files in this item
Name
train.2016.ref
Size
1.31 MB
Format
application/octet-stream
Description
Training references
MD5
5687b1399221a80962c73f40a58820d1
Name
dev.2016.ref
Size
118.85 KB
Format
application/octet-stream
Description
Development references
MD5
f76437c8a015f184f62866d773fe1779
Name
test.2016.ref
Size
210.72 KB
Format
application/octet-stream
Description
Test references
MD5
9793da33a336d496bdb49d7b0a8e4cc4
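The MD5 values listed for each file can be used to confirm that a download is intact. The following is a minimal sketch using Python's standard `hashlib` module. The function names `md5_of` and `verify` are illustrative and not part of the repository, and the script assumes the three files sit in one local directory under the names shown in the listing.

```python
import hashlib
from pathlib import Path

# MD5 checksums as listed in the file table for this item.
EXPECTED_MD5 = {
    "train.2016.ref": "5687b1399221a80962c73f40a58820d1",
    "dev.2016.ref": "f76437c8a015f184f62866d773fe1779",
    "test.2016.ref": "9793da33a336d496bdb49d7b0a8e4cc4",
}


def md5_of(path, chunk_size=1 << 20):
    """Compute the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(data_dir, expected=EXPECTED_MD5):
    """Return the names of files that are missing or fail the checksum."""
    failed = []
    for name, want in expected.items():
        path = Path(data_dir) / name
        if not path.is_file() or md5_of(path) != want:
            failed.append(name)
    return failed
```

An empty list from `verify(".")` means all three files match the published checksums. Any name in the list indicates a missing or altered file that should be downloaded again.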