WMT16 APE Shared Task Data - Reference sentences
Please use the following text to cite this item or export to a predefined format:
Turchi, Marco; Negri, Matteo and Chatterjee, Rajen, 2017,
WMT16 APE Shared Task Data - Reference sentences, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-2334.
Authors
Item identifier
Project URL
Date issued
2017-07-12
Size
12000 items,
2000 items,
1000 items
Language(s)
Description
Training, development and test data consist in German sentences belonging to the IT domain and already tokenized. These sentences are the references of the data released for the 2016 edition of the WMT APE shared task. Differently from the data previously released, these sentences are obtained by manually translating the source sentence without leveraging the raw mt outputs. Training and development respectively contain 12,000 and 1,000 segments, while the test set 2,000 items. All data is provided by the EU project QT21 (http://www.qt21.eu/).
Acknowledgement
European Union
Project code:H2020-ICT-2014-1-645452
Project name:QT21: Quality Translation 21
Collections
Files in this item
- Name
- train.2016.ref
- Size
- 1.31 MB
- Format
- application/octet-stream
- Description
- Training references
- MD5
- 5687b1399221a80962c73f40a58820d1

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- dev.2016.ref
- Size
- 118.85 KB
- Format
- application/octet-stream
- Description
- Development references
- MD5
- f76437c8a015f184f62866d773fe1779

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- test.2016.ref
- Size
- 210.72 KB
- Format
- application/octet-stream
- Description
- Test references
- MD5
- 9793da33a336d496bdb49d7b0a8e4cc4

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz

