dc.contributor.author |
Turchi, Marco |
dc.contributor.author |
Negri, Matteo |
dc.contributor.author |
Chatterjee, Rajen |
dc.date.accessioned |
2017-07-13T11:55:38Z |
dc.date.available |
2017-07-13T11:55:38Z |
dc.date.issued |
2017-07-12 |
dc.identifier.uri |
http://hdl.handle.net/11234/1-2334 |
dc.description |
Training, development and test data consist in German sentences belonging to the IT domain and already tokenized. These sentences are the references of the data released for the 2016 edition of the WMT APE shared task. Differently from the data previously released, these sentences are obtained by manually translating the source sentence without leveraging the raw mt outputs. Training and development respectively contain 12,000 and 1,000 segments, while the test set 2,000 items. All data is provided by the EU project QT21 (http://www.qt21.eu/). |
dc.language.iso |
deu |
dc.publisher |
Fondazione Bruno Kessler, Trento, Italy |
dc.relation |
info:eu-repo/grantAgreement/EC/H2020/645452 |
dc.rights |
AGREEMENT ON THE USE OF DATA IN QT21 APE Task |
dc.rights.uri |
https://lindat.mff.cuni.cz/repository/xmlui/page/licence-TAUS_QT21 |
dc.source.uri |
http://www.statmt.org/wmt16/ape-task.html |
dc.subject |
machine translation |
dc.subject |
machine learning |
dc.subject |
automatic post-editing |
dc.subject |
shared task |
dc.title |
WMT16 APE Shared Task Data - Reference sentences |
dc.type |
corpus |
metashare.ResourceInfo#ContentInfo.mediaType |
text |
dc.rights.label |
PUB |
has.files |
yes |
branding |
LINDAT / CLARIAH-CZ |
contact.person |
Marco Turchi turchi@fbk.eu FBK |
contact.person |
Matteo Negri negri@fbk.eu FBK |
contact.person |
Rajen Chatterjee chatterjee@fbk.eu FBK |
sponsor |
European Union H2020-ICT-2014-1-645452 QT21: Quality Translation 21 euFunds info:eu-repo/grantAgreement/EC/H2020/645452 |
size.info |
12000 items |
size.info |
2000 items |
size.info |
1000 items |
files.size |
1714882 |
files.count |
3 |