Dušek, Ondřej; Hajič, Jan; Hlaváčová, Jaroslava; Pecina, Pavel; Tamchyna, Aleš; Urešová, Zdeňka 2014-04-28T14:28:51Z 2014-04-28T14:28:51Z 2014-04-28
dc.description This package contains data sets for development and testing of machine translation of sentences from summaries of medical articles between Czech, English, French, and German.
dc.description.sponsorship This work was supported by the EU FP7 project Khresmoi (European Commission contract No. 257528). The language resources are distributed by the LINDAT/Clarin project of the Ministry of Education, Youth and Sports of the Czech Republic (project no. LM2010013). We thank all the data providers and copyright holders for providing the source data and anonymous experts for translating the sentences.
dc.language.iso eng
dc.language.iso ces
dc.language.iso fra
dc.language.iso deu
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.relation info:eu-repo/grantAgreement/EC/FP7/257528
dc.rights Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)
dc.subject corpus
dc.subject test data
dc.subject medical
dc.subject health
dc.subject machine translation
dc.subject Czech
dc.subject French
dc.subject German
dc.subject English
dc.title Khresmoi Summary Translation Test Data 1.1
dc.type corpus
metashare.ResourceInfo#ContactInfo#PersonInfo.surname Pecina
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName Pavel
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName Charles University in Prague, UFAL
metashare.ResourceInfo#DistributionInfo.availability unrestrictedUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse academic-nonCommercialUse
metashare.ResourceInfo#ContentInfo.mediaType text
metashare.ResourceInfo#TextInfo#SizeInfo.size 1500
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit sentences
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIN
sponsor European Union FP7-ICT-2010-6-257528 Khresmoi euFunds info:eu-repo/grantAgreement/EC/FP7/257528
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky LM2010013 LINDAT/CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat nationalFunds
files.size 663136
files.count 2

 Files in this item

This item is Publicly Available and licensed under: Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)
637.26 KB
data package
  • khresmoi-summary-test-set
    • khresmoi-summary-test.topic_id 7 kB
    • kB
    • kB
    • queries.clef2013ehealth.1-50.test.xml 21 kB
    • khresmoi-summary-dev.de 67 kB
    • khresmoi-summary-test.de 144 kB
    • khresmoi-summary-test.cs.sgm 151 kB
    • kB
    • khresmoi-summary-dev.en 58 kB
    • khresmoi-summary-test.en 123 kB
    • khresmoi-summary-test.en.sgm 143 kB
    • khresmoi-summary-dev.topic_id 3 kB
    • normalize-punctuation.pl 2 kB
    • khresmoi-summary-dev.cs.sgm 71 kB
    • khresmoi-summary-dev.cs 62 kB
    • khresmoi-summary-dev.fr 70 kB
    • khresmoi-summary-test.cs 132 kB
    • khresmoi-summary-test.doc_id 6 kB
    • README.TXT 10 kB
    • khresmoi-summary-test.fr 150 kB
    • khresmoi-summary-dev.en.sgm 68 kB
    • kB
    • khresmoi-summary-dev.doc_id 3 kB
10.34 KB
Text file
Khresmoi Summary Translation Test Data for the Medical Domain
version 1.1
Apr 28, 2014
Pavel Pecina

1. Description

This package contains data sets for development (Section dev) and testing (Section test) of machine translation of sentences from summaries of medical articles between Czech, English, French, and German. Version 1.1 of this data set differs from version 1.0 in punctuation which was normalized using the attached script

2. Preamble

2.1 Source

The original sentences are sampled from summaries of English medical documents crawled from the web in 2012 and identified to be relevant to 50 medical topics. The translations were carried out by the Charles University in Prague.

2.2 License

The Khresmoi Summary Test Set is made available under the terms of the Creative C . . .
