dc.contributor.author | Dušek, Ondřej |
dc.contributor.author | Hajič, Jan |
dc.contributor.author | Hlaváčová, Jaroslava |
dc.contributor.author | Pecina, Pavel |
dc.contributor.author | Tamchyna, Aleš |
dc.contributor.author | Urešová, Zdeňka |
dc.date.accessioned | 2014-04-28T14:28:51Z |
dc.date.available | 2014-04-28T14:28:51Z |
dc.date.issued | 2014-04-28 |
dc.identifier.uri | http://hdl.handle.net/11858/00-097C-0000-0023-866E-1 |
dc.description | This package contains data sets for development and testing of machine translation of sentences from summaries of medical articles between Czech, English, French, and German. |
dc.description.sponsorship | This work was supported by the EU FP7 project Khresmoi (European Comission contract No. 257528). The language resources are distributed by the LINDAT/Clarin project of the Ministry of Education, Youth and Sports of the Czech Republic (project no. LM2010013). We thank all the data providers and copyright holders for providing the source data and anonymous experts for translating the sentences. |
dc.language.iso | eng |
dc.language.iso | ces |
dc.language.iso | fra |
dc.language.iso | deu |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.relation | info:eu-repo/grantAgreement/EC/FP7/257528 |
dc.relation.isreplacedby | http://hdl.handle.net/11234/1-2122 |
dc.rights | Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc/3.0/ |
dc.source.uri | http://khresmoi.eu/ |
dc.subject | corpus |
dc.subject | test data |
dc.subject | medical |
dc.subject | health |
dc.subject | machine translation |
dc.subject | Czech |
dc.subject | French |
dc.subject | German |
dc.subject | English |
dc.title | Khresmoi Summary Translation Test Data 1.1 |
dc.type | corpus |
metashare.ResourceInfo#ContactInfo#PersonInfo.surname | Pecina |
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName | Pavel |
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName | Charles University in Prague, UFAL |
metashare.ResourceInfo#DistributionInfo.availability | unrestrictedUse |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse | academic-nonCommercialUse |
metashare.ResourceInfo#ContentInfo.mediaType | text |
metashare.ResourceInfo#TextInfo#SizeInfo.size | 1500 |
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit | sentences |
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email | pecina@ufal.mff.cuni.cz |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
sponsor | European Union FP7-ICT-2010-6-257528 Khresmoi euFunds info:eu-repo/grantAgreement/EC/FP7/257528 |
sponsor | Ministerstvo školství, mládeže a tělovýchovy České republiky LM2010013 LINDAT/CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat nationalFunds |
size.info | 1500 sentences |
files.size | 663136 |
files.count | 2 |
Files in this item
Download all files in item (647.59 KB)This item is
Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)
Publicly Available
and licensed under:Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)
- Name
- khresmoi-summary-test-set.tgz
- Size
- 637.26 KB
- Format
- application/x-gzip
- Description
- data package
- MD5
- c0d333e3f6d8f2db1cc281821d5bcbe8
- khresmoi-summary-test-set
- khresmoi-summary-test.topic_id7 kB
- khresmoi-summary-dev.de.sgm77 kB
- khresmoi-summary-test.fr.sgm170 kB
- queries.clef2013ehealth.1-50.test.xml21 kB
- khresmoi-summary-dev.de67 kB
- khresmoi-summary-test.de144 kB
- khresmoi-summary-test.cs.sgm151 kB
- khresmoi-summary-dev.fr.sgm80 kB
- khresmoi-summary-dev.en58 kB
- khresmoi-summary-test.en123 kB
- khresmoi-summary-test.en.sgm143 kB
- khresmoi-summary-dev.topic_id3 kB
- normalize-punctuation.pl2 kB
- khresmoi-summary-dev.cs.sgm71 kB
- khresmoi-summary-dev.cs62 kB
- khresmoi-summary-dev.fr70 kB
- khresmoi-summary-test.cs132 kB
- khresmoi-summary-test.doc_id6 kB
- README.TXT10 kB
- khresmoi-summary-test.fr150 kB
- khresmoi-summary-dev.en.sgm68 kB
- khresmoi-summary-test.de.sgm163 kB
- khresmoi-summary-dev.doc_id3 kB
- Name
- README.TXT
- Size
- 10.34 KB
- Format
- Text file
- Description
- Readme
- MD5
- e1f594eb6743282cccfadd155e297ca9
Khresmoi Summary Translation Test Data for the Medical Domain version 1.1 Apr 28, 2014 Pavel Pecina <pecina@ufal.mff.cuni.cz> 1. Description This package contains data sets for development (Section dev) and testing (Section test) of machine translation of sentences from summaries of medical articles between Czech, English, French, and German. Version 1.1 of this data set differs from version 1.0 in punctuation which was normalized using the attached script normalize-punctuation.pl. 2. Preamble 2.1 Source The original sentences are sampled from summaries of English medical documents crawled from the web in 2012 and identified to be relevant to 50 medical topics. The translations were carried out by the Charles University in Prague. 2.2 License The Khresmoi Summary Test Set is made available under the terms of the Creative C . . .