Show simple item record

 
dc.contributor.author Dušek, Ondřej
dc.contributor.author Hajič, Jan
dc.contributor.author Hlaváčová, Jaroslava
dc.contributor.author Pecina, Pavel
dc.contributor.author Tamchyna, Aleš
dc.contributor.author Urešová, Zdeňka
dc.date.accessioned 2014-04-28T14:28:51Z
dc.date.available 2014-04-28T14:28:51Z
dc.date.issued 2014-04-28
dc.identifier.uri http://hdl.handle.net/11858/00-097C-0000-0023-866E-1
dc.description This package contains data sets for development and testing of machine translation of sentences from summaries of medical articles between Czech, English, French, and German.
dc.description.sponsorship This work was supported by the EU FP7 project Khresmoi (European Comission contract No. 257528). The language resources are distributed by the LINDAT/Clarin project of the Ministry of Education, Youth and Sports of the Czech Republic (project no. LM2010013). We thank all the data providers and copyright holders for providing the source data and anonymous experts for translating the sentences.
dc.language.iso eng
dc.language.iso ces
dc.language.iso fra
dc.language.iso deu
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.relation info:eu-repo/grantAgreement/EC/FP7/257528
dc.relation.isreplacedby http://hdl.handle.net/11234/1-2122
dc.rights Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)
dc.rights.uri http://creativecommons.org/licenses/by-nc/3.0/
dc.source.uri http://khresmoi.eu/
dc.subject corpus
dc.subject test data
dc.subject medical
dc.subject health
dc.subject machine translation
dc.subject Czech
dc.subject French
dc.subject German
dc.subject English
dc.title Khresmoi Summary Translation Test Data 1.1
dc.type corpus
metashare.ResourceInfo#ContactInfo#PersonInfo.surname Pecina
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName Pavel
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName Charles University in Prague, UFAL
metashare.ResourceInfo#DistributionInfo.availability unrestrictedUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse academic-nonCommercialUse
metashare.ResourceInfo#ContentInfo.mediaType text
metashare.ResourceInfo#TextInfo#SizeInfo.size 1500
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit sentences
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email pecina@ufal.mff.cuni.cz
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIAH-CZ
sponsor European Union FP7-ICT-2010-6-257528 Khresmoi euFunds info:eu-repo/grantAgreement/EC/FP7/257528
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky LM2010013 LINDAT/CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat nationalFunds
size.info 1500 sentences
files.size 663136
files.count 2


 Files in this item

 Download all files in item (647.59 KB)
This item is
Publicly Available
and licensed under:
Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)
Distributed under Creative Commons Attribution Required Noncommercial
Icon
Name
khresmoi-summary-test-set.tgz
Size
637.26 KB
Format
application/x-gzip
Description
data package
MD5
c0d333e3f6d8f2db1cc281821d5bcbe8
 Download file  Preview
 File Preview  
  • khresmoi-summary-test-set
    • khresmoi-summary-test.topic_id7 kB
    • khresmoi-summary-dev.de.sgm77 kB
    • khresmoi-summary-test.fr.sgm170 kB
    • queries.clef2013ehealth.1-50.test.xml21 kB
    • khresmoi-summary-dev.de67 kB
    • khresmoi-summary-test.de144 kB
    • khresmoi-summary-test.cs.sgm151 kB
    • khresmoi-summary-dev.fr.sgm80 kB
    • khresmoi-summary-dev.en58 kB
    • khresmoi-summary-test.en123 kB
    • khresmoi-summary-test.en.sgm143 kB
    • khresmoi-summary-dev.topic_id3 kB
    • normalize-punctuation.pl2 kB
    • khresmoi-summary-dev.cs.sgm71 kB
    • khresmoi-summary-dev.cs62 kB
    • khresmoi-summary-dev.fr70 kB
    • khresmoi-summary-test.cs132 kB
    • khresmoi-summary-test.doc_id6 kB
    • README.TXT10 kB
    • khresmoi-summary-test.fr150 kB
    • khresmoi-summary-dev.en.sgm68 kB
    • khresmoi-summary-test.de.sgm163 kB
    • khresmoi-summary-dev.doc_id3 kB
Icon
Name
README.TXT
Size
10.34 KB
Format
Text file
Description
Readme
MD5
e1f594eb6743282cccfadd155e297ca9
 Download file  Preview
 File Preview  
Khresmoi Summary Translation Test Data for the Medical Domain version 1.1
                                Apr 28, 2014

                  Pavel Pecina <pecina@ufal.mff.cuni.cz>
 
1. Description

   This package contains data sets for development (Section dev) and testing
   (Section test) of machine translation of sentences from summaries of
   medical articles between Czech, English, French, and German.

   Version 1.1 of this data set differs from version 1.0 in punctuation which
   was normalized using the attached script normalize-punctuation.pl.

2. Preamble

    2.1 Source

        The original sentences are sampled from summaries of English medical
        documents crawled from the web in 2012 and identified to be relevant
        to 50 medical topics. 

        The translations were carried out by the Charles University in Prague.

    2.2 License

        The Khresmoi Summary Test Set is made available under the terms of the
        Creative C . . .
                                            

Show simple item record