Files in this item

 Download all files in item (647.59 KB)
This item is
Publicly Available
and licensed under:
Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)
Distributed under Creative Commons Attribution Required Noncommercial
Icon
Name
khresmoi-summary-test-set.tgz
Size
637.26 KB
Format
application/x-gzip
Description
data package
MD5
c0d333e3f6d8f2db1cc281821d5bcbe8
 Download file  Preview
 File Preview  
  • khresmoi-summary-test-set
    • khresmoi-summary-test.topic_id7 kB
    • khresmoi-summary-dev.de.sgm77 kB
    • khresmoi-summary-test.fr.sgm170 kB
    • queries.clef2013ehealth.1-50.test.xml21 kB
    • khresmoi-summary-dev.de67 kB
    • khresmoi-summary-test.de144 kB
    • khresmoi-summary-test.cs.sgm151 kB
    • khresmoi-summary-dev.fr.sgm80 kB
    • khresmoi-summary-dev.en58 kB
    • khresmoi-summary-test.en123 kB
    • khresmoi-summary-test.en.sgm143 kB
    • khresmoi-summary-dev.topic_id3 kB
    • normalize-punctuation.pl2 kB
    • khresmoi-summary-dev.cs.sgm71 kB
    • khresmoi-summary-dev.cs62 kB
    • khresmoi-summary-dev.fr70 kB
    • khresmoi-summary-test.cs132 kB
    • khresmoi-summary-test.doc_id6 kB
    • README.TXT10 kB
    • khresmoi-summary-test.fr150 kB
    • khresmoi-summary-dev.en.sgm68 kB
    • khresmoi-summary-test.de.sgm163 kB
    • khresmoi-summary-dev.doc_id3 kB
Icon
Name
README.TXT
Size
10.34 KB
Format
Text file
Description
Readme
MD5
e1f594eb6743282cccfadd155e297ca9
 Download file  Preview
 File Preview  
Khresmoi Summary Translation Test Data for the Medical Domain version 1.1
                                Apr 28, 2014

                  Pavel Pecina <pecina@ufal.mff.cuni.cz>
 
1. Description

   This package contains data sets for development (Section dev) and testing
   (Section test) of machine translation of sentences from summaries of
   medical articles between Czech, English, French, and German.

   Version 1.1 of this data set differs from version 1.0 in punctuation which
   was normalized using the attached script normalize-punctuation.pl.

2. Preamble

    2.1 Source

        The original sentences are sampled from summaries of English medical
        documents crawled from the web in 2012 and identified to be relevant
        to 50 medical topics. 

        The translations were carried out by the Charles University in Prague.

    2.2 License

        The Khresmoi Summary Test Set is made available under the terms of the
        Creative C . . .