This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

FAUST cs-en 0.5

Please use the following text to cite this item or export to a predefined format:
Hajič, Jan; et al., 2021, FAUST cs-en 0.5, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-3775.
Date issued
2021-09-20
Size
2223 sentences
Language(s)
Description
This machine translation test set contains 2223 Czech sentences collected within the FAUST project (https://ufal.mff.cuni.cz/grants/faust, http://hdl.handle.net/11234/1-3308). Each original (noisy) sentence was normalized (clean1 and clean2) and translated to English independently by two translators.
Acknowledgement
 Files in this item
Name
faust-csen.zip
Size
895.51 KB
Format
application/zip
Description
Zip
MD5
ddb9093027913f1883d25dfafc1ecb1a
Preview
  File Preview
  • scripts
    • faust-extract-tmx.pl1 kB
    • faust-merge-tsv.pl1 kB
  • original-tmx
    • faust-csen-rs.tmx1 MB
    • faust-csen-mu.tmx1 MB
    • faust-csen-noisy-cs.txt160 kB
    • README.txt979 B
    • faust-csen-noisy-en.txt338 kB
    • faust-csen-clean2-cs.txt159 kB
    • faust-csen-clean1-cs.txt159 kB