This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

English-Slovak Parallel Corpus

Please use the following text to cite this item or export to a predefined format:
Galuščáková, Petra; Garabík, Radovan and Bojar, Ondřej, 2012, English-Slovak Parallel Corpus, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11858/00-097C-0000-0006-AAE0-A.
Date issued
2012-05-15
Language(s)
Description
English-Slovak parallel corpus consisting of several freely available corpora (Acquis [1], Europarl [2], Official Journal of the European Union [3] and part of OPUS corpus [4] – EMEA, EUConst, KDE4 and PHP) and downloaded website of European Commission [5]. Corpus is published in both in plaintext format and with an automatic morphological annotation. References: [1] http://langtech.jrc.it/JRC-Acquis.html/ [2] http://www.statmt.org/europarl/ [3] http://apertium.eu/data [4] http://opus.lingfil.uu.se/ [5] http://ec.europa.eu/
Acknowledgement
This item isPublicly Available
and licensed under:
 Files in this item
Name
corpus-en-sk-export-format.tar.gz
Size
686.35 MB
Format
application/x-gzip
Description
gzip Archive
MD5
e6b3cd54b1485893fbc352d3c3becfc8
Preview
  File Preview
Name
corpus-en-sk-plaintext.tar.gz
Size
431.69 MB
Format
application/x-gzip
Description
gzip Archive
MD5
6d6885672e9d40c4d4c31f51796f1aa0
Preview
  File Preview