English-Hindi Parallel Corpus

Please use the following text to cite this item:
Bojar, Ondřej; Straňák, Pavel; Zeman, Daniel; Jain, Gaurav and Damani, Om Prakesh, 2010, English-Hindi Parallel Corpus, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11858/00-097C-0000-0001-BD17-1.
Date issued
2010-05-11
Language(s)
English, Hindi
Description
English-Hindi parallel corpus collected from several sources. Tokenized and sentence-aligned. A part of the data is our patch for the Emille parallel corpus.
This item is Publicly Available
and licensed under:
 Files in this item
Name
English-Hindi-without-Emille.tgz
Size
12.16 MB
Format
application/x-gzip
Description
The complete parallel data, including the patch for the Emille corpus
MD5
fbe1e19c0e80fd7792e900656ce4c1a9
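The MD5 value above can be used to check that a downloaded copy of the archive is intact. The following is a minimal sketch, assuming the archive was saved under its listed name in the current directory; the helper names `md5_of_file` and `matches_expected` are illustrative and not part of the repository.

```python
import hashlib

# Checksum listed for English-Hindi-without-Emille.tgz on this page.
EXPECTED_MD5 = "fbe1e19c0e80fd7792e900656ce4c1a9"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """Compare the file's digest with the expected one, ignoring case."""
    return md5_of_file(path) == expected.strip().lower()


if __name__ == "__main__":
    # Assumed local path; adjust to wherever the archive was downloaded.
    archive = "English-Hindi-without-Emille.tgz"
    print("checksum OK" if matches_expected(archive) else "checksum MISMATCH")
```

A mismatch usually indicates an incomplete or corrupted download, in which case the file should be fetched again.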