Files in this item
This item is Publicly Available and licensed under: Creative Commons - Attribution 3.0 Unported (CC BY 3.0)
- Name: English-Hindi-without-Emille.tgz
- Size: 12.16 MB
- Format: application/x-gzip
- Description: The complete parallel data, including the patch for the Emille corpus
- MD5: fbe1e19c0e80fd7792e900656ce4c1a9
- UMC002-English-Hindi
  - wikipedia-named-entities-2008
    - en.tok.gz (5 kB)
    - hi.tok.gz (6 kB)
  - agrocorpus
    - README (693 B)
    - en.tok.gz (17 kB)
    - hi.tok.gz (15 kB)
  - shabdanjali-dictionary
    - en.tok.gz (76 kB)
    - README (1 kB)
    - hi.tok.gz (213 kB)
    - hi.filtered.tok.gz (6 kB)
    - en.filtered.tok.gz (4 kB)
  - tides-cleaned-by-ufal
    - hi.test.tok.gz (3 MB)
    - hi.train.tok.gz (3 MB)
    - en.test.tok.gz (2 MB)
    - hi.dev.tok.gz (66 kB)
    - en.dev.tok.gz (47 kB)
    - en.train.tok.gz (2 MB)
    - README (680 B)
  - wikipedia-named-entities-2009
    - en.tok.gz (4 kB)
    - hi.tok.gz (5 kB)
  - danielpipes
    - README (51 B)
    - en.tok.gz (354 kB)
    - hi.tok.gz (319 kB)
  - acl-2005-shared-task
    - en.tok.gz (94 kB)
    - hi.tok.gz (148 kB)
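
The MD5 value listed for the archive can be used to check that a download arrived intact. The following is a minimal sketch, not part of the item itself: the helper names `md5_of_file` and `matches_listed_md5` are hypothetical, and the script assumes the archive was saved under its listed name in the current directory.

```python
import hashlib
from pathlib import Path

# MD5 value copied from the item's metadata.
EXPECTED_MD5 = "fbe1e19c0e80fd7792e900656ce4c1a9"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, expected=EXPECTED_MD5):
    """Compare a file's MD5 digest with the expected hex string."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed download location; adjust to wherever the file was saved.
    archive = Path("English-Hindi-without-Emille.tgz")
    if archive.exists():
        print("checksum OK" if matches_listed_md5(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

Reading in chunks keeps memory use flat regardless of archive size. A mismatch usually points to an incomplete or corrupted download.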