Soubory tohoto záznamu
Licenční kategorie:
Licence: Creative Commons - Attribution 3.0 Unported (CC BY 3.0)
Publicly Available
Licence: Creative Commons - Attribution 3.0 Unported (CC BY 3.0)
- Název
- English-Hindi-without-Emille.tgz
- Velikost
- 12.16 MB
- Formát
- application/x-gzip
- Popis
- The complete parallel data, including the patch for the Emille corpus
- MD5
- fbe1e19c0e80fd7792e900656ce4c1a9
- UMC002-English-Hindi
- wikipedia-named-entities-2008
- en.tok.gz5 kB
- hi.tok.gz6 kB
- agrocorpus
- README693 B
- en.tok.gz17 kB
- hi.tok.gz15 kB
- shabdanjali-dictionary
- en.tok.gz76 kB
- README1 kB
- hi.tok.gz213 kB
- hi.filtered.tok.gz6 kB
- en.filtered.tok.gz4 kB
- tides-cleaned-by-ufal
- hi.test.tok.gz3 MB
- hi.train.tok.gz3 MB
- en.test.tok.gz2 MB
- hi.dev.tok.gz66 kB
- en.dev.tok.gz47 kB
- en.train.tok.gz2 MB
- README680 B
- wikipedia-named-entities-2009
- en.tok.gz4 kB
- hi.tok.gz5 kB
- danielpipes
- README51 B
- en.tok.gz354 kB
- hi.tok.gz319 kB
- acl-2005-shared-task
- en.tok.gz94 kB
- hi.tok.gz148 kB
- wikipedia-named-entities-2008