This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.

czes

Please use the following text to cite this item or export to a predefined format:
(:unav) Unknown author, 2011, czes, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11858/00-097C-0000-0001-CCCF-C.
Date issued
2011-12-15
Size
465102710 tokens
Language(s)
Description
First version of the very large Czech corpus Czes created with a new set of tools. It comprises 465,102,710 tokens.
Acknowledgement
This item isPublicly Available
and licensed under:
 Files in this item
Name
czes.xdedupl.onioned.vert.gz
Size
1.31 GB
Format
application/x-gzip
Description
Czes corpus
MD5
47908a912ee1477b4939ce5be8aeb78a
Preview
  File Preview
    • czes.xdedupl.onioned.vert3 GB