This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.

skTenTen

Please use the following text to cite this item or export to a predefined format:
(:unav) Unknown author, 2011, skTenTen, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11858/00-097C-0000-0001-CCDB-0.
Date issued
2011-12-16
Size
876003720 tokens
Language(s)
Description
Slovak large web corpus skTenTen, comprising 876,003,720 tokens.
Acknowledgement
This item isPublicly Available
and licensed under:
 Files in this item
Name
skTenTen.vert.xz
Size
1.72 GB
Format
application/x-xz
Description
skTenTen corpus
MD5
fc47ff1edd498a04e935fff8993dd3af
Preview
  File Preview
    • skTenTen.vert8 GB