This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

Indonesian web corpus

Please use the following text to cite this item or export to a predefined format:
MEDVEĎ, MAREK and Suchomel, Vít, 2019, Indonesian web corpus, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-2970.
Date issued
2019-04-02
Size
109232712 tokens
Language(s)
Description
Indonesian web corpus crawled in 2010. Encoded in UTF-8, cleaned, deduplicated, tagged by Morphind.
Subject(s)
This item isAcademic Use
and licensed under:
 Files in this item
Name
indonesianwac3_morphind_lempos.vert.7z
Size
207.88 MB
Format
application/octet-stream
Description
Unknown
MD5
f6553682cf576b5868fa8a118d6cbd68
Preview
  File Preview