Indonesian web corpus
Please use the following text to cite this item:
Medveď, Marek and Suchomel, Vít, 2019,
Indonesian web corpus, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-2970.
Authors
Marek Medveď, Vít Suchomel
Item identifier
http://hdl.handle.net/11234/1-2970
Date issued
2019-04-02
Size
109,232,712 tokens
Language(s)
Indonesian
Description
An Indonesian web corpus crawled in 2010. The text is encoded in UTF-8, cleaned, deduplicated, and tagged with MorphInd.
Publisher
Subject(s)
Collections
Files in this item
- Name: indonesianwac3_morphind_lempos.vert.7z
- Size: 207.88 MB
- Format: application/octet-stream
- Description: Unknown
- MD5: f6553682cf576b5868fa8a118d6cbd68
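The MD5 checksum listed for the file can be used to confirm that a download arrived intact. The following is a minimal sketch, not part of the original record: the helper names `md5_of_file` and `verify_download` are hypothetical, and the script assumes the archive was saved under its listed name in the current working directory.

```python
import hashlib
from pathlib import Path

# Checksum and file name as listed in this record.
EXPECTED_MD5 = "f6553682cf576b5868fa8a118d6cbd68"
ARCHIVE_NAME = "indonesianwac3_morphind_lempos.vert.7z"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumption: the archive sits in the current working directory.
    archive = Path(ARCHIVE_NAME)
    if not archive.exists():
        print(f"{archive} not found; download it first.")
    elif verify_download(archive):
        print("Checksum matches the published MD5.")
    else:
        print("Checksum mismatch; the download may be corrupted.")
```

Reading in chunks keeps memory use flat for an archive of roughly 208 MB. MD5 is adequate here as an integrity check against transfer errors, but it is not a defense against deliberate tampering.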


