dc.contributor.author |
MEDVEĎ, MAREK |
dc.contributor.author |
Suchomel, Vít |
dc.date.accessioned |
2019-04-03T09:12:38Z |
dc.date.available |
2019-04-03T09:12:38Z |
dc.date.issued |
2019-04-02 |
dc.identifier.uri |
http://hdl.handle.net/11234/1-2970 |
dc.description |
Indonesian web corpus crawled in 2010. Encoded in UTF-8, cleaned, deduplicated, tagged by Morphind. |
dc.language.iso |
ind |
dc.publisher |
Masaryk University, NLP Centre |
dc.rights |
NLP Centre Web Corpus License |
dc.rights.uri |
https://lindat.mff.cuni.cz/repository/xmlui/page/license-NLPC-WeC |
dc.subject |
Web corpus |
dc.title |
Indonesian web corpus |
dc.type |
corpus |
metashare.ResourceInfo#ContentInfo.mediaType |
text |
dc.rights.label |
ACA |
has.files |
yes |
branding |
LINDAT / CLARIAH-CZ |
contact.person |
Marek Medveď xmedved1@fi.muni.cz Masaryk University, NLP Centre |
size.info |
109232712 tokens |
files.size |
217976472 |
files.count |
1 |