Show simple item record

 
dc.contributor.author MEDVEĎ, MAREK
dc.contributor.author Suchomel, Vít
dc.date.accessioned 2019-04-03T09:12:38Z
dc.date.available 2019-04-03T09:12:38Z
dc.date.issued 2019-04-02
dc.identifier.uri http://hdl.handle.net/11234/1-2970
dc.description Indonesian web corpus crawled in 2010. Encoded in UTF-8, cleaned, deduplicated, tagged by Morphind.
dc.language.iso ind
dc.publisher Masaryk University, NLP Centre
dc.rights NLP Centre Web Corpus License
dc.rights.uri https://lindat.mff.cuni.cz/repository/xmlui/page/license-NLPC-WeC
dc.subject web corpus
dc.title Indonesian web corpus
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
dc.rights.label ACA
has.files yes
branding LINDAT / CLARIN
contact.person Marek Medveď xmedved1@fi.muni.cz Masaryk University, NLP Centre
size.info 109232712 tokens
files.size 217976472
files.count 1


 Files in this item

This item is
Academic Use
and licensed under:
NLP Centre Web Corpus License
Icon
Name
indonesianwac3_morphind_lempos.vert.7z
Size
207.88 MB
Format
Unknown
Description
vertical text
MD5
f6553682cf576b5868fa8a118d6cbd68
 Download file

Show simple item record