Tigrinya Web Corpus
Please use the following text to cite this item:
Suchomel, Vít and Rychlý, Pavel, 2016,
Tigrinya Web Corpus, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-2592.
Authors
Suchomel, Vít; Rychlý, Pavel
Item identifier
http://hdl.handle.net/11234/1-2592
Date issued
2016
Size
2531443 tokens,
2087613 words,
139357 sentences
Language(s)
Tigrinya
Description
A Tigrinya web corpus crawled by SpiderLing in January 2016. The text is encoded in UTF-8, cleaned, and deduplicated.
Acknowledgement
Norway Grants
Project code: 7F14047
Project name: Harvesting big text data for under-resourced languages (HaBiT)
Ministry of Education, Youth and Sports of the Czech Republic
Project code: LM2015071
Project name: LINDAT/CLARIN: Institute for the Analysis, Processing and Distribution of Linguistic Data
Files in this item
- Name: ti16.tag.vert.gz
- Size: 13.36 MB
- Format: application/x-gzip
- Description: gzip archive
- MD5: f88d42ad6c989e472a35d56a1aed4003
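
The MD5 value listed for the file can be used to check a download before working with it. The Python sketch below is only an illustration, not part of the record: it assumes the archive has been saved locally as ti16.tag.vert.gz, and the helper names md5_of_file and count_vertical are hypothetical. The counting function further assumes the common "vertical" corpus layout (one token per line, structural markup such as <s> on lines of their own), which this record does not document, so its counts may not match the sizes reported above.

```python
import gzip
import hashlib

# MD5 checksum as listed in the record for ti16.tag.vert.gz.
EXPECTED_MD5 = "f88d42ad6c989e472a35d56a1aed4003"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def count_vertical(path):
    """Count token lines and <s> openings in a gzipped vertical file.

    Assumption (not confirmed by the record): every non-empty line is
    either a token line or a structural tag enclosed in angle brackets.
    """
    tokens = 0
    sentences = 0
    with gzip.open(path, "rt", encoding="utf-8") as fh:
        for line in fh:
            line = line.rstrip("\n")
            if not line:
                continue
            if line.startswith("<s>") or line.startswith("<s "):
                sentences += 1      # sentence opening tag
            elif line.startswith("<") and line.endswith(">"):
                continue            # other structural markup
            else:
                tokens += 1         # token line (possibly with tab-separated tags)
    return tokens, sentences


# Example use, assuming the file sits in the current directory:
#   if md5_of_file("ti16.tag.vert.gz") != EXPECTED_MD5:
#       raise SystemExit("checksum mismatch, download again")
#   print(count_vertical("ti16.tag.vert.gz"))
```

Reading the archive as a stream keeps memory use low, since the file is never decompressed in full.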


