Search
Search Results
- Creator:
- Suchomel, Vít and Rychlý, Pavel
- Publisher:
- Masaryk University, NLP Centre
- Type:
- text and corpus
- Subject:
- text corpora, Ethiopian languages, web corpora, under-resourced languages, and Somali
- Language:
- Somali
- Description:
- Somali web corpus. Crawled by SpiderLing in January 2016. Encoded in UTF-8, cleaned, deduplicated.
- Rights:
- NLP Centre Web Corpus License, https://lindat.mff.cuni.cz/repository/xmlui/page/license-NLPC-WeC, and ACA