dc.contributor.author | Rychlý, Pavel |
dc.date.accessioned | 2018-01-11T15:31:58Z |
dc.date.available | 2018-01-11T15:31:58Z |
dc.date.issued | 2016 |
dc.identifier.uri | http://hdl.handle.net/11234/1-2593 |
dc.description | A substantially cleaned version of the existing morphologically annotated WIC Corpus. |
dc.language.iso | amh |
dc.publisher | Masaryk University, NLP Centre |
dc.relation.isreferencedby | https://link.springer.com/chapter/10.1007/978-3-319-45510-5_34 |
dc.relation.isreferencedby | https://www.sketchengine.co.uk/wp-content/uploads/2015/05/Corpus_Factory_2010.pdf |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ |
dc.source.uri | http://habit-project.eu/wiki/HabitSystemFinal |
dc.subject | text corpora |
dc.subject | Ethiopian languages |
dc.subject | web corpora |
dc.subject | under-resourced languages |
dc.subject | Amharic |
dc.title | Amharic WIC Corpus |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
demo.uri | https://corpora.fi.muni.cz/habit/run.cgi/first_form?corpname=am_wic;align= |
contact.person | Marie Stará; nlpassist@aurora.fi.muni.cz; Masaryk University, NLP Centre |
sponsor | Norway Grants; 7F14047; Harvesting big text data for under-resourced languages (HaBiT); Other |
sponsor | Ministerstvo školství, mládeže a tělovýchovy České republiky; LM2015071; LINDAT/CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat; nationalFunds |
size.info | 200561 tokens |
size.info | 195507 words |
files.size | 1300976 |
files.count | 1 |
Files in this item
This item is Publicly Available and licensed under: Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Name: wic.vert.gz
- Size: 1.24 MB
- Format: application/x-gzip
- Description: Unknown
- MD5: fce181efd29e144d6ae8ccac4ab481ba
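
The MD5 value and the files.size figure listed in this record can be used to check that a downloaded copy of wic.vert.gz is intact. The following is a minimal sketch, assuming that files.size is a byte count and that the file has been saved locally. The function names `md5_of_file` and `check_download` are hypothetical helpers introduced for illustration, not part of any repository tooling.

```python
import hashlib
from pathlib import Path

# Values copied from this record.
EXPECTED_MD5 = "fce181efd29e144d6ae8ccac4ab481ba"  # MD5 field of the file listing
EXPECTED_SIZE = 1300976  # files.size; assumed here to be a size in bytes


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_download(path, expected_md5=EXPECTED_MD5, expected_size=EXPECTED_SIZE):
    """Return True if both the file size and the MD5 digest match."""
    path = Path(path)
    size_ok = path.stat().st_size == expected_size
    md5_ok = md5_of_file(path) == expected_md5.lower()
    return size_ok and md5_ok
```

Calling `check_download("wic.vert.gz")` on a local copy (the path is an assumption about where the file was saved) returns `True` only when both values match. A mismatch usually points to an incomplete or corrupted download.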