This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.

CORMAP - Corpus for Moroccan Arabic Processing

Please use the following text to cite this item or export to a predefined format:
tachicart, ridouane and bouzoubaa, karim, 2017, CORMAP - Corpus for Moroccan Arabic Processing, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11372/LRT-3551.
Date issued
2017-01-01
Size
34000 terms
Description
This resource is a corpus containing 34k Moroccan Colloquial Arabic sentences collected from different sources. The sentences are written in Arabic letters. This resource can be useful in some NLP applications such as Language Identification.
Publisher
Subject(s)
This item isPublicly Available
and licensed under:
 Files in this item
Name
corpus_lid.xml
Size
8.13 MB
Format
text/xml
Description
cormap
MD5
46a84edaf524a69bb9e856371adc3bd6
Preview
  File Preview