Zobrazit minimální záznam

 
dc.contributor.author Namly, Driss
dc.contributor.author Bouzoubaa, Karim
dc.contributor.author El Jihad, Abdelhamid
dc.date.accessioned 2023-03-27T14:29:53Z
dc.date.available 2023-03-27T14:29:53Z
dc.date.issued 2020-10-16
dc.identifier.uri http://hdl.handle.net/11372/LRT-5102
dc.description Comprehensive Arabic LEMmas is a lexicon covering a large list of Arabic lemmas and their corresponding inflected word forms (stems) with details (POS + Root). Each lexical entry represents a lemma followed by all its possible stems and each stem is enriched by its morphological features especially the root and the POS. It is composed of 164,845 lemmas representing 7,200,918 stems, detailed as follow: 757 Arabic particles 2,464,631 verbal stems 4,735,587 nominal stems The lexicon is provided as an LMF conformant XML-based file in UTF8 encoding, which represents about 1,22 Gb of data. Citation: – Namly Driss, Karim Bouzoubaa, Abdelhamid El Jihad, and Si Lhoussain Aouragh. “Improving Arabic Lemmatization Through a Lemmas Database and a Machine-Learning Technique.” In Recent Advances in NLP: The Case of Arabic Language, pp. 81-100. Springer, Cham, 2020.
dc.language.iso ara
dc.publisher ALELM
dc.rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.source.uri http://arabic.emi.ac.ma/alelm/?page_id=273/#Lexicon
dc.subject lexicon
dc.subject lemmatization
dc.subject stemming;
dc.title CALEM (Comprehensive Arabic LEMmas)
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.mediaType text
metashare.ResourceInfo#ContentInfo.detailedType lexicon
dc.rights.label PUB
has.files yes
branding LRT + Open Submissions
demo.uri http://arabic.emi.ac.ma/alelm/?page_id=273/#Lexicon
contact.person namly driss namly_driss@yahoo.fr Mohammadia School of Engineers, Mohammed Vth University, Rabat, Morocco.
size.info 7200918 entries
files.size 20504
files.count 1


 Soubory tohoto záznamu

Icon
Název
CALEM.xml
Velikost
20.02 KB
Formát
XML
Popis
Unknown
MD5
1c6a96459de872b9beaa28050bea46eb
 Stáhnout soubor

Zobrazit minimální záznam