Manual Arabic spelling-errors correction for collected documents

Name: Manual Arabic spelling-errors correction for collected documents
License: http://creativecommons.org/licenses/by-sa/4.0/
Keywords: Manual Arabic spelling-errors correction for collected documents

Saty, Ahmed; Aouragh, Si Lhoussain; Bouzoubaa, Karim

dc.contributor.author	Saty, Ahmed
dc.contributor.author	Aouragh, Si Lhoussain
dc.contributor.author	Bouzoubaa, Karim
dc.date.accessioned	2023-05-09T09:27:45Z
dc.date.available	2023-05-09T09:27:45Z
dc.date.issued	2023-03-06
dc.identifier.uri	http://hdl.handle.net/11372/LRT-4763
dc.description	This file is a text corpus for Arabic spell checking. A group of people edited a set of files, and every spelling error they committed was recorded. The participants cover a broad range of profiles: male and female; old, middle-aged, and young; heavy and light computer users; etc. Through this work, we aim to help researchers and others interested in Arabic NLP by providing an Arabic spell-checking corpus that is ready and open for exploitation and interpretation. The study also made it possible to inventory most of the spelling mistakes made by editors of Arabic texts. The file contains the following sections (tags): people, the documents they printed, types of possible errors, and the errors they made. Each section (tag) carries data describing its details and content, which helps researchers extract research-oriented results. The people section contains basic information about each person and their level of computer use. The documents section lists all sentences in each document, with each sentence numbered so that it can be referenced from the errors section. The “type of errors” section lists all the possible errors, each with a description in Arabic and an illustrative example.
dc.language.iso	eng
dc.language.iso	ara
dc.publisher	Sudan University of Science and Technology
dc.relation.isreferencedby	https://journal.uob.edu.bh/handle/123456789/4934
dc.rights	Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri	http://creativecommons.org/licenses/by-sa/4.0/
dc.source.uri	http://arabic.emi.ac.ma/alelm/?q=Resources
dc.subject	Manual Arabic spelling-errors correction for collected documents
dc.title	Manual Arabic spelling-errors correction for collected documents
dc.type	corpus
metashare.ResourceInfo#ContentInfo.mediaType	text
dc.rights.label	PUB
has.files	yes
branding	LRT + Open Submissions
demo.uri	http://arabic.emi.ac.ma/alelm/?page_id=273/#Corpus
contact.person	Ahmed Saty wdsaty@hotmail.com Sudan University of Science and Technology
size.info	619 kb
files.size	633710
files.count	1
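
The description above names four sections (people, documents, types of errors, errors) but this record does not give the actual XML element names. A minimal sketch, assuming only that the file is well-formed XML: tally every element name so the sections can be located before writing a targeted parser. The function name `count_tags` and the tag names in the tiny sample are hypothetical and are not taken from the corpus.

```python
import io
import xml.etree.ElementTree as ET
from collections import Counter


def count_tags(source):
    """Count how often each element name occurs in an XML file.

    `source` may be a file path or a binary file-like object.
    """
    counts = Counter()
    # iterparse streams the document; "end" fires once per closed element
    for _event, elem in ET.iterparse(source, events=("end",)):
        counts[elem.tag] += 1
        elem.clear()  # release text and children that are no longer needed
    return counts


# Hypothetical miniature input; the real element names may differ.
sample = io.BytesIO(
    b"<corpus><people><person/><person/></people>"
    b"<errors><error/></errors></corpus>"
)
for tag, n in sorted(count_tags(sample).items()):
    print(tag, n)
```

To inspect the real corpus, pass the path of the downloaded XML file to `count_tags` in place of the in-memory sample.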

Files in this item

This item is Publicly Available and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

Name: manual Spelling-errors correction.xml
Size: 618.86 KB
Format: XML
Description: Unknown
MD5: a2d7a7e10c4f7836079ca15da4952e65
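
The MD5 value listed above can be used to check that a downloaded copy is intact. A minimal sketch, assuming the file has been saved locally; the helper names `md5_of_file` and `matches_expected` are illustrative, and the local path is whatever the reader chose when downloading.

```python
import hashlib

# Checksum as published in this record
EXPECTED_MD5 = "a2d7a7e10c4f7836079ca15da4952e65"


def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # Read in chunks so large files do not need to fit in memory
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """True if the file's MD5 digest equals the expected value."""
    return md5_of_file(path) == expected.lower()
```

For example, `matches_expected("manual Spelling-errors correction.xml")` should return `True` for an uncorrupted download, assuming the file was saved under its listed name in the current directory.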
