An annotated corpus dedicated to the benchmark and evaluation of Arabic morphological analyzers. It consists of 100 words with all their possible analysis. The corpus contains several morphological information such as stem, pattern, root, lemma, etc.
THE LINDAT/CLARIN PROJECT (LM2015071 and CZ.02.1.01/0.0/0.0/16_013/0001781; formerly LM2010013) IS FULLY SUPPORTED BY THE MINISTRY OF EDUCATION, SPORTS AND YOUTH OF THE CZECH REPUBLIC UNDER THE PROGRAMME LM OF "LARGE INFRASTRUCTURES".
Copyright (c) 2018 UFAL MFF UK. All rights reserved.