This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

Italian Content Words

Please use the following text to cite this item or export to a predefined format:
Grella, Matteo, 2011, Italian Content Words, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11372/LRT-2476.
Date issued
2011
Size
2342120 items
Language(s)
Description
This resource is an Italian morphological dictionary for content words, encoded in a JSON Lines format text file. It contains correspondences between surface form and lexical forms of words followed by grammatical features. The surface word forms have been generated algorithmically by using stable phonological and morphological rules of the Italian language. Particular attention has been given to the generation of verbs for which rules have been extracted from the famous A.L e G. Lepschy, La lingua italiana. The dictionary with its remarkable coverage is particularly useful used together with the Italian Function Words (http://hdl.handle.net/11372/LRT-2288) for tasks such as POS-Tagging or Syntactic Parsing.
Publisher

Version History

Showing 1 - 3 out of 3 results
VersionDateSummary
2018-01-01 00:00:00
2018-01-01 00:00:00
1*
2011-01-01 00:00:00
* Selected version
 Files in this item
Name
italian_content_words.rar
Size
15.63 MB
Format
application/x-rar-compressed
Description
RAR Archive
MD5
80c31d0f9a7cc541e8b36419cf045ccb
Preview
  File Preview