This is not the latest version of this item. The latest version can be found here.
Italian Content Words v2
Please use the following text to cite this item or export to a predefined format:
Grella, Matteo, 2018,
Italian Content Words v2, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11372/LRT-2630.
Authors
Item identifier
Date issued
2018
Size
2342120 items
Language(s)
Description
This resource is the second version of an Italian morphological dictionary for content words, encoded in a JSON Lines format text file. It contains correspondences between surface form and lexical forms of words followed by standard grammatical properties. Compared to the first release, this version has a better JSON structure. The surface word forms have been generated algorithmically by using stable phonological and morphological rules of the Italian language. Particular attention has been given to the generation of verbs for which rules have been extracted from A.L e G. Lepschy, La Lingua Italiana. The dictionary with its remarkable coverage is particularly useful used together with the Italian Function Words v2 (http://hdl.handle.net/11372/LRT-2629) for tasks such as pos-tagging or syntactic parsing.
Publisher
Subject(s)
Collections
This item isPublicly Available
and licensed under:
Files in this item
- Name
- italian_content_words_v2.tar.gz
- Size
- 19.34 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 0f46194531cd486515234b8d821b619b

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz

