This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

MorfFlex CZ 2.1 (2024-12-23)

Please use the following text to cite this item or export to a predefined format:
Hajič, Jan; Hlaváčová, Jaroslava; Mikulová, Marie; Straka, Milan and Štěpánková, Barbora, 2024, MorfFlex CZ 2.1 (2024-12-23), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-5833.
Date issued
2024-12-23
Size
126906921 entries
Language(s)
Description
MorfFlex CZ 2.1 is the Czech morphological dictionary developed originally by Jan Hajič as a spelling checker and lemmatization dictionary. MorfFlex CZ 2.1 is a part of the PDT-C 2.0 release https://hdl.handle.net/11234/1-5813. It is a minor upgrade from MorfFlex CZ 2.0, with the tagset unchanged, but with some additions and corrections for full compatibility with PDT-C 2.0 morphological annotation. MorfFlex is a flat list of lemma-tag-wordform triples. For each wordform, full inflectional information is coded in a positional tag. Wordforms are organized into entries (paradigm instances or paradigms in short) according to their formal morphological behavior. The paradigm (set of wordforms) is identified by a unique lemma. Apart from traditional morphological categories, the description also contains some semantic, stylistic and derivational information. For more details see a comprehensive specification of the Czech morphological annotation https://ufal.mff.cuni.cz/techrep/tr64.pdf .
Acknowledgement

Version History

Showing 1 - 5 out of 5 results
VersionDateSummary
5*
2024-12-23 00:00:00
2020-12-07 00:00:00
2016-11-15 00:00:00
2016-03-10 00:00:00
2013-01-01 00:00:00
* Selected version
 Files in this item
Name
czech-morfflex-2.1.tsv.xz
Size
238.88 MB
Format
application/x-xz
Description
xz Archive
MD5
76b4753ab291d53f05a7139596d0be72
Preview
  File Preview