This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

MorfFlex CZ 2.0

Please use the following text to cite this item or export to a predefined format:
Hajič, Jan; Hlaváčová, Jaroslava; Mikulová, Marie; Straka, Milan and Štěpánková, Barbora, 2020, MorfFlex CZ 2.0, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-3186.
Date issued
2020-12-07
Size
125348899 entries
Language(s)
Description
MorfFlex CZ 2.0 is the Czech morphological dictionary developed originally by Jan Hajič as a spelling checker and lemmatization dictionary. MorfFlex is a flat list of lemma-tag-wordform triples. For each wordform, full inflectional information is coded in a positional tag. Wordforms are organized into entries (paradigm instances or paradigms in short) according to their formal morphological behavior. The paradigm (set of wordforms) is identified by a unique lemma. Apart from traditional morphological categories, the description also contains some semantic, stylistic and derivational information. For more details see a comprehensive specification of the Czech morphological annotation http://ufal.mff.cuni.cz/techrep/tr64.pdf .
Acknowledgement

Version History

Showing 1 - 5 out of 5 results
VersionDateSummary
2024-12-23 00:00:00
4*
2020-12-07 00:00:00
2016-11-15 00:00:00
2016-03-10 00:00:00
2013-01-01 00:00:00
* Selected version
 Files in this item
Name
czech-morfflex-2.0.tsv.xz
Size
234.84 MB
Format
application/x-xz
Description
xz Archive
MD5
7181c3dd89f605a47b32838651feeb93
Preview
  File Preview