Files in this item

This item is Publicly Available and licensed under:
Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Name
morfflex-cz.2013-11-12.utf8.conll09.tab.csv.xz
Size
270.94 MB
Format
application/x-xz
Description
Full (morphologically analyzed) wordlist for the Czech language, with form, lemma (without the sense suffix and without semantic/syntactic suffixes), CoNLL-2009 Shared Task major POS, and CoNLL-2009 Shared Task word features. Fields are tab-separated and always filled with a non-empty string; lines end with a linefeed only, and the encoding is UTF-8.
MD5
43bbf20579c759e7e40ac3c40b58abc1
 Download file
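The MD5 value listed for each file can be used to check a download before unpacking it. The following is a minimal sketch using only the Python standard library; the function names `md5_of_file` and `matches_listed_md5` are illustrative and not part of the distribution.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks.

    Chunked reading keeps memory use flat even for archives of
    several hundred megabytes.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed_md5):
    """Compare a file's digest with the MD5 string shown in the listing."""
    return md5_of_file(path) == listed_md5.strip().lower()
```

For example, assuming the first archive has been saved under its listed name in the current directory, `matches_listed_md5("morfflex-cz.2013-11-12.utf8.conll09.tab.csv.xz", "43bbf20579c759e7e40ac3c40b58abc1")` should return `True` for an intact download.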
Name
morfflex-cz.2013-11-12.utf8.lemmaID_suff-tag-form.tab.csv.xz
Size
226.83 MB
Format
application/x-xz
Description
Full (morphologically analyzed) wordlist for the Czech language, with lemma (which includes the sense suffix (-<number>) and semantic/syntactic suffixes and comments in PDT format), full positional tag in PDT format, and form (3 fields). Fields are tab-separated and always filled with a non-empty string; lines end with a linefeed only, and the encoding is UTF-8.
MD5
f3c84f257bd47cea5c4b405406be559d
 Download file
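The three-field layout described above (lemma, tag, form) can be split with a few lines of Python. This is a rough sketch under stated assumptions: the fields are taken to appear in the order given in the description, the sense suffix is taken to be a trailing `-<number>`, and everything from the first following underscore onward is treated as the semantic/syntactic suffixes and comments. The sample strings in the usage note are illustrative and not copied from the file; a lemma whose base itself contains an underscore would be split incorrectly by this pattern.

```python
import re

# base lemma, optional "-<number>" sense suffix, optional "_..." remainder
LEMMA_PATTERN = re.compile(
    r"^(?P<base>.+?)(?:-(?P<sense>\d+))?(?P<extra>_.*)?$"
)


def split_lemma(lemma_id):
    """Split a lemma field into (base, sense number or None, extra suffixes)."""
    match = LEMMA_PATTERN.match(lemma_id)
    if match is None:
        # Fallback for unexpected input such as an empty string.
        return lemma_id, None, ""
    sense = match.group("sense")
    return (
        match.group("base"),
        int(sense) if sense is not None else None,
        match.group("extra") or "",
    )


def parse_line(line):
    """Parse one 'lemma<TAB>tag<TAB>form' line into a dictionary."""
    lemma_id, tag, form = line.rstrip("\n").split("\t")
    base, sense, extra = split_lemma(lemma_id)
    return {
        "lemma_id": lemma_id,
        "base": base,
        "sense": sense,
        "extra": extra,
        "tag": tag,
        "form": form,
    }
```

With an illustrative lemma such as `stát-2_^(něco_stojí)`, `split_lemma` yields the base `stát`, the sense number `2`, and the remainder `_^(něco_stojí)`; a lemma without any suffix is returned unchanged with `None` and an empty string.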
Name
morfflex-cz.2013-05-02.utf8.conll09.tab.csv.gz
Size
458.97 MB
Format
application/x-gzip
Description
Full (morphologically analyzed) wordlist for the Czech language, with form, lemma (without the sense suffix and without semantic/syntactic suffixes), CoNLL-2009 Shared Task major POS, and CoNLL-2009 Shared Task word features. Fields are tab-separated and always filled with a non-empty string; lines end with a linefeed only, and the encoding is UTF-8.
MD5
f24291b8c910b8dd56c31c878fdcbed6
 Download file
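The archives in this item use two compression formats (xz and gzip), and both can be streamed without unpacking them to disk. The sketch below picks a reader from the file extension and yields the fields of each CoNLL-2009 line. It assumes the four fields appear in the order given in the description (form, lemma, major POS, word features); the helper names `open_text` and `iter_conll09_rows` are hypothetical.

```python
import gzip
import lzma


def open_text(path):
    """Open a .xz, .gz, or plain file as UTF-8 text with linefeed-only lines."""
    path = str(path)
    if path.endswith(".xz"):
        return lzma.open(path, "rt", encoding="utf-8", newline="\n")
    if path.endswith(".gz"):
        return gzip.open(path, "rt", encoding="utf-8", newline="\n")
    return open(path, "r", encoding="utf-8", newline="\n")


def iter_conll09_rows(path):
    """Yield (form, lemma, pos, feats) tuples, one per line of the wordlist."""
    with open_text(path) as handle:
        for line in handle:
            # Every field is non-empty, so a plain tab split is sufficient.
            form, lemma, pos, feats = line.rstrip("\n").split("\t")
            yield form, lemma, pos, feats
```

Because the function is a generator, a loop such as `for form, lemma, pos, feats in iter_conll09_rows(path): ...` processes the wordlist one line at a time rather than loading the whole decompressed file into memory.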
Name
morfflex-cz.2013-05-02.utf8.lemmaID_suff-tag-form.tab.csv.gz
Size
423.64 MB
Format
application/x-gzip
Description
Full (morphologically analyzed) wordlist for the Czech language, with lemma (which includes the sense suffix (-<number>) and the semantic/syntactic suffix (format: _<letter>), but not the comment suffix consisting of an underscore and a parenthesized expression), full positional tag, and form (3 fields). Fields are tab-separated and always filled with a non-empty string; lines end with a linefeed only, and the encoding is UTF-8.
MD5
b12b5513734f3585a1c7ad8436c73fec
 Download file