Files in this item
This item is
Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Publicly Available
and licensed under:Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
- Name
- morfflex-cz.2013-11-12.utf8.conll09.tab.csv.xz
- Size
- 270.94 MB
- Format
- application/x-xz
- Description
- Full (morphologically analyzed) wordlist for Czech language, with form, lemma (without sense suffix and without semantic/synt. suffixes), CoNLL-2009 Shared Task format major POS and CoNLL-2009 Shared Task Word Features. Fields are tab separated, always filles by non-empty string, lines end with linefeed only, and coding is UTF-8.
- MD5
- 43bbf20579c759e7e40ac3c40b58abc1
- Name
- morfflex-cz.2013-11-12.utf8.lemmaID_suff-tag-form.tab.csv.xz
- Size
- 226.83 MB
- Format
- application/x-xz
- Description
- Full (morphologically analyzed) wordlist for Czech language, with lemma (which includes sense suffix (-<number>) and semantic/synt. suffixes and comments in PDT format, full positional tag in PDT format, and form (3 fields). Fields are tab separated, always filled by non-empty string, lines end with linefeed only, and coding is UTF-8.
- MD5
- f3c84f257bd47cea5c4b405406be559d
- Name
- morfflex-cz.2013-05-02.utf8.conll09.tab.csv.gz
- Size
- 458.97 MB
- Format
- application/x-gzip
- Description
- Full (morphologically analyzed) wordlist for Czech language, with form, lemma (without sense suffix and without semantic/synt. suffixes), CoNLL-2009 Shared Task format major POS and CoNLL-2009 Shared Task Word Features. Fields are tab separated, always filles by non-empty string, lines end with linefeed only, and coding is UTF-8.
- MD5
- f24291b8c910b8dd56c31c878fdcbed6
- Name
- morfflex-cz.2013-05-02.utf8.lemmaID_suff-tag-form.tab.csv.gz
- Size
- 423.64 MB
- Format
- application/x-gzip
- Description
- Full (morphologically analyzed) wordlist for Czech language, with lemma (which includes sense suffix (-<number>) and semantic/synt. suffix (format: _<letter>) (but not the comment suffix consisting of an underscore and a parenthesized expression), full positional tag, and form (3 fields). Fields are tab separated, always filled by non-empty string, lines end with linefeed only, and coding is UTF-8.
- MD5
- b12b5513734f3585a1c7ad8436c73fec