This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.

DeriNet 1.2

Please use the following text to cite this item or export to a predefined format:
Vidra, Jonáš; Žabokrtský, Zdeněk; Ševčíková, Magda and Straka, Milan, 2016, DeriNet 1.2, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-1807.
Date issued
2016-10-20
Size
1003590 entries,
91001394 words
Language(s)
Description
DeriNet is a lexical network which models derivational relations in the lexicon of Czech. Nodes of the network correspond to Czech lexemes (i.e. single lemmas, possibly with only a subset of their senses), edges represent derivational relations between a derived word and its base word. The present version, DeriNet 1.2, contains 1,003,590 lexemes (sampled from the MorfFlex dictionary) with 1,001,394 unique lemmas, connected by 740,750 derivational links. Both rather technical and linguistic changes were made as compared to the previous version of the data; e.g. new version of the MorfFlex dictionary was used, derived words that contain a consonant and/or vowel alternation (e.g. boží) were connected with their base word (bůh).
Acknowledgement
This item isPublicly Available
and licensed under:
 Files in this item
Name
derinet-1-2.tsv
Size
46.56 MB
Format
application/octet-stream
Description
DeriNet 1.2
MD5
9206e46148f21d89325fb62397bd141c
Preview
  File Preview