DeriNet 2.3
Please use the following text to cite this item or export to a predefined format:
Olbrich, Michal; et al., 2025,
DeriNet 2.3, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-5846.
Authors
Olbrich, Michal ; et al.
Item identifier
Project URL
Date issued
2025-01-29
Size
1040126 words
Language(s)
Description
DeriNet is a lexical network modeling derivational and compositional relations in Czech. The nodes of the network represent Czech lexemes, while the edges capture word-formational relations between derived words and their base word(s). The current version, DeriNet 2.3, introduces several key improvements over version 2.2:
(a) the set of 1,040,126 lexemes is aligned with the latest version of MorfFlex CZ (version 2.1),
(b) 5,781 derivational trees containing loanwords are enriched with etymological information specifying their origins, adopted from the Czech Etymological Lexicon,
(c) 8,867 new derivational and 1,262 new compound relations have been identified, resulting in a total of 791,771 derivational and 7,598 compound relations, and
(d) the morphological segmentation and classification of morphs have been significantly enhanced.
Acknowledgement
Ministerstvo školství, mládeže a tělovýchovy České republiky
Project code:LM2023062
Project name:LINDAT/CLARIAH-CZ: Digitální výzkumná infrastruktura pro jazykové technologie, umění a humanitní vědy
Subject(s)
Collections
Version History
You are currently viewing version 8 of the item.
This item isPublicly Available
and licensed under:
Files in this item
- Name
- derinet-2-3.tsv
- Size
- 418.72 MB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- 6e2e4349aa1870602ec9d2df82958dbc

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz

