
 
dc.contributor.author Svoboda, Emil
dc.contributor.author Vidra, Jonáš
dc.contributor.author Ševčíková, Magda
dc.contributor.author Žabokrtský, Zdeněk
dc.date.accessioned 2024-07-03T12:14:35Z
dc.date.available 2024-07-03T12:14:35Z
dc.date.issued 2024-06-25
dc.identifier.uri http://hdl.handle.net/11234/1-5538
dc.description DeriNet is a lexical network which models derivational and compositional relations in the lexicon of Czech. Nodes of the network correspond to Czech lexemes, while edges represent word-formational relations between a derived word and its base word or words. The present version, DeriNet 2.2, contains 1,040,127 lexemes (sampled from the MorfFlex CZ 2.0 dictionary), connected by 782,904 derivational, 50,511 orthographic variant, 6,336 compounding, 288 univerbation, and 135 conversion relations. Compared to the previous version, version 2.2 contains an overhaul of the compounding annotation scheme, 4,384 extra compounds, 83 more affixoid lexemes serving as bases for compounding, more parts of speech serving as bases for compounding (adverbs, pronouns, numerals), and several minor corrections of derivational relations.
dc.language.iso ces
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.source.uri https://ufal.mff.cuni.cz/derinet
dc.subject derivation
dc.subject compounding
dc.subject word formation
dc.title DeriNet 2.2
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.mediaType text
metashare.ResourceInfo#ContentInfo.detailedType wordnet
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIAH-CZ
demo.uri https://quest.ms.mff.cuni.cz/derisearch2/v2/databases/
contact.person Emil Svoboda svoboda@ufal.mff.cuni.cz Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
size.info 1039842 words
size.info 1039842 entries
files.size 449344585
files.count 1


 Files in this item

Name derinet-2-2.tsv
Size 428.53 MB
Format Unknown
Description DeriNet 2.2
MD5 c094f4270fdef364e52cad9854bb3a03
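The MD5 checksum and file size listed for derinet-2-2.tsv can be used to check that a download is complete. The following is a minimal sketch, not part of the record itself: the local path, the function names, and the reading of files.size as a byte count (which matches the listed 428.53 MB) are assumptions made for illustration.

```python
import hashlib
from pathlib import Path

# Values copied from the record above.
EXPECTED_MD5 = "c094f4270fdef364e52cad9854bb3a03"
EXPECTED_SIZE = 449344585  # files.size, assumed to be in bytes


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5=EXPECTED_MD5, expected_size=EXPECTED_SIZE):
    """Return True if the file has the expected size and MD5 digest."""
    path = Path(path)
    if path.stat().st_size != expected_size:
        return False  # cheap size check first, before hashing the whole file
    return md5_of_file(path) == expected_md5


if __name__ == "__main__":
    local_copy = Path("derinet-2-2.tsv")  # hypothetical download location
    if local_copy.exists():
        print("checksum OK" if verify_download(local_copy) else "checksum MISMATCH")
    else:
        print(f"{local_copy} not found; download it from the record first")
```

Checking the size before hashing avoids reading the whole file when a download was truncated.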
