Show simple item record Vidra, Jonáš Žabokrtský, Zdeněk Kyjánek, Lukáš Ševčíková, Magda Dohnalová, Šárka 2019-06-04T09:19:33Z 2019-06-04T09:19:33Z 2019-05-30
dc.description DeriNet is a lexical network which models derivational relations in the lexicon of Czech. Nodes of the network correspond to Czech lexemes, while edges represent derivational or compositional relations between a derived word and its base word / words. The present version, DeriNet 2.0, contains 1,027,665 lexemes (sampled from the MorfFlex dictionary) connected by 808682 derivational and 600 compositional links. Compared to previous versions, version 2.0 uses a new format and contains new types of annotations: compounding, annotation of several morphological and other categories of lexemes, identification of root morphs of 244,198 lexemes, semantic labelling of 151,005 relations using five labels and identification of 13 fictitious lexemes.
dc.language.iso ces
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.rights Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
dc.subject DeriNet
dc.subject derivation
dc.subject derivational morphology
dc.subject lexical network
dc.subject MorfFlex
dc.title DeriNet 2.0
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.mediaType text
metashare.ResourceInfo#ContentInfo.detailedType wordnet
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIN
contact.person Jonáš Vidra Charles University in Prague, ÚFAL
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky LM2015071 LINDAT/CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat nationalFunds
sponsor Grantová agentura České Republiky 19-14534S Popis slovotvorné struktury českých slov na základě jazykových dat nationalFunds
sponsor Charles University Grant Agency 1176219 Developing derivational networks for multiple languages nationalFunds 1027665 entries 1024922 words
files.size 153312597
files.count 1

 Files in this item

This item is
Publicly Available
and licensed under:
Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Distributed under Creative Commons Attribution Required Noncommercial Share Alike
146.21 MB
A tab-separated-values version of DeriNet 2.0, encoded as UTF-8, with Unix line endings. See the project homepage for documentation of the columns.
 Download file

Show simple item record