dc.contributor.author Vidra, Jonáš
dc.contributor.author Žabokrtský, Zdeněk
dc.contributor.author Kyjánek, Lukáš
dc.contributor.author Ševčíková, Magda
dc.contributor.author Dohnalová, Šárka
dc.date.accessioned 2019-06-04T09:19:33Z
dc.date.available 2019-06-04T09:19:33Z
dc.date.issued 2019-05-30
dc.identifier.uri http://hdl.handle.net/11234/1-2995
dc.description DeriNet is a lexical network that models derivational relations in the lexicon of Czech. Nodes of the network correspond to Czech lexemes, while edges represent derivational or compositional relations between a derived word and its base word or words. The present version, DeriNet 2.0, contains 1,027,665 lexemes (sampled from the MorfFlex dictionary) connected by 808,682 derivational and 600 compositional links. Compared to previous versions, version 2.0 uses a new format and contains new types of annotations: compounding, annotation of several morphological and other categories of lexemes, identification of root morphs of 244,198 lexemes, semantic labelling of 151,005 relations using five labels, and identification of 13 fictitious lexemes.
dc.language.iso ces
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.relation.replaces http://hdl.handle.net/11234/1-2873
dc.rights Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/
dc.source.uri https://ufal.mff.cuni.cz/derinet
dc.subject DeriNet
dc.subject derivation
dc.subject derivational morphology
dc.subject lexical network
dc.subject MorfFlex
dc.title DeriNet 2.0
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.mediaType text
metashare.ResourceInfo#ContentInfo.detailedType wordnet
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIN
demo.uri https://ufal.mff.cuni.cz/derinet/search
contact.person Jonáš Vidra vidra@ufal.mff.cuni.cz Charles University in Prague, ÚFAL
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky LM2015071 LINDAT/CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat nationalFunds
sponsor Grantová agentura České Republiky 19-14534S Popis slovotvorné struktury českých slov na základě jazykových dat nationalFunds
sponsor Charles University Grant Agency 1176219 Developing derivational networks for multiple languages nationalFunds
size.info 1027665 entries
size.info 1024922 words
files.size 153312597
files.count 1


 Files in this item

This item is Publicly Available and licensed under: Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Name derinet-2-0.tsv
Size 146.21 MB
Format Unknown
Description A tab-separated-values version of DeriNet 2.0, encoded as UTF-8, with Unix line endings. See the project homepage for documentation of the columns.
MD5 1ea9bc62699c96b00f52a25198f6d4ed
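
The size and MD5 values listed in this record can be used to check a downloaded copy before working with it. The following is a minimal Python sketch, not part of the DeriNet distribution: the function names (md5_of_file, verify_download, iter_rows) are hypothetical, and the local path is assumed to be wherever the file was saved. Because the column layout is documented only on the project homepage, the row reader deliberately yields raw tab-separated fields without interpreting them.

```python
import csv
import hashlib
from pathlib import Path

# Values copied from this record (MD5 and files.size); nothing else is assumed.
EXPECTED_MD5 = "1ea9bc62699c96b00f52a25198f6d4ed"
EXPECTED_SIZE = 153312597  # bytes


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5=EXPECTED_MD5, expected_size=EXPECTED_SIZE):
    """Return True only if both the byte size and the MD5 digest match."""
    path = Path(path)
    if path.stat().st_size != expected_size:
        return False  # cheap check first; skips hashing a truncated file
    return md5_of_file(path) == expected_md5


def iter_rows(path):
    """Yield each non-empty line as a list of tab-separated fields.

    The file is described as UTF-8 with Unix line endings. Quote
    characters are treated as ordinary text (csv.QUOTE_NONE), and the
    meaning of the columns is left to the project documentation.
    """
    with open(path, encoding="utf-8", newline="") as handle:
        reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        for row in reader:
            if row:  # the csv module returns [] for blank lines
                yield row
```

A typical use would be to call verify_download("derinet-2-0.tsv") once after downloading and, if it returns True, to loop over iter_rows("derinet-2-0.tsv"). MD5 is suitable here only for detecting corrupted or incomplete transfers, not as a security guarantee.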