Hajič, Jan; Hlaváčová, Jaroslava; Mikulová, Marie; Straka, Milan; Štěpánková, Barbora 2021-01-11T19:42:13Z 2021-01-11T19:42:13Z 2020-12-07
dc.description MorfFlex CZ 2.0 is the Czech morphological dictionary originally developed by Jan Hajič as a spelling checker and lemmatization dictionary. MorfFlex is a flat list of lemma-tag-wordform triples. For each wordform, full inflectional information is coded in a positional tag. Wordforms are organized into entries (paradigm instances, or paradigms for short) according to their formal morphological behavior. Each paradigm (a set of wordforms) is identified by a unique lemma. Apart from traditional morphological categories, the description also contains some semantic, stylistic and derivational information. For more details, see the comprehensive specification of the Czech morphological annotation.
dc.language.iso ces
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.subject morphological dictionary
dc.subject morphology
dc.subject Czech
dc.title MorfFlex CZ 2.0
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContactInfo#PersonInfo.surname Hajič
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName Jan
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName Charles University in Prague, UFAL
metashare.ResourceInfo#DistributionInfo.availability restrictedUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse academic-nonCommercialUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse attribution
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse shareAlike
metashare.ResourceInfo#DistributionInfo#LicenseInfo.distributionAccessMedium hardDisk
metashare.ResourceInfo#ValidationInfo.validated True
metashare.ResourceInfo#ContentInfo.mediaType text
metashare.ResourceInfo#TextInfo#SizeInfo.size 113537915
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit lexicalTypes
metashare.ResourceInfo#ContentInfo.detailedType computationalLexicon
dc.rights.label PUB
has.files yes
contact.person Milan Straka Charles University in Prague, UFAL
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky LM2015071 LINDAT/CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat nationalFunds
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky CZ.02.1.01/0.0/0.0/16_013/0001781 LINDAT/CLARIN - Výzkumná infrastruktura pro jazykové technologie - rozšíření repozitáře a výpočetní kapacity nationalFunds
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky LM2018101 LINDAT/CLARIAH-CZ: Digitální výzkumná infrastruktura pro jazykové technologie, umění a humanitní vědy nationalFunds
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky CZ.02.1.01/0.0/0.0/18_046/0015782 LINDAT/CLARIAH-CZ-EXTENSION Rozšíření repozitáře, služeb a výpočetního klastru výzkumné infrastruktury nationalFunds
125348899 entries
files.size 246247916
files.count 1

 Files in this item

234.84 MB
Morphological dictionary of the Czech language, consisting of triples: lemma (which includes a sense suffix (-<number>) and semantic/syntactic suffixes and comments in PDT format), full positional tag in PDT format, and word form. Fields are tab-separated and always contain a non-empty string, lines end with a linefeed only, and the encoding is UTF-8.
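A minimal sketch of how a file in the format described above might be read, assuming the columns appear in the stated order (lemma, tag, form). The names `Entry`, `parse_lines`, `group_by_lemma` and `read_dictionary` are hypothetical helpers introduced here for illustration and are not part of the MorfFlex distribution. The sketch does not interpret the lemma suffixes or the positions of the tag; it only splits the fields and groups word forms under their lemma.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, NamedTuple, Tuple


class Entry(NamedTuple):
    lemma: str  # lemma, possibly with sense suffix and comments
    tag: str    # positional tag
    form: str   # inflected word form


def parse_lines(lines: Iterable[str]) -> Iterator[Entry]:
    """Yield one Entry per line of tab-separated lemma/tag/form text."""
    for number, line in enumerate(lines, start=1):
        fields = line.rstrip("\n").split("\t")
        # The description promises exactly three non-empty fields.
        if len(fields) != 3 or not all(fields):
            raise ValueError(
                f"line {number}: expected 3 non-empty tab-separated fields"
            )
        yield Entry(*fields)


def group_by_lemma(entries: Iterable[Entry]) -> Dict[str, List[Tuple[str, str]]]:
    """Collect (tag, form) pairs under their lemma, i.e. one list per paradigm."""
    paradigms: Dict[str, List[Tuple[str, str]]] = defaultdict(list)
    for entry in entries:
        paradigms[entry.lemma].append((entry.tag, entry.form))
    return dict(paradigms)


def read_dictionary(path: str) -> Iterator[Entry]:
    """Stream entries from an uncompressed UTF-8 file (path is a placeholder)."""
    # newline="\n" keeps linefeed as the only line terminator.
    with open(path, encoding="utf-8", newline="\n") as handle:
        yield from parse_lines(handle)


if __name__ == "__main__":
    # Placeholder lines for illustration only, not real dictionary entries.
    sample = [
        "lemma-1\tTAG_A\tform1\n",
        "lemma-1\tTAG_B\tform2\n",
        "lemma-2\tTAG_A\tform3\n",
    ]
    for lemma, forms in group_by_lemma(parse_lines(sample)).items():
        print(lemma, forms)
```

Because the full list is large, `read_dictionary` streams entries one at a time; `group_by_lemma` holds everything in memory, so it is better suited to a filtered subset than to the whole file.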