dc.contributor.author | Hajič, Jan |
dc.contributor.author | Hlaváčová, Jaroslava |
dc.contributor.author | Mikulová, Marie |
dc.contributor.author | Straka, Milan |
dc.contributor.author | Štěpánková, Barbora |
dc.date.accessioned | 2021-01-11T19:42:13Z |
dc.date.available | 2021-01-11T19:42:13Z |
dc.date.issued | 2020-12-07 |
dc.identifier.uri | http://hdl.handle.net/11234/1-3186 |
dc.description | MorfFlex CZ 2.0 is the Czech morphological dictionary developed originally by Jan Hajič as a spelling checker and lemmatization dictionary. MorfFlex is a flat list of lemma-tag-wordform triples. For each wordform, full inflectional information is coded in a positional tag. Wordforms are organized into entries (paradigm instances or paradigms in short) according to their formal morphological behavior. The paradigm (set of wordforms) is identified by a unique lemma. Apart from traditional morphological categories, the description also contains some semantic, stylistic and derivational information. For more details see a comprehensive specification of the Czech morphological annotation http://ufal.mff.cuni.cz/techrep/tr64.pdf . |
dc.language.iso | ces |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.relation.replaces | http://hdl.handle.net/11234/1-1834 |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.source.uri | http://ufal.mff.cuni.cz/morfflex |
dc.subject | morphological dictionary |
dc.subject | morphology |
dc.subject | Czech |
dc.title | MorfFlex CZ 2.0 |
dc.type | lexicalConceptualResource |
metashare.ResourceInfo#ContactInfo#PersonInfo.surname | Hajič |
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName | Jan |
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName | Charles University in Prague, UFAL |
metashare.ResourceInfo#DistributionInfo.availability | restrictedUse |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse | academic-nonCommercialUse |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse | attribution |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse | shareAlike |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.distributionAccessMedium | hardDisk |
metashare.ResourceInfo#ValidationInfo.validated | True |
metashare.ResourceInfo#ContentInfo.mediaType | text |
metashare.ResourceInfo#TextInfo#SizeInfo.size | 113537915 |
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit | lexicalTypes |
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email | hajic@ufal.mff.cuni.cz |
metashare.ResourceInfo#ContentInfo.detailedType | computationalLexicon |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
contact.person | Milan Straka straka@ufal.mff.cuni.cz Charles University in Prague, UFAL |
sponsor | Ministerstvo školství, mládeže a tělovýchovy České republiky LM2015071 LINDAT/CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat nationalFunds |
sponsor | Ministerstvo školství, mládeže a tělovýchovy České republiky CZ.02.1.01/0.0/0.0/16_013/0001781 LINDAT/CLARIN - Výzkumná infrastruktura pro jazykové technologie - rozšíření repozitáře a výpočetní kapacity nationalFunds |
sponsor | Ministerstvo školství, mládeže a tělovýchovy České republiky LM2018101 LINDAT/CLARIAH-CZ: Digitální výzkumná infrastruktura pro jazykové technologie, umění a humanitní vědy nationalFunds |
sponsor | Ministerstvo školství, mládeže a tělovýchovy České republiky CZ.02.1.01/0.0/0.0/18_046/0015782 LINDAT/CLARIAH-CZ-EXTENSION Rozšíření repozitáře, služeb a výpočetního klastru výzkumné infrastruktury nationalFunds |
size.info | 125348899 entries |
files.size | 246247916 |
files.count | 1 |
Soubory tohoto záznamu
Licenční kategorie:
Licence: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Publicly Available
Licence: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
- Název
- czech-morfflex-2.0.tsv.xz
- Velikost
- 234.84 MB
- Formát
- application/x-xz
- Popis
- Morphological dictionary of Czech language, consisting of triples lemma (which includes sense suffix (-<number>) and semantic/synt. suffixes and comments in PDT format), full positional tag in PDT format, and form. Fields are tab separated, always filled by non-empty string, lines end with linefeed only, and coding is UTF-8.
- MD5
- 7181c3dd89f605a47b32838651feeb93