Show simple item record

 
dc.contributor.author Straka, Milan
dc.contributor.author Straková, Jana
dc.date.accessioned 2016-03-23T17:25:57Z
dc.date.available 2016-03-23T17:25:57Z
dc.date.issued 2016-03-10
dc.identifier.uri http://hdl.handle.net/11234/1-1674
dc.description Czech models for MorphoDiTa, providing morphological analysis, morphological generation and part-of-speech tagging. The morphological dictionary is created from MorfFlex CZ 160310 and the PoS tagger is trained on Prague Dependency Treebank 3.0 (PDT).
dc.description.sponsorship This work has been using language resources developed and/or stored and/or distributed by the LINDAT/CLARIN project of the Ministry of Education of the Czech Republic (project LM2010013). The Czech morphologic system was devised by Jan Hajič. The MorfFlex CZ dictionary was created by Jan Hajič and Jaroslava Hlaváčová. The morphologic guesser research was supported by the projects 1ET101120503 and 1ET101120413 of Academy of Sciences of the Czech Republic and 100008/2008 of Charles University Grant Agency. The research was performed by Jan Hajič, Jaroslava Hlaváčová and David Kolovratník. The tagger algorithm and feature set research was supported by the projects MSM0021620838 and LC536 of Ministry of Education, Youth and Sports of the Czech Republic, GA405/09/0278 of the Grant Agency of the Czech Republic and 1ET101120503 of Academy of Sciences of the Czech Republic. The research was performed by Drahomíra "johanka" Spoustová, Jan Hajič, Jan Raab and Miroslav Spousta. The tagger is trained on morphological layer of Prague Dependency Treebank PDT 2.5, which was supported by the projects LM2010013, LC536, LN00A063 and MSM0021620838 of Ministry of Education, Youth and Sports of the Czech Republic, and developed by Martin Buben, Jan Hajič, Jiří Hana, Hana Hanová, Barbora Hladká, Emil Jeřábek, Lenka Kebortová, Kristýna Kupková, Pavel Květoň, Jiří Mírovský, Andrea Pfimpfrová, Jan Štěpánek and Daniel Zeman.
dc.language.iso ces
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.relation.replaces http://hdl.handle.net/11858/00-097C-0000-0023-68D8-1
dc.relation.isreplacedby http://hdl.handle.net/11234/1-1836
dc.rights Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/
dc.source.uri http://ufal.mff.cuni.cz/morphodita/users-manual#czech-morfflex-pdt
dc.subject MorphoDiTa
dc.subject Czech
dc.subject morphological analysis
dc.subject morphological generation
dc.subject PoS tagging
dc.title Czech Models (MorfFlex CZ 160310 + PDT 3.0) for MorphoDiTa 160310
dc.type languageDescription
metashare.ResourceInfo#ContactInfo#PersonInfo.surname Straka
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName Milan
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName Charles University in Prague, UFAL
metashare.ResourceInfo#DistributionInfo.availability unrestrictedUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse academic-nonCommercialUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse attribution
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse shareAlike
metashare.ResourceInfo#ContentInfo.mediaType text
metashare.ResourceInfo#TextInfo#SizeInfo.size 68
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit mb
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email straka@ufal.mff.cuni.cz
metashare.ResourceInfo#ContentInfo.detailedType mlmodel
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIAH-CZ
demo.uri http://lindat.mff.cuni.cz/services/morphodita/
contact.person Milan Straka straka@ufal.mff.cuni.cz Charles University in Prague, UFAL
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky LM2010013 LINDAT/CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat nationalFunds
sponsor Grantová agentura Akademie věd České republiky 1ET101120503 Integrace jazykových zdrojů za účelem extrakce informací z přirozených textů nationalFunds
sponsor Grantová agentura Akademie věd České republiky 1ET101120413 Data a nástroje pro informační systémy nationalFunds
sponsor Grantová agentura Univerzity Karlovy v Praze GAUK 100008/2008 Zobecnění a reimplementace české morfologie nationalFunds
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky MSM 0021620838 Moderní metody, struktury a systémy informatiky nationalFunds
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky LC536 Centrum komputační lingvistiky nationalFunds
sponsor Grantová agentura České republiky GA405/09/0278 Internet jako jazykový korpus nationalFunds
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky LN00A063 Centrum komputační lingvistiky nationalFunds
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky MSM 0021620838 Moderní metody, struktury a systémy informatiky nationalFunds
size.info 68 mb
files.size 62771612
files.count 1


 Files in this item

This item is
Publicly Available
and licensed under:
Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Distributed under Creative Commons Attribution Required Noncommercial Share Alike
Icon
Name
czech-morfflex-pdt-160310.zip
Size
59.86 MB
Format
application/zip
Description
Czech Models (MorfFlex CZ 160310 + PDT 3.0) for MorphoDiTa 160310
MD5
3944b709764d59df1e1708f32385f9ff
 Download file  Preview
 File Preview  
  • czech-morfflex-pdt-160310
    • README.html14 kB
    • README10 kB
    • czech-morfflex-pdt-160310.tagger16 MB
    • czech-morfflex-pdt-160310-no_dia-pos_only.tagger9 MB
    • czech-morfflex-160310-pos_only.dict1 MB
    • czech-morfflex-160310-no_dia.dict3 MB
    • czech-morfflex-160310-no_dia-pos_only.dict2 MB
    • czech-morfflex-pdt-160310-no_dia.tagger20 MB
    • LICENSE21 kB
    • czech-morfflex-pdt-160310-pos_only.tagger3 MB
    • czech-morfflex-160310.dict2 MB

Show simple item record