
 
dc.contributor.author Rosa, Rudolf
dc.date.accessioned 2014-04-07T07:59:24Z
dc.date.available 2014-04-07T07:59:24Z
dc.date.issued 2014-04-05
dc.identifier.uri http://hdl.handle.net/11858/00-097C-0000-0023-7AEB-4
dc.description MSTperl is a Perl reimplementation of the MST parser of Ryan McDonald (http://www.seas.upenn.edu/~strctlrn/MSTParser/MSTParser.html). The MST parser (Maximum Spanning Tree parser) is a state-of-the-art natural language dependency parser -- a tool that takes a sentence and returns its dependency tree. MSTperl implements only part of the original functionality, with the following limitations: the parser is non-projective, currently with no way to enforce projectivity of the parse trees; only first-order features are supported, i.e. no second-order or third-order features are possible; MIRA is implemented as single-best MIRA, with a closed-form update instead of quadratic programming. On the other hand, the parser supports several advanced features: parallel features, i.e. enriching the parser input with a word-aligned sentence in another language; and large-scale information, i.e. enriching the feature set with features corresponding to the pointwise mutual information of word pairs in a large corpus (CzEng). The MSTperl parser is tuned for parsing Czech. Trained models are available for Czech, English and German. We can train the parser for other languages on demand, or you can train it yourself -- the guidelines are part of the documentation. The parser, together with detailed documentation, is available on CPAN (http://search.cpan.org/~rur/Treex-Parser-MSTperl/).
dc.description.sponsorship The research has been supported by the EU Seventh Framework Programme under grant agreement 247762 (Faust), and by the grants GAUK116310 and GA201/09/H057.
dc.language.iso ces
dc.language.iso eng
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.relation info:eu-repo/grantAgreement/EC/FP7/610516
dc.relation info:eu-repo/grantAgreement/EC/FP7/247762
dc.relation.isreplacedby http://hdl.handle.net/11234/1-1480
dc.rights Artistic License 2.0
dc.rights.uri http://opensource.org/licenses/Artistic-2.0
dc.source.uri https://ufal.mff.cuni.cz/tools/mstperl-parser
dc.subject parser
dc.subject NLP
dc.subject Treex
dc.subject parsing
dc.subject dependency
dc.title MSTperl parser
dc.type toolService
metashare.ResourceInfo#ContactInfo#PersonInfo.surname Rosa
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName Rudolf
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName Charles University in Prague, UFAL
metashare.ResourceInfo#DistributionInfo.availability unrestrictedUse
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email rosa@ufal.mff.cuni.cz
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
metashare.ResourceInfo#ContentInfo.detailedType tool
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIAH-CZ
sponsor European Union FP7-ICT-2013-10-610516 Quality Translation by Deep Language Engineering Approaches (QTLeap) euFunds info:eu-repo/grantAgreement/EC/FP7/610516
sponsor European Union FP7-ICT-2009-4-247762 Faust euFunds info:eu-repo/grantAgreement/EC/FP7/247762
sponsor Grantová agentura Univerzity Karlovy v Praze GAUK 116310/2010 English-Czech machine translation using deep syntax (Anglicko-český strojový překlad s využitím hloubkové syntaxe) nationalFunds
sponsor Grantová agentura České republiky GD201/09/H057 Res Informatica nationalFunds
files.size 262924
files.count 1
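
The dc.description field above mentions single-best MIRA with a closed-form update. As a rough illustration of that idea (not the MSTperl code itself, which is written in Perl and may differ in details), the following Python sketch applies the textbook single-best update: the step size is the loss minus the current margin, divided by the squared norm of the feature difference, clipped at zero. The function name, the dictionary-based feature representation, and the feature names in the usage example are assumptions made for this sketch.

```python
from collections import Counter


def mira_update(weights, gold_feats, pred_feats, loss):
    """Apply one single-best MIRA step to `weights` in place.

    weights     -- dict mapping feature name -> weight (hypothetical layout)
    gold_feats  -- dict mapping feature name -> count in the gold tree
    pred_feats  -- dict mapping feature name -> count in the predicted tree
    loss        -- non-negative loss of the predicted tree (e.g. wrong heads)
    Returns the step size tau that was used.
    """
    # Feature difference f(gold) - f(predicted).
    diff = Counter(gold_feats)
    diff.subtract(pred_feats)

    norm_sq = sum(v * v for v in diff.values())
    if norm_sq == 0:
        # Both trees have identical features; nothing to update.
        return 0.0

    # Current margin: score(gold) - score(predicted).
    margin = sum(weights.get(f, 0.0) * v for f, v in diff.items())

    # Closed-form step size, clipped at zero (no quadratic programming).
    tau = max(0.0, (loss - margin) / norm_sq)

    for f, v in diff.items():
        if v:
            weights[f] = weights.get(f, 0.0) + tau * v
    return tau


if __name__ == "__main__":
    # Illustrative feature names only; they are not MSTperl feature codes.
    w = {}
    gold = {"head=VERB|child=NOUN": 1, "dist=1": 1}
    pred = {"head=NOUN|child=NOUN": 1, "dist=1": 1}
    print(mira_update(w, gold, pred, loss=1.0), w)
```

After one step the gold tree outscores the predicted tree by exactly the loss, so repeating the same update changes nothing; that is the property the closed-form step is designed to give.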


 Files in this item

License category:
Publicly Available

License: Artistic License 2.0
Name: Treex-Parser-MSTperl-0.11949.tar.gz
Size: 256.76 KB
Format: application/x-gzip
Description: MSTperl parser
MD5: 40ffc1ccf7421ff6442c0485b10243eb
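
The MD5 value listed for the archive can be used to check a download for corruption. A minimal Python sketch follows, assuming the archive was saved under its listed name in the current directory; the helper names `md5_of_file` and `verify_download` are made up for this example. MD5 detects accidental corruption only and is not a defence against deliberate tampering.

```python
import hashlib

# MD5 checksum as listed in this record for the 0.11949 archive.
EXPECTED_MD5 = "40ffc1ccf7421ff6442c0485b10243eb"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """True if the file's MD5 matches the expected hex digest."""
    return md5_of_file(path) == expected.lower()
```

For example, `verify_download("Treex-Parser-MSTperl-0.11949.tar.gz")` should return `True` for an intact copy of the file.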
 File preview
  • Treex-Parser-MSTperl-0.11949
    • MANIFEST (2 kB)
    • README (433 B)
    • l (5 kB)
    • META.yml (712 B)
    • perlcritic.rc (8 kB)
    • Changes (803 B)
    • Makefile.PL (1 kB)
    • kos
      • Treex-Parser-MSTperl-0.09407.tar.gz (83 kB)
    • lib
      • Treex
        • Tool
          • Parser
            • MSTperl
              • TrainerLabelling.pm (25 kB)
              • ModelUnlabelled.pm (5 kB)
              • Parser.pm (5 kB)
              • ModelLabelling.pm (43 kB)
              • Reader.pm (2 kB)
              • ModelBase.pm (6 kB)
              • Labeller.pm (28 kB)
              • samples
                • treex_input.txt (813 B)
                • sample.config (5 kB)
                • sample_train.sh (73 B)
                • train_tsv.pl (850 B)
                • treex_parse.scen (721 B)
                • train_labeller_tsv.pl (1021 B)
                • sample_test.sh (68 B)
                • labeller_test.sh (82 B)
                • sample.model (299 kB)
                • test_tsv.pl (1 kB)
                • labeller_train.sh (86 B)
                • sample.model.tsv (376 kB)
                • test_labeller_tsv.pl (2 kB)
                • sample_test.tsv (2 kB)
                • sample_train.tsv (4 kB)
              • FeaturesControl.pm (56 kB)
              • Node.pm (5 kB)
              • Edge.pm (4 kB)
              • TrainerBase.pm (8 kB)
              • Sentence.pm (18 kB)
              • TrainerUnlabelled.pm (11 kB)
              • ModelAdditional.pm (7 kB)
              • Writer.pm (2 kB)
              • RootNode.pm (1 kB)
              • Config.pm (30 kB)
            • MSTperl.pm (10 kB)
    • LICENSE (18 kB)
    • t
      • sample_train.tsv (4 kB)
      • release-dist-manifest.t (310 B)
      • release-minimum-version.t (342 B)
      • sample.config (7 kB)
      • train_and_test.t (5 kB)
      • release-pod-linkcheck.t (509 B)
      • release-synopsis.t (307 B)
      • release-distmeta.t (301 B)
      • release-no-tabs.t (296 B)
      • author-critic.t (438 B)
      • release-portability.t (321 B)
      • release-test-version.t (577 B)
      • release-pod-coverage.t (501 B)
      • release-cpan-changes.t (313 B)
      • release-mojibake.t (318 B)
      • release-kwalitee.t (399 B)
      • release-unused-vars.t (293 B)
      • 00-compile.t (1 kB)
      • release-pod-syntax.t (296 B)
      • sample_test.tsv (2 kB)
      • author-test-eol.t (397 B)
