MSTperl parser

Please use the following text to cite this item:
Rosa, Rudolf, 2014, MSTperl parser, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11858/00-097C-0000-0023-7AEB-4.
Date issued
2014-04-05
Language(s)
Description
MSTperl is a Perl reimplementation of the MST parser of Ryan McDonald (http://www.seas.upenn.edu/~strctlrn/MSTParser/MSTParser.html). The MST parser (Maximum Spanning Tree parser) is a state-of-the-art natural language dependency parser: a tool that takes a sentence and returns its dependency tree. MSTperl implements only part of the original functionality. Its limitations include the following: the parser is non-projective, currently with no way to enforce projectivity of the parse trees; only first-order features are supported, i.e. no second-order or third-order features are possible; MIRA is implemented as single-best MIRA, with a closed-form update instead of quadratic programming. On the other hand, the parser supports several advanced features: parallel features, i.e. enriching the parser input with a word-aligned sentence in another language; and large-scale information, i.e. enriching the feature set with features corresponding to the pointwise mutual information of word pairs in a large corpus (CzEng). The MSTperl parser is tuned for parsing Czech. Trained models are available for Czech, English and German. We can train the parser for other languages on demand, or you can train it yourself; the guidelines are part of the documentation. The parser, together with detailed documentation, is available on CPAN (http://search.cpan.org/~rur/Treex-Parser-MSTperl/).
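To illustrate what a closed-form single-best MIRA update can look like, here is a minimal Python sketch. It is not taken from the MSTperl source: the function name mira_update, the feature strings, and the optional max_step clipping parameter are all hypothetical, and the step-size formula is the standard single-constraint solution, assumed here to match the description above. With only one constraint (gold tree versus the single best predicted tree), the step size can be computed directly, so no quadratic programming solver is needed.

```python
from collections import Counter


def mira_update(weights, gold_feats, pred_feats, loss, max_step=None):
    """One single-best MIRA step (illustrative sketch, not MSTperl's code).

    weights    -- dict mapping feature name to weight, updated in place
    gold_feats -- list of features fired by the gold dependency tree
    pred_feats -- list of features fired by the best predicted tree
    loss       -- e.g. the number of words with a wrong head
    max_step   -- optional upper bound on the step size (hypothetical)
    """
    # Feature difference vector: f(gold) - f(pred), kept sparse.
    diff = Counter(gold_feats)
    diff.subtract(pred_feats)
    diff = {f: v for f, v in diff.items() if v != 0}

    norm_sq = sum(v * v for v in diff.values())
    if norm_sq == 0:
        return 0.0  # identical feature vectors, nothing to update

    # Current margin: score(gold) - score(pred) under the current weights.
    margin = sum(weights.get(f, 0.0) * v for f, v in diff.items())

    # Closed-form step size for a single constraint.
    tau = max(0.0, (loss - margin) / norm_sq)
    if max_step is not None:
        tau = min(tau, max_step)

    for f, v in diff.items():
        weights[f] = weights.get(f, 0.0) + tau * v
    return tau


if __name__ == "__main__":
    w = {}
    gold = ["head=saw|dep=dog", "pos=VB>NN"]
    pred = ["head=dog|dep=saw", "pos=VB>NN"]
    step = mira_update(w, gold, pred, loss=1.0)
    print(step, w)
```

After one such step (without clipping), the gold tree outscores the predicted tree by exactly the loss, which is the smallest weight change that satisfies the margin constraint.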
Acknowledgement

Version History

Version | Date | Summary
2015-05-19 00:00:00
1*
2014-04-05 00:00:00
* Selected version
This item is Publicly Available
and licensed under:
 Files in this item
Name
Treex-Parser-MSTperl-0.11949.tar.gz
Size
256.76 KB
Format
application/x-gzip
Description
gzip Archive
MD5
40ffc1ccf7421ff6442c0485b10243eb