CoNLL 2009 Shared Task - Czech Data

Please use the following text to cite this item:
Hajič, Jan; Straňák, Pavel and Štěpánek, Jan, 2009, CoNLL 2009 Shared Task - Czech Data, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11858/00-097C-0000-0001-C6D1-9.
Date issued
2009-01-19
Language(s)
Description
Czech data for the CoNLL 2009 Shared Task: the training and test/evaluation sets, together with the valency dictionary. Documentation is included. The data are generated from PDT 2.0 (Prague Dependency Treebank 2.0). LDC catalog number: LDC2009E34B
Acknowledgement
This item is Publicly Available
and licensed under:
 Files in this item
Name
CoNLL2009-ST-Czech.zip
Size
14.84 MB
Format
application/zip
Description
All the Czech corpus files, the documentation, and the official eval script
MD5
dd1379817a8e96858651629cde616aac
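The MD5 value above can be used to check that the archive downloaded intact. The following is a minimal sketch, not part of the distributed package: the function names and the local path `CoNLL2009-ST-Czech.zip` (the current directory) are assumptions made for illustration; only the checksum itself comes from this record.

```python
import hashlib
from pathlib import Path

# Checksum as listed in this record for CoNLL2009-ST-Czech.zip.
EXPECTED_MD5 = "dd1379817a8e96858651629cde616aac"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so a large archive never sits in memory whole.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed download location (hypothetical): the current directory.
    archive = Path("CoNLL2009-ST-Czech.zip")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually points to an incomplete or corrupted download, in which case fetching the archive again is the simplest remedy.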
Preview
  File Preview
  • CoNLL2009-ST-Czech
    • CoNLL2009-ST-evaluation-Czech.txt (8 MB)
    • CoNLL2009-ST-Czech-trial.txt (398 kB)
    • CoNLL2009-ST-evaluation-Czech-ood.txt (3 MB)
    • CoNLL2009-ST-Czech-train.txt (81 MB)
    • Czech.vallex (1 MB)
    • CoNLL2009-ST-Czech-development.txt (10 MB)
    • task-description.html (34 kB)
    • style.css (3 kB)
    • logo_ufal_165_bluebg.png (1 kB)
    • README.TXT (3 kB)
    • paper-submission.html (7 kB)
    • scorer.html (10 kB)
    • naaclhlt2009-conll2009-st-overview-paper-reference.doc (47 kB)
    • eval09.pl (77 kB)
    • results.html (56 kB)
    • index.html (10 kB)