This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.

Gold Standard Reference Data for Multiword Expression Extraction: Czech Dependency Bigrams from the Prague Dependency Treebank

Please use the following text to cite this item or export to a predefined format:
Pecina, Pavel, 2008, Gold Standard Reference Data for Multiword Expression Extraction: Czech Dependency Bigrams from the Prague Dependency Treebank, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-1457.
Date issued
2008-12-01
Size
12232 items
Language(s)
Description
Annotated list of dependency bigrams occurring in the PDT more than five times and having part-of-speech patterns that can possibly form a collocation. Each bigram is assigned to one of the six MWE categories by three annotators.
This item isPublicly Available
and licensed under:
 Files in this item
Name
pdt-dep-gold-standard-1.0.tgz
Size
211.72 KB
Format
application/x-gzip
Description
Data package
MD5
44fb88e71c7c1a6550a55968ec497477
Preview
  File Preview
    • pdt-dep-gold-standard-1.0.tgz990 kB