PDT-Vallex: Czech Valency lexicon linked to treebanks

Please use the following text to cite this item:
Urešová, Zdeňka; Štěpánek, Jan; Hajič, Jan; Panevová, Jarmila and Mikulová, Marie, 2014, PDT-Vallex: Czech Valency lexicon linked to treebanks, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11858/00-097C-0000-0023-4338-F.
Date issued
2014-02-13
Size
7121 entries, 11933 frames
Language(s)
Czech
Description
The valency lexicon PDT-Vallex has been built in close connection with the annotation of the Prague Dependency Treebank (PDT) project and its successors (mainly the Prague Czech-English Dependency Treebank project, PCEDT). It contains over 11000 valency frames for more than 7000 verbs that occur in the PDT or PCEDT. It is available in an electronically processable format (XML) together with the aforementioned treebanks (to be viewed and edited in TrEd, the main PDT/PCEDT annotation tool), and also in a more human-readable form that includes corpus examples (see the WEBSITE link below). The main feature of the lexicon is its linking to the annotated corpora: each occurrence of each verb is linked to the appropriate valency frame, with additional (generalized) information about its usage and surface morphosyntactic form alternatives.
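Because the lexicon ships as one large XML file, a streaming pass is a convenient way to get a first look at its structure. The sketch below tallies element names without assuming anything about the PDT-Vallex schema; the function name `count_elements` is illustrative and not part of the distribution.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def count_elements(source):
    """Tally element names in an XML document using a streaming parse.

    `source` is a file path or a binary file-like object.
    No PDT-Vallex tag names are assumed here.
    """
    counts = Counter()
    for _event, elem in ET.iterparse(source, events=("end",)):
        # Drop any "{namespace}" prefix so the names are easier to read.
        counts[elem.tag.rsplit("}", 1)[-1]] += 1
        # Release the element's content once it has been counted.
        elem.clear()
    return counts
```

Calling `count_elements("pdt-vallex2.8.xml").most_common(10)` on the extracted file (the path is an assumption about where the archive was unpacked) would list the ten most frequent element names, which is usually enough to orient further processing.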
Files in this item
Name
pdt-vallex2.8.zip
Size
1.24 MB
Format
application/zip
Description
Zip
MD5
edd100c8806fc923fe12684a5e2b62a5
Preview
    • pdt-vallex2.8.xml (19 MB)
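
The MD5 value listed above can be used to check that the archive downloaded intact. The following is a minimal sketch using Python's standard `hashlib`; the helper names `md5_of_file` and `matches_expected`, and the local path passed to them, are assumptions for illustration.

```python
import hashlib

# MD5 checksum listed on this page for pdt-vallex2.8.zip.
EXPECTED_MD5 = "edd100c8806fc923fe12684a5e2b62a5"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so the whole file never has to sit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """Check a downloaded file against the published checksum."""
    return md5_of_file(path) == expected.lower()
```

For example, `matches_expected("pdt-vallex2.8.zip")` should return `True` for an uncorrupted download saved under that name in the current directory.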