The Prague Dependency Treebank – Consolidated 1.0
A richly annotated and genre-diversified language resource, The Prague Dependency Treebank – Consolidated 1.0 (PDT-C in the sequel) is a consolidated release of the existing PDT-corpora of Czech data, uniformly annotated using the standard PDT scheme.
More information about the corpus can be found on the PDT-C home page: https://ufal.mff.cuni.cz/pdt-c
PDT-corpora included in PDT-C:
- Prague Dependency Treebank (written texts); latest published version 3.5
- Czech part of Prague Czech-English Dependency Treebank (translated data), latest published version 2.0 and 2.0Coref
- Prague Dependency Treebank of Spoken Czech (spoken data); latest published version 2.0
- PDT-Faust (“user-generated“ texts), unpublished data