
Universal Dependencies 2.8.1

Please use the following text to cite this item:
Zeman, Daniel; et al., 2021, Universal Dependencies 2.8.1, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-3687.
Date issued
2021-05-16
Size
27066479 tokens,
27522188 words,
1575479 sentences
Description
Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008). Version 2.8.1 fixes a bug in 2.8 where a portion of the Dutch Alpino treebank was accidentally omitted.
Acknowledgement
This item is Publicly Available
and licensed under:
Files in this item
Name
ud-documentation-v2.8.tgz
Size
89.97 MB
Format
application/x-gzip
Description
Documentation
MD5
953bce3e033374da99456b2531f0810d
Name
ud-treebanks-v2.8.tgz
Size
410.63 MB
Format
application/x-gzip
Description
Treebank data
MD5
c5e5e30518fb98c3846f3f11af77e612
Name
ud-tools-v2.8.tgz
Size
533.49 KB
Format
application/x-gzip
Description
Tools
MD5
0fee55d28bc2b0698f17332ba7cccbb6
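The MD5 values listed for each file can be used to check that a download arrived intact. The following is a minimal sketch in Python, assuming the three archives were saved under their listed names in a single directory. The helper names `md5_of_file` and `verify_downloads` are illustrative and not part of the UD tools.

```python
import hashlib
from pathlib import Path

# MD5 values copied from the file listing of this item.
EXPECTED_MD5 = {
    "ud-documentation-v2.8.tgz": "953bce3e033374da99456b2531f0810d",
    "ud-treebanks-v2.8.tgz": "c5e5e30518fb98c3846f3f11af77e612",
    "ud-tools-v2.8.tgz": "0fee55d28bc2b0698f17332ba7cccbb6",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each listed file name to True (match), False (mismatch),
    or None (file not found in the assumed download directory)."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        if path.is_file():
            results[name] = (md5_of_file(path) == expected)
        else:
            results[name] = None
    return results


if __name__ == "__main__":
    labels = {True: "OK", False: "MISMATCH", None: "not found"}
    for name, ok in verify_downloads().items():
        print(f"{name}: {labels[ok]}")
```

Reading in chunks keeps memory use low for the larger treebank archive. A mismatch usually points to an incomplete or corrupted download, so fetching the file again is the first thing to try.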