This is not the latest version of this item.
Universal Dependencies 2.8.1
Please use the following text to cite this item:
Zeman, Daniel; et al., 2021,
Universal Dependencies 2.8.1, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-3687.
Authors
Zeman, Daniel; et al.
Date issued
2021-05-16
Size
27066479 tokens,
27522188 words,
1575479 sentences
Description
Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual parser development, cross-lingual learning, and parsing research from a language typology perspective. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008).
Version 2.8.1 fixes a bug in 2.8 where a portion of the Dutch Alpino treebank was accidentally omitted.
Acknowledgement
Ministry of Education, Youth and Sports of the Czech Republic (Ministerstvo školství, mládeže a tělovýchovy České republiky)
Project code: LM2018101
Project name: LINDAT/CLARIAH-CZ: Digital Research Infrastructure for Language Technologies, Arts and Humanities