IWPT 2020 Shared Task Data and System Outputs
Please use the following text to cite this item:
Zeman, Daniel; Bouma, Gosse and Seddah, Djamé, 2020,
IWPT 2020 Shared Task Data and System Outputs, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-3238.
Date issued
2020-06-11
Size
402297 sentences,
6627643 words,
6553940 tokens
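The three figures above can be recomputed from the treebank files. The following is a minimal sketch, assuming the files are in the CoNLL-U format used by Universal Dependencies and assuming the usual UD counting convention: a "word" is a row with an integer ID, a "token" is a surface token (a multiword-token range such as `2-3` counts once, and the words it covers do not count again), and empty nodes with decimal IDs such as `1.1` are ignored. The function name `count_conllu` and the sample text are hypothetical and are not taken from the package.

```python
def row(node_id, form):
    """Build a 10-column CoNLL-U row; only ID and FORM are filled in."""
    return "\t".join([node_id, form] + ["_"] * 8)


def count_conllu(text):
    """Return (sentences, words, tokens) for CoNLL-U formatted text."""
    sentences = words = tokens = 0
    covered_until = 0   # last word ID covered by the current multiword token
    in_sentence = False
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence, if there is one.
            if in_sentence:
                sentences += 1
            in_sentence = False
            covered_until = 0
            continue
        if line.startswith("#"):
            continue    # sentence-level comment such as "# text = ..."
        in_sentence = True
        node_id = line.split("\t", 1)[0]
        if "-" in node_id:
            # Multiword token range, e.g. "2-3": one surface token.
            tokens += 1
            covered_until = int(node_id.split("-")[1])
        elif "." in node_id:
            continue    # empty node (enhanced graph only), not counted
        else:
            words += 1
            if int(node_id) > covered_until:
                tokens += 1   # word that is also a surface token
    if in_sentence:
        sentences += 1        # file did not end with a blank line
    return sentences, words, tokens


# Hypothetical two-sentence sample with one multiword token and one empty node.
SAMPLE = "\n".join([
    "# text = Vamos al mar.",
    row("1", "Vamos"), row("2-3", "al"), row("2", "a"), row("3", "el"),
    row("4", "mar"), row("5", "."),
    "",
    "# text = Sí.",
    row("1", "Sí"), row("1.1", "_"), row("2", "."),
])

print(count_conllu(SAMPLE))
```

Whether the published totals were produced with exactly this convention is an assumption, so small differences from the numbers above would not necessarily indicate a corrupted download.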
Description
This package contains the data used in the IWPT 2020 shared task: training, development, and test (evaluation) datasets. The data is based on a subset of Universal Dependencies release 2.5 (http://hdl.handle.net/11234/1-3105), but some treebanks contain additional enhanced annotations. Not all of these additions became part of Universal Dependencies release 2.6 (http://hdl.handle.net/11234/1-3226), so the shared task data is unique and warrants a separate release that enables later comparison with new parsing algorithms. The package also contains a number of Perl and Python scripts that were used to process the data during preparation and during the shared task. Finally, the package includes the official primary submission of each team that participated in the shared task.
Acknowledgement
Ministry of Education, Youth and Sports of the Czech Republic
Project code: LM2018101
Project name: LINDAT/CLARIAH-CZ: Digital Research Infrastructure for Language Technologies, Arts and Humanities
Files in this item
- Name: iwpt2020stdata.tgz
- Size: 445.62 MB
- Format: application/x-gzip
- Description: gzip archive
- MD5: d56880d847e0f6b96e84c9bfd7af913d
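
The MD5 value listed above can be used to check that a downloaded copy of the archive is intact. The following is a minimal sketch using only the Python standard library; the helper names `md5_of_file` and `verify_archive` are hypothetical, and the local path passed to them is an assumption about where the file was saved.

```python
import hashlib

# MD5 checksum as published in the file listing for iwpt2020stdata.tgz.
EXPECTED_MD5 = "d56880d847e0f6b96e84c9bfd7af913d"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file at `path` matches the expected checksum."""
    return md5_of_file(path) == expected.lower()
```

For example, `verify_archive("iwpt2020stdata.tgz")` should return `True` for an uncorrupted download saved under that name in the current directory. Reading in chunks keeps memory use small for an archive of this size. Once the check passes, the archive can be unpacked with the standard `tarfile` module or with `tar -xzf`.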


