This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

IWPT 2020 Shared Task Data and System Outputs

Please use the following text to cite this item or export to a predefined format:
Zeman, Daniel; Bouma, Gosse and Seddah, Djamé, 2020, IWPT 2020 Shared Task Data and System Outputs, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-3238.
Date issued
2020-06-11
Size
402297 sentences,
6627643 words,
6553940 tokens
Description
This package contains data used in the IWPT 2020 shared task. It contains training, development and test (evaluation) datasets. The data is based on a subset of Universal Dependencies release 2.5 (http://hdl.handle.net/11234/1-3105) but some treebanks contain additional enhanced annotations. Moreover, not all of these additions became part of Universal Dependencies release 2.6 (http://hdl.handle.net/11234/1-3226), which makes the shared task data unique and worth a separate release to enable later comparison with new parsing algorithms. The package also contains a number of Perl and Python scripts that have been used to process the data during preparation and during the shared task. Finally, the package includes the official primary submission of each team participating in the shared task.
Acknowledgement
This item isPublicly Available
and licensed under:
 Files in this item
Name
iwpt2020stdata.tgz
Size
445.62 MB
Format
application/x-gzip
Description
gzip Archive
MD5
d56880d847e0f6b96e84c9bfd7af913d
Preview
  File Preview