This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.

IWPT 2021 Shared Task Data and System Outputs

Please use the following text to cite this item or export to a predefined format:
Zeman, Daniel; Bouma, Gosse and Seddah, Djamé, 2021, IWPT 2021 Shared Task Data and System Outputs, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-3728.
Date issued
2021-07-02
Size
407148 sentences,
6622267 tokens,
6696624 words
Description
This package contains data used in the IWPT 2021 shared task. It contains training, development and test (evaluation) datasets. The data is based on a subset of Universal Dependencies release 2.7 (http://hdl.handle.net/11234/1-3424) but some treebanks contain additional enhanced annotations. Moreover, not all of these additions became part of Universal Dependencies release 2.8 (http://hdl.handle.net/11234/1-3687), which makes the shared task data unique and worth a separate release to enable later comparison with new parsing algorithms. The package also contains a number of Perl and Python scripts that have been used to process the data during preparation and during the shared task. Finally, the package includes the official primary submission of each team participating in the shared task.
Acknowledgement
This item isPublicly Available
and licensed under:
 Files in this item
Name
iwpt2021stdata.tgz
Size
810.31 MB
Format
application/x-gzip
Description
Shared task data
MD5
138828c69cfcda8d544984e438065a91
Preview
  File Preview
    • iwpt2021stdata.tgz4 GB