CorPipe 23 multilingual CorefUD 1.2 model (corpipe23-corefud1.2-240906)
Please use the following text to cite this item or export to a predefined format:
Straka, Milan, 2024,
CorPipe 23 multilingual CorefUD 1.2 model (corpipe23-corefud1.2-240906), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-5673.
Authors
Item identifier
Project URL
Referenced by
Date issued
2024-09-06
Type
Description
The `corpipe23-corefud1.2-240906` is a `mT5-large`-based multilingual model for coreference resolution usable in CorPipe 23 <https://github.com/ufal/crac2023-corpipe>. It is released under the CC BY-NC-SA 4.0 license.
The model is language agnostic (no corpus id on input), so it can be in theory used to predict coreference in any `mT5` language. However, the model expects empty nodes to be already present on input, predicted by the https://www.kaggle.com/models/ufal-mff/crac2024_zero_nodes_baseline/.
This model was present in the CorPipe 24 paper as an alternative to a single-stage approach, where the empty nodes are predicted joinly with coreference resolution (via http://hdl.handle.net/11234/1-5672), an approach circa twice as fast but of slightly worse quality.
Acknowledgement
Grantová agentura České republiky
Project code:GX20-16819X
Project name:LUSyD – Language Understanding: from Syntax to Discourse
Subject(s)
Collections
This item isPublicly Available
and licensed under:
Files in this item
- Name
- corpipe23-corefud1.2-240906.zip
- Size
- 1.82 GB
- Format
- application/zip
- Description
- Zip
- MD5
- c27f81ef8b998588a0a79e80b05140ec

- corpipe23-corefud1.2-240906
- LICENSE20 kB
- README.md5 kB
- options.json2 kB
- model.h52 GB
- tags.txt1 kB

