This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

CorPipe 23 multilingual CorefUD 1.2 model (corpipe23-corefud1.2-240906)

Please use the following text to cite this item or export to a predefined format:
Straka, Milan, 2024, CorPipe 23 multilingual CorefUD 1.2 model (corpipe23-corefud1.2-240906), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-5673.
Date issued
2024-09-06
Description
The `corpipe23-corefud1.2-240906` is a `mT5-large`-based multilingual model for coreference resolution usable in CorPipe 23 <https://github.com/ufal/crac2023-corpipe>. It is released under the CC BY-NC-SA 4.0 license. The model is language agnostic (no corpus id on input), so it can be in theory used to predict coreference in any `mT5` language. However, the model expects empty nodes to be already present on input, predicted by the https://www.kaggle.com/models/ufal-mff/crac2024_zero_nodes_baseline/. This model was present in the CorPipe 24 paper as an alternative to a single-stage approach, where the empty nodes are predicted joinly with coreference resolution (via http://hdl.handle.net/11234/1-5672), an approach circa twice as fast but of slightly worse quality.
Acknowledgement
 Files in this item
Name
corpipe23-corefud1.2-240906.zip
Size
1.82 GB
Format
application/zip
Description
Zip
MD5
c27f81ef8b998588a0a79e80b05140ec
Preview
  File Preview