CorPipe 23 multilingual CorefUD 1.2 model (corpipe23-corefud1.2-240906)

PID

The corpipe23-corefud1.2-240906 is a mT5-large-based multilingual model for coreference resolution usable in CorPipe 23 . It is released under the CC BY-NC-SA 4.0 license.

The model is language agnostic (no corpus id on input), so it can be in theory used to predict coreference in any mT5 language. However, the model expects empty nodes to be already present on input, predicted by the https://www.kaggle.com/models/ufal-mff/crac2024_zero_nodes_baseline/.

This model was present in the CorPipe 24 paper as an alternative to a single-stage approach, where the empty nodes are predicted joinly with coreference resolution (via http://hdl.handle.net/11234/1-5672), an approach circa twice as fast but of slightly worse quality.

Identifier
PID http://hdl.handle.net/11234/1-5673
Related Identifier https://arxiv.org/abs/2410.02756
Related Identifier https://github.com/ufal/crac2023-corpipe
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11234/1-5673
Provenance
Creator Straka, Milan
Publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Publication Year 2024
Rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0); http://creativecommons.org/licenses/by-nc-sa/4.0/; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language Catalan; Valencian; Czech; Church Slavic; Old Slavonic; Church Slavonic; Old Bulgarian; Old Church Slavonic; German; English; Spanish; Castilian; French; Greek, Ancient (to 1453); Hungarian; Lithuanian; Bokmål, Norwegian; Norwegian Bokmål; Norwegian Nynorsk; Nynorsk, Norwegian; Polish; Russian; Turkish
Resource Type toolService
Format text/plain; charset=utf-8; application/zip; downloadable_files_count: 1
Discipline Linguistics