The CLASSLA-Stanza model for UD dependency parsing of standard Croatian 2.1

PID

The model for UD dependency parsing of standard Croatian was built with the CLASSLA-Stanza tool (https://github.com/clarinsi/classla) by training on the UD-parsed portion of the hr500k training corpus (http://hdl.handle.net/11356/1792) and using the CLARIN.SI-embed.hr word embeddings (http://hdl.handle.net/11356/1790). The estimated LAS of the parser is ~87.46.

The difference to the previous version of the model is that this version was trained using the new version of the hr500k corpus and the new version of the Croatian word embeddings.

Identifier
PID http://hdl.handle.net/11356/1836
Related Identifier https://aclanthology.org/W19-3704/
Related Identifier http://hdl.handle.net/11356/1259
Related Identifier https://github.com/clarinsi/classla
Metadata Access http://www.clarin.si/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:www.clarin.si:11356/1836
Provenance
Creator Terčon, Luka; Ljubešić, Nikola
Publisher Jožef Stefan Institute
Publication Year 2023
Rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0); https://creativecommons.org/licenses/by-sa/4.0/; PUB
OpenAccess true
Contact info(at)clarin.si
Representation
Language Croatian
Resource Type toolService
Format text/plain; charset=utf-8; application/zip; downloadable_files_count: 2
Discipline Linguistics