The CLASSLA-Stanza model for UD dependency parsing of standard Croatian 2.1

Dataset

PID

The model for UD dependency parsing of standard Croatian was built with the CLASSLA-Stanza tool (https://github.com/clarinsi/classla) by training on the UD-parsed portion of the hr500k training corpus (http://hdl.handle.net/11356/1792) and using the CLARIN.SI-embed.hr word embeddings (http://hdl.handle.net/11356/1790). The estimated LAS of the parser is ~87.46.

The difference to the previous version of the model is that this version was trained using the new version of the hr500k corpus and the new version of the Croatian word embeddings.

Identifier
PID	http://hdl.handle.net/11356/1836
Related Identifier	https://aclanthology.org/W19-3704/
Related Identifier	http://hdl.handle.net/11356/1259
Related Identifier	https://github.com/clarinsi/classla
Metadata Access	http://www.clarin.si/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:www.clarin.si:11356/1836

Provenance
Creator	Terčon, Luka; Ljubešić, Nikola
Publisher	Jožef Stefan Institute
Publication Year	2023
Rights	Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0); https://creativecommons.org/licenses/by-sa/4.0/; PUB
OpenAccess	true
Contact	info(at)clarin.si

Representation
Language	Croatian
Resource Type	toolService
Format	text/plain; charset=utf-8; application/zip; downloadable_files_count: 2
Discipline	Linguistics