CoNLL 2017 Shared Task - UDPipe Baseline Models and Supplementary Materials

PID

Baseline UDPipe models for CoNLL 2017 Shared Task in UD Parsing, and supplementary material.

The models require UDPipe version at least 1.1 and are evaluated using the official evaluation script.

The models are trained on a slightly different split of the official UD 2.0 CoNLL 2017 training data, so called baselinemodel split, in order to allow comparison of models even during the shared task. This baselinemodel split of UD 2.0 CoNLL 2017 training data is available for download.

Furthermore, we also provide UD 2.0 CoNLL 2017 training data with automatically predicted morphology. We utilize the baseline models on development data and perform 10-fold jack-knifing (each fold is predicted with a model trained on the rest of the folds) on the training data.

Finally, we supply all required data and hyperparameter values needed to replicate the baseline models.

Identifier
PID http://hdl.handle.net/11234/1-1990
Related Identifier http://ufal.mff.cuni.cz/udpipe
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11234/1-1990
Provenance
Creator Straka, Milan
Publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Publication Year 2017
Rights Licence Universal Dependencies v2.0; https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.0; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language Multiple languages
Resource Type languageDescription
Format text/plain; charset=utf-8; application/x-tar; application/x-xz; downloadable_files_count: 4
Discipline Linguistics