UDPipe 2

UDPipe 2 is a POS tagger, lemmatizer and dependency parser.

Compared to UDPipe 1:

  • UDPipe 2 is Python-only and tested only on Linux,

  • UDPipe 2 is meant as a research tool, not as a user-friendly UDPipe 1 replacement,

  • UDPipe 2 achieves much better accuracy, but requires a GPU to run at a reasonable speed,

  • UDPipe 2 does not perform tokenization by itself – it uses UDPipe 1 for that.

UDPipe 2 is available in the udpipe-2 branch of the UDPipe repository at https://github.com/ufal/udpipe/tree/udpipe-2. It is free software released under the Mozilla Public License 2.0 (http://www.mozilla.org/MPL/2.0/). The models are free for non-commercial use and distributed under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/4.0/), although for some models the original data used to create the model may impose additional licensing conditions.

UDPipe 2 is also available as a REST service running at https://lindat.mff.cuni.cz/services/udpipe. The https://github.com/ufal/udpipe/blob/udpipe-2/udpipe2_client.py script can be used to interact with it.
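
For readers who prefer to call the REST service directly, the following is a minimal Python sketch, not the official client. It rests on several assumptions that this record does not state: that the service exposes an `/api/process` endpoint under the URL above, that it accepts the form fields `data`, `model`, `tokenizer`, `tagger` and `parser`, that it returns a JSON object whose `result` field holds CoNLL-U text, and that `english` is an accepted model name. The helper names (`build_request`, `parse_conllu`, `process`) are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the service URL from the text plus an "/api/process" path.
SERVICE_URL = "https://lindat.mff.cuni.cz/services/udpipe/api/process"


def build_request(text, model="english"):
    """Build a POST request asking for tokenization, tagging and parsing."""
    fields = {
        "data": text,
        "model": model,   # assumed model name; check the service's model list
        "tokenizer": "",  # assumed: an empty value enables the component
        "tagger": "",
        "parser": "",
    }
    body = urllib.parse.urlencode(fields).encode("utf-8")
    return urllib.request.Request(SERVICE_URL, data=body, method="POST")


def parse_conllu(conllu):
    """Extract (form, lemma, upos, head, deprel) tuples from CoNLL-U text."""
    rows = []
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        rows.append((cols[1], cols[2], cols[3], cols[6], cols[7]))
    return rows


def process(text, model="english"):
    """Send text to the service and return token rows (needs network access)."""
    request = build_request(text, model)
    with urllib.request.urlopen(request, timeout=30) as response:
        payload = json.load(response)
    # Assumption: the response is a JSON object with a "result" field.
    return parse_conllu(payload["result"])
```

Calling `process("UDPipe 2 parses text.")` would then return one tuple per token, provided the assumptions above hold. Request construction and CoNLL-U parsing are kept separate from the network call so that each can be checked offline.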

Identifier
PID http://hdl.handle.net/11234/1-4816
Related Identifier https://ufal.mff.cuni.cz/udpipe/2
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11234/1-4816
Provenance
Creator Straka, Milan; Straková, Jana
Publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Publication Year 2022
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Resource Type toolService
Format downloadable_files_count: 0
Discipline Linguistics