Derinet 2.2

PID

DeriNet is a lexical network which models derivational and compositional relations in the lexicon of Czech. Nodes of the network correspond to Czech lexemes, while edges represent word-formational relations between a derived word and its base word / words.

The present version, DeriNet 2.2, contains: - 1,040,127 lexemes (sampled from the MorfFlex CZ 2.0 ​dictionary), connected by - 782,904 derivational, - 50,511 orthographic variant, - 6,336 compounding, - 288 univerbation, and - 135 conversion relations.

Compared to the previous version, version 2.1 contains an overhaul of the compounding annotation scheme, 4384 extra compounds, 83 more affixoid lexemes serving as bases for compounding, more parts of speech serving as bases for compounding (adverbs, pronouns, numerals), and several minor corrections of derivational relations.

Identifier
PID http://hdl.handle.net/11234/1-5538
Related Identifier http://hdl.handle.net/11234/1-3765
Related Identifier http://hdl.handle.net/11234/1-5846
Related Identifier https://ufal.mff.cuni.cz/derinet
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11234/1-5538
Provenance
Creator Svoboda, Emil; Vidra, Jonáš; Ševčíková, Magda; Žabokrtský, Zdeněk
Publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Publication Year 2024
Rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0); http://creativecommons.org/licenses/by-nc-sa/4.0/; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language Czech
Resource Type lexicalConceptualResource
Format application/octet-stream; downloadable_files_count: 1
Discipline Linguistics