List of formulaic sequences in standard written Slovenian

This document contains 1,891 formulaic sequences in standard written Slovenian, i.e. frequently recurring strings of two to five words, manually annotated for syntactic structure, pragmatic function, and dictionary relevance. The list is restricted to sequences with a minimum frequency of 20 per million and is based on the resource "Frequency lists of word-level n-grams from lowercase word forms in Gigafida 2.0" (http://hdl.handle.net/11356/1274). It contains the union of the top 1,000 formulaic sequences as ranked by frequency and by five association measures (Dice, t-test, MI, MI3, simple-LL).
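The selection procedure described above can be illustrated with a minimal sketch. It assumes the common "simple" two-word formulations of the five association measures, based on observed versus expected frequency. The exact formulas used for this resource, and their generalization to sequences of three to five words, are not stated in this record. All function names and the toy counts are hypothetical.

```python
import math


def per_million(count, corpus_tokens):
    """Relative frequency per million tokens (used for a threshold such as 20/million)."""
    return count / corpus_tokens * 1_000_000


def association_scores(o, f1, f2, n):
    """Simple association measures for a two-word sequence (assumed formulations).

    o      -- observed frequency of the sequence
    f1, f2 -- frequencies of the first and second word
    n      -- corpus size in tokens
    """
    e = f1 * f2 / n  # expected frequency if the two words were independent
    return {
        "frequency": o,
        "dice": 2 * o / (f1 + f2),
        "t_test": (o - e) / math.sqrt(o),
        "mi": math.log2(o / e),
        "mi3": math.log2(o ** 3 / e),
        "simple_ll": 2 * (o * math.log(o / e) - (o - e)),
    }


def union_of_top_k(scores_by_item, k):
    """Return the union of the top-k items under each measure."""
    measures = next(iter(scores_by_item.values())).keys()
    selected = set()
    for measure in measures:
        ranked = sorted(
            scores_by_item,
            key=lambda item: scores_by_item[item][measure],
            reverse=True,
        )
        selected.update(ranked[:k])
    return selected


# Toy counts, invented purely for illustration (not taken from Gigafida 2.0).
corpus_tokens = 1_000_000
toy_counts = {
    "seq_a": (120, 5000, 4000),  # (o, f1, f2)
    "seq_b": (30, 40, 35),
    "seq_c": (25, 90000, 80000),
}

# Keep only sequences at or above 20 per million, then score and select.
scored = {
    seq: association_scores(o, f1, f2, corpus_tokens)
    for seq, (o, f1, f2) in toy_counts.items()
    if per_million(o, corpus_tokens) >= 20
}
selected = union_of_top_k(scored, k=1)
```

Because each measure rewards different properties (raw frequency favours very common strings, while MI favours rare but exclusive ones), the union over several rankings is typically larger than any single top-k list. This is consistent with a final list of 1,891 items drawn from top-1,000 rankings.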

A related entry, "List of formulaic sequences in spoken Slovenian", is available at http://hdl.handle.net/11356/1279.

Identifier
PID http://hdl.handle.net/11356/1280
Related Identifier http://slovnica.ijs.si/wp-content/uploads/2019/12/NSSS_DS5-nizi_navodila_v6.pdf
Related Identifier http://slovnica.ijs.si/
Metadata Access http://www.clarin.si/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:www.clarin.si:11356/1280
Provenance
Creator Dobrovoljc, Kaja; Roblek, Rebeka; Vianello, Chiara; Diaci, Ajda; Vuga, Zala
Publisher Jožef Stefan Institute; Centre for Language Resources and Technologies, University of Ljubljana
Publication Year 2020
Rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0); https://creativecommons.org/licenses/by-sa/4.0/; PUB
OpenAccess true
Contact info(at)clarin.si
Representation
Language Slovenian; Slovene
Resource Type lexicalConceptualResource
Format application/octet-stream; text/plain; charset=utf-8; downloadable_files_count: 1
Discipline Linguistics