SIKOR Lule Saami free corpus

The SIKOR Lule Saami free corpus is a monolingual text corpus of Lule Saami containing news, administrative, legal, and religious texts. It was created by the Giellatekno and Divvun research groups at the Department of Linguistics, UiT The Arctic University of Norway, together with members of the language community. In particular, the following colleagues contributed to the creation of the resource: Ciprian Gerstenberger, Børre Gaup, Inga-Lill Mikkelsen, and Sandra Nystø Rahka. The data set (48,307 sentences; 535,367 tokens) is annotated with word form, lemma, morphosyntactic analysis, and dependency relations between tokens. The corpus was processed and linguistically analyzed automatically with the Giellatekno/Divvun tools, so it may contain incorrect annotations. If you find any errors, the creators would appreciate feedback sent to giellatekno@uit.no and feedback@divvun.no. Please note that the Giellatekno resources are dynamic in nature. To make sure you have the most recent version, please contact Giellatekno (see Contact Info in metadata).

Identifier
PID http://hdl.handle.net/11509/101
Related Identifier http://giellatekno.uit.no/index.eng.html
Metadata Access https://repo.clarino.uib.no/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:repo.clarino.uib.no:11509/101
Provenance
Creator Giellatekno - Saami Language Technology, UiT The Arctic University of Norway; The Divvun group at UiT The Arctic University of Norway
Publisher Giellatekno - Saami Language Technology
Publication Year 2015
Rights Creative Commons - Attribution 3.0 Unported (CC BY 3.0); http://creativecommons.org/licenses/by/3.0/; CC
OpenAccess true
Contact clarin(at)uib.no
Representation
Language Lule Sami
Resource Type corpus
Format application/zip; text/plain; charset=utf-8; text/xml; downloadable_files_count: 1
Discipline Linguistics
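
The Metadata Access link listed under Identifier has the shape of an OAI-PMH GetRecord request for the Dublin Core (oai_dc) record. The following is a minimal sketch of how such a request URL could be assembled and how Dublin Core fields could be read from a response. The function names `build_getrecord_url` and `dc_fields` are hypothetical helpers introduced for illustration, and the structure of the response is assumed to follow the standard OAI-PMH oai_dc layout; no network request is made here.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Base URL and identifier are taken from the Metadata Access field of this record.
OAI_BASE = "https://repo.clarino.uib.no/oai/request"
RECORD_ID = "oai:repo.clarino.uib.no:11509/101"

# Standard Dublin Core element namespace used by oai_dc records.
DC_NS = "http://purl.org/dc/elements/1.1/"


def build_getrecord_url(identifier, prefix="oai_dc", base=OAI_BASE):
    """Assemble an OAI-PMH GetRecord URL (hypothetical helper)."""
    query = urlencode(
        {"verb": "GetRecord", "metadataPrefix": prefix, "identifier": identifier},
        safe=":/",  # keep ':' and '/' unescaped, as in the link shown in the record
    )
    return f"{base}?{query}"


def dc_fields(xml_text):
    """Collect Dublin Core elements from an XML response into a dict of lists.

    Assumes the response embeds elements in the Dublin Core namespace;
    elements in other namespaces are ignored.
    """
    root = ET.fromstring(xml_text)
    fields = {}
    for element in root.iter():
        if element.tag.startswith("{" + DC_NS + "}"):
            name = element.tag.split("}", 1)[1]
            fields.setdefault(name, []).append((element.text or "").strip())
    return fields


url = build_getrecord_url(RECORD_ID)
print(url)
```

To inspect the live record, the printed URL could be fetched with any HTTP client and the response body passed to `dc_fields`.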