Lithuanian Hate Speech Corpus v.1

PID

This corpus consists of (1) examples of hate speech based on ethnicity, nationality, or race, and (2) a collection of neutral comments, including both general comments and comments mentioning nationality in a positive or neutral context. All comments are written in Lithuanian and collected from news portals and social networks. The corpus is intended for the development of hate speech detection solutions and research.

Identifier
PID http://hdl.handle.net/20.500.11821/69
Metadata Access https://clarin.vdu.lt/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:clarin.vdu.lt:20.500.11821/69
Provenance
Creator Butkienė, Rita; Edgaras, Dambrauskas; Algirdas, Šukys; Voldemaras, Žitkus
Publisher Kaunas University of Technology
Publication Year 2025
Rights PUB_CLARIN-LT_End-User-Licence-Agreement_EN-LT; PUB; https://clarin.vdu.lt/licenses/eula/PUB_CLARIN-LT_End-User-Licence-Agreement_EN-LT.htm
OpenAccess true
Contact info(at)clarin.vdu.lt
Representation
Language Lithuanian
Resource Type corpus
Format text/plain; charset=utf-8; application/pdf; application/zip; downloadable_files_count: 4
Discipline Linguistics