- DALC - Dutch Abusive Language Corpus
This repository contains the full text format of the Dutch Abusive Language Corpus (DALC), which is composed of tweets in Dutch. The corpus is structured as follows: unique...
- GermEval-2018 Corpus (DE)
This dataset comprises the training and test data (German tweets) from the GermEval 2018 Shared Task on Offensive Language Detection.