Dataset - B2FIND

GermEval-2018 Corpus (DE)

This dataset comprises the training and test data (German tweets) from the GermEval 2018 Shared Task on Offensive Language Detection.

DALC - Dutch Abusive Language Corpus

This repository contains the full-text format of the Dutch Abusive Language Corpus (DALC), which consists of tweets in Dutch. The corpus is structured as follows: unique...
GermEval-2018 Corpus (DE)

This dataset comprises the training and test data (German tweets) from the GermEval 2018 Shared Task on Offensive Language Detection.

You can also access this registry using the API (see API Docs).

3 datasets found