- Real-world misleading visualizations QA dataset
The real-world misleading visualization QA dataset accompanies the paper "Protecting multimodal large language models against misleading visualizations". The dataset contains...
- Spoken corpus Berta
The Berta Spoken Corpus contains six hours of recorded speech across a variety of interactional settings. These settings include 57 different speech events, with some captured...
- Multimodal corpus EVA 1.0
EVA Corpus 1.0 consists of one episode of an audio/video session plus corresponding orthographic transcriptions with a duration of 57 minutes. The multi-party spontaneous...
- 5Pils dataset
The 5Pils dataset accompanies the paper "'Image, tell me your story!' Predicting the original meta-context of visual misinformation". The dataset contains the meta-context...