-
WoodVIT_V1_Raw
Raw data files on which the ASKIVIT V1 dataset is based. The dataset includes 56 x 4 images captured by four different sensors: a high-resolution VIS/RGB camera, a hyperspectral... -
WoodVIT_V1
This deep learning dataset is designed for image classification and segmentation of bulky waste. It contains 22,659 patches with dimensions of 50 × 50 × 717 px. The dataset... -
M4FC dataset
M4FC dataset, accompanying the paper "M4FC: a Multimodal, Multilingual, Multicultural, Multitask Real-World Fact-Checking Dataset". The dataset contains annotations for 4,982... -
Slovenian Dataset for Vision-Language Model Instruction-Tuning SLO-VLM-IT-Dat...
This entry contains the SLO-VLM-IT-Dataset, a comprehensive dataset designed for instruction-tuning vision-language models in the Slovenian language. It is composed of five main... -
5Pils dataset
The 5Pils dataset accompanies the paper "'Image, tell me your story!' Predicting the original meta-context of visual misinformation". The dataset contains the meta-context... -
Multimodal corpus EVA 1.0
EVA Corpus 1.0 consists of one episode of an audio/video session plus corresponding orthographic transcriptions with a duration of 57 minutes. The multi-party spontaneous... -
Spoken corpus Berta
The Berta Spoken Corpus contains six hours of recorded speech across a variety of interactional settings. These settings include 57 different speech events, with some captured... -
Real-world misleading visualizations QA dataset
The real-world misleading visualization QA dataset accompanies the paper "'Protecting multimodal large language models againts misleading visualizations". The dataset contains...
