Dataset - B2FIND

WoodVIT_V1_Raw

Raw data files on which the ASKIVIT V1 dataset is based. The dataset includes 56 x 4 images captured by four different sensors: a high-resolution VIS/RGB camera, a hyperspectral...

WoodVIT_V1

This deep learning dataset is designed for image classification and segmentation of bulky waste. It contains 22,659 patches with dimensions of 50 × 50 × 717 px. The dataset...

M4FC dataset

M4FC dataset, accompanying the paper "M4FC: a Multimodal, Multilingual, Multicultural, Multitask Real-World Fact-Checking Dataset". The dataset contains annotations for 4,982...

Slovenian Dataset for Vision-Language Model Instruction-Tuning SLO-VLM-IT-Dat...

This entry contains the SLO-VLM-IT-Dataset, a comprehensive dataset designed for instruction-tuning vision-language models in the Slovenian language. It is composed of five main...

5Pils dataset

The 5Pils dataset accompanies the paper "'Image, tell me your story!' Predicting the original meta-context of visual misinformation". The dataset contains the meta-context...

Multimodal corpus EVA 1.0

EVA Corpus 1.0 consists of one episode of an audio/video session plus corresponding orthographic transcriptions with a duration of 57 minutes. The multi-party spontaneous...

Spoken corpus Berta

The Berta Spoken Corpus contains six hours of recorded speech across a variety of interactional settings. These settings include 57 different speech events, with some captured...

Real-world misleading visualizations QA dataset

The real-world misleading visualization QA dataset accompanies the paper "'Protecting multimodal large language models againts misleading visualizations". The dataset contains...

8 datasets found