8 datasets found

Keywords: multimodal

  • WoodVIT_V1_Raw

    Raw data files on which the ASKIVIT V1 dataset is based. The dataset includes 56 × 4 images captured by four different sensors: a high-resolution VIS/RGB camera, a hyperspectral...
  • WoodVIT_V1

    This deep learning dataset is designed for image classification and segmentation of bulky waste. It contains 22,659 patches with dimensions of 50 × 50 × 717 px. The dataset...
  • M4FC dataset

    M4FC dataset, accompanying the paper "M4FC: a Multimodal, Multilingual, Multicultural, Multitask Real-World Fact-Checking Dataset". The dataset contains annotations for 4,982...
  • Slovenian Dataset for Vision-Language Model Instruction-Tuning SLO-VLM-IT-Dat...

    This entry contains the SLO-VLM-IT-Dataset, a comprehensive dataset designed for instruction-tuning vision-language models in the Slovenian language. It is composed of five main...
  • 5Pils dataset

    The 5Pils dataset accompanies the paper "'Image, tell me your story!' Predicting the original meta-context of visual misinformation". The dataset contains the meta-context...
  • Multimodal corpus EVA 1.0

    EVA Corpus 1.0 consists of one episode of an audio/video session plus corresponding orthographic transcriptions with a duration of 57 minutes. The multi-party spontaneous...
  • Spoken corpus Berta

    The Berta Spoken Corpus contains six hours of recorded speech across a variety of interactional settings. These settings include 57 different speech events, with some captured...
  • Real-world misleading visualizations QA dataset

    The real-world misleading visualizations QA dataset accompanies the paper "Protecting multimodal large language models against misleading visualizations". The dataset contains...
You can also access this registry using the API (see API Docs).