5 datasets found

Keywords: multimodal

Filter Results
  • Slovenian Dataset for Vision-Language Model Instruction-Tuning SLO-VLM-IT-Dat...

    This entry contains the SLO-VLM-IT-Dataset, a comprehensive dataset designed for instruction-tuning vision-language models in the Slovenian language. It is composed of five main...
  • 5Pils dataset

    The 5Pils dataset accompanies the paper "'Image, tell me your story!' Predicting the original meta-context of visual misinformation". The dataset contains the meta-context...
  • Multimodal corpus EVA 1.0

    EVA Corpus 1.0 consists of one episode of an audio/video session plus corresponding orthographic transcriptions with a duration of 57 minutes. The multi-party spontaneous...
  • Spoken corpus Berta

    The Berta Spoken Corpus contains six hours of recorded speech across a variety of interactional settings. These settings include 57 different speech events, with some captured...
  • Real-world misleading visualizations QA dataset

    The real-world misleading visualization QA dataset accompanies the paper "'Protecting multimodal large language models againts misleading visualizations". The dataset contains...
You can also access this registry using the API (see API Docs).