- PitVQA: A Dataset of Visual Question Answering in Pituitary Surgery
The PitVQA dataset comprises 25 videos of endoscopic pituitary surgeries from the National Hospital of Neurology and Neurosurgery in London, United Kingdom, similar to the dataset...
- Mini-dataset for VL-Models fine-tuning (VL-Tune-dataset-mini)
A minimal dataset of 125 image-text pairs and 10 text queries for fine-tuning vision-language models on manuscript images. It is dedicated to the task of...
- Vision-Language Mini Testset (VL-Mini-Test)
A minimal test dataset of 100 images and 12 textual queries for vision-language models, dedicated to the task of text-based image retrieval, and constructed from the...