Dataset - B2FIND

PitVQA: A Dataset of Visual Question Answering in Pituitary Surgery

PitVQA dataset comprises 25 videos of endoscopic pituitary surgeries from the National Hospital of Neurology and Neurosurgery in London, United Kingdom, similar to the dataset...
Vision-Lanuguage Mini Testset (VL-Mini-Test)

A minimal test dataset of 100 images and 12 textual queries for vision-language models, dedicated to the task of text-based image retrieval, and constructed from the...
Mini-dataset for VL-Models fine-tuning (VL-Tune-dataset-mini)

A minimal dataset of 125 image-text pairs and 10 text queries for fine-tuning vision-language models on manuscript images. It is dedicated to the task of...

You can also access this registry using the API (see API Docs).

3 datasets found