Vision-Lanuguage Mini Testset (VL-Mini-Test)

Dataset

DOI

A minimal test dataset of 100 images and 12 textual queries for vision-language models, dedicated to the task of text-based image retrieval, and constructed from the following sources:

50 images from the DocExplore dataset of medieval manuscripts.
50 images from two manuscripts from Al-Ḥarīrī, Maqāmāt, © Paris, Bibliothèque nationale de France. Département des manuscrits, namely MS arabe 3929 and MS arabe 5847.

The research for this work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany's Excellence Strategy – EXC 2176 ‘Understanding Written Artefacts: Material, Interaction and Transmission in Manuscript Cultures', project no. 390893796. The research was conducted within the scope of the Centre for the Study of Manuscript Cultures (CSMC) at Universität Hamburg.

Identifier
DOI	https://doi.org/10.25592/uhhfdm.11755
Related Identifier	https://doi.org/10.25592/uhhfdm.11754
Metadata Access	https://www.fdr.uni-hamburg.de/oai2d?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:fdr.uni-hamburg.de:11755

Provenance
Creator	Hussein Mohammed
Publisher	Universität Hamburg
Publication Year	2023
Rights	Creative Commons Attribution 4.0 International; Open Access; https://creativecommons.org/licenses/by/4.0/legalcode; info:eu-repo/semantics/openAccess
OpenAccess	true

Representation
Language	English
Resource Type	Dataset
Version	1.0
Discipline	Humanities