Replication data for: SIDTD. Synthetic dataset of ID and Travel Document

Dataset

The SIDTD dataset is an extension of the MIDV2020 dataset. The MIDV2020 dataset itself consists of forged ID documents, since all of its documents are generated by means of AI techniques. In the SIDTD dataset, these generated documents are treated as representative of bona fide documents, while the documents generated for SIDTD are treated as forged versions of them. The corpus covers ten European nationalities that are equally represented: Albanian, Azerbaijani, Estonian, Finnish, Greek, Lithuanian, Russian, Serbian, Slovakian, and Spanish. We employ two techniques for generating composite presentation attack instruments (PAIs): Crop & Replace and inpainting. The dataset contains videos and clips of captured ID documents with different backgrounds, and we add the same type of data for the forged ID document images generated with the techniques described. The protocol employed to generate the dataset is as follows: We printed 191 counterfeit ID documents on paper using an HP Color LaserJet E65050 printer. The documents were then laminated with 100-micron-thick laminating pouches to enhance realism and manually cropped. CVC employees were asked to use their smartphones to record videos of the forged ID documents from SIDTD. This approach aimed to capture a diverse range of video qualities, backgrounds, durations, and light intensities.

Identifier
DOI	https://doi.org/10.34810/data1815
Related Identifier	IsSupplementedBy https://doi.org/10.1038/s41597-024-04160-9
Metadata Access	https://dataverse.csuc.cat/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34810/data1815
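
The Metadata Access link above is an OAI-PMH GetRecord request. The following is a minimal Python sketch of how such a request could be built and its response read, not an official client. The helper names (oai_getrecord_url, extract_fields, fetch_record) are hypothetical, and the element names looked up in the response (title, creatorName, size) are an assumption about the DataCite XML that the endpoint returns.

```python
from urllib.parse import urlencode
from urllib.request import urlopen
import xml.etree.ElementTree as ET

# Endpoint taken from the Metadata Access field of this record.
OAI_BASE = "https://dataverse.csuc.cat/oai"


def oai_getrecord_url(doi, prefix="oai_datacite", base=OAI_BASE):
    """Build an OAI-PMH GetRecord URL for a DOI (hypothetical helper)."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": prefix,
        "identifier": f"doi:{doi}",
    }
    # Keep ':' and '/' unescaped so the URL has the same form as in this record.
    return f"{base}?{urlencode(params, safe=':/')}"


def extract_fields(xml_text, names=("title", "creatorName", "size")):
    """Collect the text of elements by local name, ignoring XML namespaces.

    The element names are assumptions about the DataCite payload.
    """
    root = ET.fromstring(xml_text)
    found = {name: [] for name in names}
    for element in root.iter():
        local = element.tag.rsplit("}", 1)[-1]  # strip "{namespace}" prefix
        if local in found and element.text and element.text.strip():
            found[local].append(element.text.strip())
    return found


def fetch_record(doi):
    """Download and parse the record (requires network access)."""
    with urlopen(oai_getrecord_url(doi), timeout=30) as response:
        return extract_fields(response.read().decode("utf-8"))
```

Calling oai_getrecord_url("10.34810/data1815") reproduces the Metadata Access URL listed above. Matching on local element names avoids hard-coding a DataCite schema namespace, which can differ between schema versions.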

Provenance
Creator	Boned Riera, Carlos; Talarmain, Maxime; Ramos Terrades, Oriol
Publisher	CORA. Repositori de Dades de Recerca
Contributor	Ramos Terrades, Oriol; Boned Riera, Carlos; Talarmain, Maxime; Centre de Visió per Computador
Publication Year	2024
Rights	CC BY-SA 4.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/licenses/by-sa/4.0
OpenAccess	true
Contact	Ramos Terrades, Oriol (Centre de Visió per Computador); Boned Riera, Carlos (Centre de Visió per Computador); Talarmain, Maxime (Centre de Visió per Computador)

Representation
Resource Type	Images; Dataset
Format	application/zip; text/plain
Size	23564788403; 10171; 21236; 460633; 534849; 0; 1273966468; 307269; 7432; 723095; 748455; 54511584181
Version	1.0
Discipline	Other
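
The Size field lists one value per file. As a rough aid for planning a download, the sketch below totals those values in Python. It assumes the values are byte counts, which the record does not state explicitly, and the names SIZE_FIELD, parse_sizes, and to_gigabytes are hypothetical.

```python
# Size field copied verbatim from this record; values assumed to be bytes.
SIZE_FIELD = (
    "23564788403; 10171; 21236; 460633; 534849; 0; "
    "1273966468; 307269; 7432; 723095; 748455; 54511584181"
)


def parse_sizes(field):
    """Split a semicolon-separated size field into a list of integers."""
    return [int(part) for part in field.split(";") if part.strip()]


def to_gigabytes(num_bytes):
    """Convert bytes to decimal gigabytes (1 GB = 10**9 bytes)."""
    return num_bytes / 10**9


sizes = parse_sizes(SIZE_FIELD)
total_bytes = sum(sizes)

print(f"{len(sizes)} files, {total_bytes} bytes in total")
print(f"about {to_gigabytes(total_bytes):.2f} GB")
print(f"largest file: about {to_gigabytes(max(sizes)):.2f} GB")
```

Under the byte assumption, the twelve entries sum to 79353152192 bytes, or roughly 79.35 GB, with the two largest archives accounting for almost all of it.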