Replication data for: SIDTD. Synthetic dataset of ID and Travel Document

DOI

The SIDTD dataset is an extension of the MIDV2020 dataset. Initially, the MIDV2020 dataset is composed of forged ID documents, as all documents are generated by means of AI techniques. These generated documents are considered in the SIDTD dataset as representative of bona fide. On the other hand, the documents generated are considered as being forged versions of them. The corpus of the dataset is composed by ten European nationalities that are equally represented: Albanian, Azerbaijani, Estonian, Finnish, Greek, Lithuanian, Russian, Serbian, Slovakian, and Spanish. We employ two techniques for generating composite PAIs: Crop & Replace and inpainting. Datase contains videos, and clips, of captured ID Documents with different backgrounds, we add the same type of data for the forged ID Document images generated using the techniques described. The protocol employed to generate the dataset is as follows: We printed 191 counterfeit ID documents on paper using an HP Color LaserJet E65050 printer. Then, the documents were laminated with 100-micron-thick laminating pouches to enhance realism and manually cropped. CVC’s employees were requested to use their smartphones to record videos of forged ID documents from SIDTD. This approach aimed to capture a diverse range of video qualities, backgrounds, durations, and light intensities

Identifier
DOI https://doi.org/10.34810/data1815
Related Identifier IsSupplementedBy https://doi.org/10.1038/s41597-024-04160-9
Metadata Access https://dataverse.csuc.cat/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34810/data1815
Provenance
Creator Boned Riera, Carlos ORCID logo; Talarmain, Maxime ORCID logo; Ramos Terrades, Oriol ORCID logo
Publisher CORA.Repositori de Dades de Recerca
Contributor Ramos Terrades, Oriol; Boned Riera, Carlos; Talarmain, Maxime; Centre de Visió per Computador
Publication Year 2024
Rights CC BY-SA 4.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/licenses/by-sa/4.0
OpenAccess true
Contact Ramos Terrades, Oriol (Centre de Visió per Computador); Boned Riera, Carlos (Centre de Visió per Computador); Talarmain, Maxime (Centre de Visió per Computador)
Representation
Resource Type Images; Dataset
Format application/zip; text/plain
Size 23564788403; 10171; 21236; 460633; 534849; 0; 1273966468; 307269; 7432; 723095; 748455; 54511584181
Version 1.0
Discipline Other