The Virtual Patent (VP-WPI) Test Collection

DOI

The VP-WPI Test Collection is a novel dataset that implements the Virtual Patent (VP) concept. A Virtual Patent is a synthesized document that represents a single patent, created by merging the most up-to-date information from its various publication stages (e.g., kind codes A1, A2, B1, B2). 

Specifically, VP-WPI is as a specialized vertical of the WPI+ resource, which offers a unified, non-redundant view of patents by aggregating all relevant documents from the WPI test collection at the kind-code level to create unified VP documents. 

This collection serves as an abstraction layer over WPI, designed to:

Simplify analysis by reducing document redundancy.

Enhance data consistency by providing a single source of truth.

Preserve traceability with links back to all original source documents.

Further Information

For full technical details, including collection statistics, data specifications, and the creation process, please refer to:

WPI+ Resource - Documentation & Source Code: WPI+ GitHub Repository

Resources: 

VP-WPI Test Collection on TU-Wien (this page): VP-WPI Collection.

WPI Test Collection on Zenodo: WPI Test Collection.

Comprehensive Thesis (in Greek): Papadopoulos, C., MSc Thesis, International Hellenic University. https://repository.ihu.gr/handle/11544/47881.

Identifier
DOI https://doi.org/10.48436/x309z-a9q08
Related Identifier IsDescribedBy https://doi.org/10.1016/j.wpi.2025.102389
Related Identifier HasPart https://github.com/cs1msa/WPIplus/
Related Identifier IsDerivedFrom https://repository.ihu.gr/handle/11544/47881
Related Identifier IsSupplementTo https://doi.org/10.5281/zenodo.1489994
Related Identifier IsSupplementTo https://doi.org/10.1016/j.wpi.2019.02.002
Related Identifier IsVersionOf https://doi.org/10.48436/2myzm-yyh19
Metadata Access https://researchdata.tuwien.ac.at/oai2d?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:researchdata.tuwien.ac.at:x309z-a9q08
Provenance
Creator Papadopoulos, Christos ORCID logo; Kamateri, Eleni ORCID logo; Salampasis, Michail (ORCID: 0000-0003-4087-125X); Piroi, Florina ORCID logo
Publisher TU Wien
Publication Year 2025
Rights Creative Commons Attribution 4.0 International; https://creativecommons.org/licenses/by/4.0/legalcode
OpenAccess true
Contact tudata(at)tuwien.ac.at
Representation
Resource Type Dataset
Discipline Other