Training dataset for AI-supported subject indexing of research data with DFG classification

DOI

This is a structured JSON dataset for training and evaluating AI models for automated subject indexing of research data and linking of research data and publications. It was created for the project DA-FDM in order to train a vector based model to give automated suggestions for the topic classification of research datasets in DaRUS. In the context of this project, the DFG-classification was integrated as a controlled vocabulary in DaRUS for the Topic Classification field. DFG classes were added manually to datasets from DaRUS that were uploaded prior to the integration. This dataset includes classification tags (DFG, GND, Wikidata), publication links, respective open-access information, and, if the publication is open-access, the respective full texts for datasets from DaRUS as well as TUdatalib.

Example object for the dataset:

{ "name": "doi:10.18419/darus-1234", "tags": [ { "name": "dfg-fs$102-04", "url": "https://w3id.org/dfgfo/2020/102-04" }, { "name": "ResearchDataSet" } ], "links": [ { "name": "doi:10.12345/abc5678", "type": "publication", "is_open_access": true, "open_access_url": "https://www.asdfg.com/10.12345/abc5678", "text": "Extracted publication full text ..." } ] }

Identifier
DOI https://doi.org/10.18419/DARUS-5500
Metadata Access https://darus.uni-stuttgart.de/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18419/DARUS-5500
Provenance
Creator Weinspach, Karoline ORCID logo
Publisher DaRUS
Contributor Weinspach, Karoline; FoKUS; Roy, Sarbani; Hinrichs, Imma; Iglezakis, Dorothea
Publication Year 2025
Funding Reference Baden-Württemberg Ministry of Science, Research and Arts Az.: MWK42-7532-77/1/1
Rights CC BY 4.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/licenses/by/4.0
OpenAccess true
Contact Weinspach, Karoline (University of Stuttgart); FoKUS (University of Stuttgart)
Representation
Resource Type Dataset
Format application/json
Size 20421182
Version 1.0
Discipline Construction Engineering and Architecture; Engineering; Engineering Sciences