Mapping onto EUDAT-B2FIND Metadata Schema

The offered metadata must be mapped to the B2FIND schema in a meaningful way. This is currently done jointly, through iterative discussions between the data provider and the B2FIND team.

   Specification of Community Metadata
   Homogenisation and Semantic Mapping
   EUDAT-B2FIND metadata schema
    Concordance with other Standards
    The central facet Discipline

Specification of Community Metadata

The implementation of the mapping, as described in the following subsection, is based on a detailed specification and documentation of the community-specific metadata. For this purpose a spreadsheet must be filled out. The Excel template can be requested via the support form or by email, or downloaded from Google Drive as Community-B2FIND_template.xlsx.

The template is divided into several tabs:

  • General Information: Contact persons and information about the community.
  • Metadata Specification: Detailed information about the specific metadata formats, schemas and structures used.
  • Harvesting: The harvesting endpoints (e.g. OAI URLs), the protocols and APIs used and, if available, the subsets.
  • Mapping: The mapping of the community properties to the B2FIND schema, together with coverage information. This table is discussed and developed iteratively with the data provider during the uptake process.
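
When filling in the Harvesting tab, it can help to write the endpoints down as complete requests. The following Python sketch builds OAI-PMH request URLs for a whole subset (ListRecords) and for a single record (GetRecord). The verbs and the argument names verb, metadataPrefix, set and identifier are those defined by OAI-PMH; the endpoint URL, the set name and the record identifier are hypothetical placeholders, and the helper oai_request_url is introduced here only for illustration.

```python
from urllib.parse import urlencode


def oai_request_url(endpoint, verb, **params):
    """Build an OAI-PMH request URL; arguments whose value is None are skipped."""
    query = {"verb": verb}
    query.update({key: value for key, value in params.items() if value is not None})
    return endpoint + "?" + urlencode(query)


# Hypothetical endpoint, set and identifier, for illustration only.
ENDPOINT = "https://repository.example.org/oai"

# Harvest all records of one subset in Dublin Core format.
list_url = oai_request_url(ENDPOINT, "ListRecords", metadataPrefix="oai_dc", set="climate")

# Fetch one record by its OAI identifier.
record_url = oai_request_url(
    ENDPOINT,
    "GetRecord",
    metadataPrefix="oai_dc",
    identifier="oai:repository.example.org:1234",
)

print(list_url)
print(record_url)
```

Note that urlencode percent-encodes reserved characters, so the colons in the identifier appear as %3A in the resulting URL.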

Homogenisation and Semantic Mapping

To transform and reformat the harvested ‘raw’ metadata records into datasets that can be uploaded to the B2FIND catalogue and then indexed and displayed in the B2FIND portal, the following processing steps are carried out:
  1. Select entries from the XML records, based on XPath rules that depend on the community-specific metadata formats (see providing metadata).
  2. Parse the selected values and assign them to the keys specified in the XPath rules, i.e. to fields of the B2FIND schema.
  3. Store the resulting key-value pairs in JSON dictionaries.
  4. Check and validate these JSON records before uploading them to the B2FIND repository.
This mapping procedure needs regular adaptation and extension as the requirements of the communities change.
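
The four steps can be illustrated with a minimal Python sketch. It is not the B2FIND implementation: the sample record, the XPath rules in MAPPING_RULES and the function names map_record and validate are assumptions made for this example, and real community formats will need their own rules. The sketch uses the limited XPath support of the standard library module xml.etree.ElementTree.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical raw record in OAI Dublin Core; real community formats differ.
RAW_RECORD = """<oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
    xmlns:dc="http://purl.org/dc/elements/1.1/">
  <dc:title>Sea surface temperature 1990-2000</dc:title>
  <dc:creator>Doe, Jane</dc:creator>
  <dc:creator>Roe, Richard</dc:creator>
  <dc:date>2015</dc:date>
  <dc:identifier>https://doi.org/10.1234/example</dc:identifier>
</oai_dc:dc>"""

NAMESPACES = {"dc": "http://purl.org/dc/elements/1.1/"}

# Illustrative rules: B2FIND field -> (XPath rule, field may hold several values)
MAPPING_RULES = {
    "Title": (".//dc:title", False),
    "Creator": (".//dc:creator", True),
    "PublicationYear": (".//dc:date", False),
    "DOI": (".//dc:identifier", False),
}


def map_record(xml_text):
    """Steps 1-3: select values by XPath and collect them under B2FIND keys."""
    root = ET.fromstring(xml_text)
    record = {}
    for key, (xpath, multiple) in MAPPING_RULES.items():
        values = [el.text.strip() for el in root.findall(xpath, NAMESPACES) if el.text]
        if values:
            record[key] = values if multiple else values[0]
    return record


def validate(record):
    """Step 4, reduced to a single check: the mandatory Title must be present."""
    problems = []
    if not record.get("Title"):
        problems.append("Title is missing")
    return problems


record = map_record(RAW_RECORD)
problems = validate(record)

# The resulting dictionary serialises directly to JSON.
print(json.dumps(record, indent=2))
print("problems:", problems)
```

Keeping the XPath rules in a table separate from the code means that a new community format only requires a new set of rules, not a new parser.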

EUDAT-B2FIND Metadata Schema

To allow a single search space, B2FIND established a common, interdisciplinary metadata schema. This schema is based on the DataCite Metadata Schema 4.1 and is therefore also compatible with the guidelines of other e-infrastructures such as OpenAIRE, whose schemas are likewise based on the DataCite schema.

The B2FIND Metadata Schema 1.0 is the current version and was released on August 12, 2017. The associated XSD file can be downloaded as b2find_schema_0.1.xsd.

Currently the schema comprises 19 fields or facets as listed in the following table with their description, allowed values and references to the associated properties in the DataCite Metadata Schema 4.1.

| Metadata Type | B2FIND Name | Description | Allowed values | DataCite 4.0 reference | Obligation | Occurrence | Comments and Issues |
|---|---|---|---|---|---|---|---|
| General Information | Title | A name or a title by which a resource is known. | Textual | 3. Title | Mandatory | 1 | Encoding must be UTF-8 (Unicode) |
| General Information | Description | Additional information describing the content of the resource. Could be an abstract, a summary or a table of contents. | Textual | 17. Description | Recommended | 0-1 | Encoding should be UTF-8 (Unicode) |
| General Information | Tags | A subject, keyword, classification code, or key phrase describing the content. | List of strings, filter out 'non nouns' by using 'stop words' | 6. Subject | Optional | 1 | Try to use keyword thesauri from communities |
| Identifier | DOI | A persistent, citable identifier (registered at DataCite) that uniquely identifies a resource. | Must be a resolvable URL, registered at DataCite as DOI | 1. Identifier, 1.1. identifierType = DOI | Mandatory (at least one resource identifier is mandatory) | 1-3 | |
| Identifier | PID | A persistent identifier (implemented as a handle in a Handle server) that uniquely identifies a resource. | Must be a resolvable URL and registered at a Handle server | 1. Identifier | | | |
| Identifier | Source | An identifier (URL) that uniquely identifies a resource. | Should be a resolvable URL | 1. Identifier | | | |
| Identifier | MetaDataAccess | Link to the original harvested metadata record (GetRecord request). | Should be a resolvable URL | N/A | Recommended | 0-1 | |
| Provenance | Creator | The main researchers involved in producing the data, or the authors of the publication, in priority order. | List of names | 2. Creator | Recommended | 0-n | |
| Provenance | Publisher | The name of the entity that holds, archives, publishes, prints, distributes, releases, issues, or produces the resource. This property will be used to formulate the citation, so consider the prominence of the role. | List of names | 4. Publisher | Recommended | 0-n | |
| Provenance | PublicationYear | The year when the data was or will be made publicly available. | UTC year format (YYYY) | 5. PublicationYear | Recommended | 0-1 | |
| Provenance | Rights | Any rights information for this resource. | Textual | 16. Rights | Optional | 0-1 | |
| Provenance | Contact | Any contact information for this resource. | List of names | [may be 7. Contributor] | Optional | 0-n | |
| Representation | Language | The primary language of the resource. | Allowed values are taken from ISO 639-1 language codes. | 9. Language | Optional | 0-1 | Examples: English, German, French |
| Representation | ResourceType | A description of the resource. | Textual | 10. ResourceType | Recommended | 0-1 | |
| Representation | Format | Technical format of the resource. | Textual | 14. Format | Optional | 0-1 | |
| Representation | Checksum | Checksum of the underlying data resource. | MD5 checksum | N/A | Optional | 0-1 | |
| Coverage | Discipline | The scientific disciplines linked with the resource. | Controlled vocabulary, see b2find_disciplines.json | N/A [sometimes information in 6. Subject] | Recommended | 0-n | |
| Coverage | Spatial Coverage | A geolocation where the research data was gathered or which the data is about. The content of this category is displayed as plain text. If longitude/latitude information is given, it is displayed on the map. | Textual geospatial description (spatial region or named place (geonames)); if longitude/latitude information is given, it is displayed on the map. | 18. Geolocation | Optional | 0-1 | |
| Coverage | Temporal Coverage | Period of time the research data itself is related to. Could be a date format or plain text. | Date-time representation | 8. Date / [8.1 dateType = Collected?] | Optional | 0-1 | Not really provided by DataCite in the sense of coverage |
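
The obligations and occurrences in the table lend themselves to an automatic check of mapped records. The following Python sketch covers a subset of the fields, with the rules copied from the table above. The representation of a record as a dictionary, the names FIELD_RULES and check_record, and the decision to report missing recommended fields as warnings are assumptions made for this example, not part of the B2FIND specification.

```python
import re

# Field -> (obligation, maximum occurrence); None means unbounded (0-n).
FIELD_RULES = {
    "Title": ("Mandatory", 1),
    "Description": ("Recommended", 1),
    "Creator": ("Recommended", None),
    "Publisher": ("Recommended", None),
    "PublicationYear": ("Recommended", 1),
    "Language": ("Optional", 1),
    "Discipline": ("Recommended", None),
}

# At least one of these resource identifiers is mandatory.
IDENTIFIER_FIELDS = ("DOI", "PID", "Source")


def as_list(value):
    """Normalise a missing, single or multiple value to a list."""
    if value is None or value == "":
        return []
    return value if isinstance(value, list) else [value]


def check_record(record):
    """Return (errors, warnings) for a dict mapping B2FIND fields to values."""
    errors, warnings = [], []
    for field, (obligation, max_occurs) in FIELD_RULES.items():
        values = as_list(record.get(field))
        if not values:
            if obligation == "Mandatory":
                errors.append(f"{field} is mandatory")
            elif obligation == "Recommended":
                warnings.append(f"{field} is recommended")
        elif max_occurs is not None and len(values) > max_occurs:
            errors.append(f"{field} occurs {len(values)} times, at most {max_occurs} allowed")
    if not any(as_list(record.get(field)) for field in IDENTIFIER_FIELDS):
        errors.append("at least one of DOI, PID or Source is mandatory")
    year = record.get("PublicationYear")
    if year and not re.fullmatch(r"\d{4}", str(year)):
        errors.append("PublicationYear must have the form YYYY")
    return errors, warnings


good = {"Title": "Example", "DOI": "https://doi.org/10.1234/example", "PublicationYear": "2017"}
bad = {"Creator": ["Doe, Jane"], "PublicationYear": "20x7"}

print(check_record(good))
print(check_record(bad))
```

For the second record the check reports three errors: the missing Title, the missing resource identifier and the malformed PublicationYear.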

Concordance with other Standards

As stated above, the EUDAT-B2FIND schema is compatible with other widely used standards. The following table shows the compatibility with the core schema of EUDAT-B2SHARE and with the open access initiative OpenAIRE, using the DataCite schema as the reference. The obligation is specified for each field, where M stands for mandatory, R for recommended and O for optional; OpenAIRE additionally uses MA for mandatory when applicable.

DataCite B2FIND B2SHARE OpenAIRE Comments and Issues
ID/Property DC 4.1 MD Group B2F 2.4
1/Identifier Identifier/1.1. identifierType=DOI (M) Resource Identifier(M) DOI(R) DOI(O) DOI(O) DataCite supports only the type 'DOI' as identifier. While B2SHARE always provides a PID, B2FIND requires at least one URL (a DOI or PID is preferred) linked to the underlying data resource. OpenAIRE supports a controlled list of values for the identifier type (DOI, ARK, Handle, PURL, URN, URL).
N/A PID(O) PID(R) Handle or PURL(O)
N/A Source (e.g. URL)(O) [List of] URLs(O) URN or URL
N/A MetaData Identifier MetaDataAccess(R) N/A N/A B2FIND provides here the GetRecord request used to harvest the metadata originally
2/Creator Creator(R) Provenance Creator(R) Creator(R) 2. Creator(M)
3/Title Title(M) General Information Title(M) Title(M) 3. Title(M)
4/Publisher Publisher(M) Provenance Publisher(R) Publisher(R) Publisher(M)
5 PublicationYear(M) PublicationYear(O) Provenance PublicationYear(O) PublicationYear(M)
6 Subject(R) Tags and Discipline(R) Keywords and Discipline(R) Subject(O)
7 Contributor Provenance [ --> Contact] Contributors Contributor (MA/O)
8 Date Coverage [ --> Temporal Coverage] The DataCite definition is very vague here (*Different dates relevant to the work*). B2FIND provides *PublicationYear*, i.e. the year in which the dataset is published, and *TemporalCoverage*, i.e. the time interval the data covers, which is associated with a powerful 'Filter by time'.
9 Language(O) Representation Language(O) Language(O) Language(R)
10 ResourceType(M) Representation ResourceType(R) ResourceType(R) ResourceType(R)
11 AlternateIdentifier(O) Identifier N/A Alternate Identifiers(O) AlternateIdentifier(O)
12 RelatedIdentifier(R) Identifier N/A N/A RelatedIdentifier(MA)
13 Size Representation N/A Size per data object (file) Size(O)
14 Format Representation Format Format(O)
15 Version Representation N/A [ --> checksum] Version(O)
16 Rights(O) Provenance Rights(MA)
17 Description Description Description(MA)
18 GeoLocation(R) Coverage SpatialCoverage(O) GeoLocation(O) In B2FIND *SpatialCoverage*, i.e. the geo spatial coverage, is associated with a 'Filter by location' interface.
19 FundingReference Provenance N/A N/A N/A

The central facet Discipline

For the central facet Discipline B2FIND has defined a closed vocabulary with three levels of sub disciplines:

Browse by Text
    1. Humanities
      1.1 Human History
      • 1.1.1 African History
      • 1.1.2 American History
      • 1.1.3 Ancient History
      • 1.1.4 Australian History
      • 1.1.5 Asian History
      • 1.1.6 European History
      • 1.1.7 Chinese History
      • 1.1.8 Economic History
      • 1.1.9 Greek History
      • 1.1.10 Iranian History
      • 1.1.11 Indian History
      • 1.1.12 Indonesian History
      • 1.1.13 Intellectual History
      • 1.1.14 Latin American History
      • 1.1.15 Modern History
      • 1.1.16 Political History
      • 1.1.17 Pre-Columbian era
      • 1.1.18 Roman History
      • 1.1.19 Russian History
      • 1.1.20 Scientific History
      • 1.1.21 Technological History
      • 1.1.22 World History
      1.2 Linguistics

    2. Social Sciences
    3. Natural Sciences
    4. Formal Sciences
    5. Professions
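
Because the vocabulary is closed, a term delivered by a community either resolves to an entry of the hierarchy or is rejected. The following Python sketch shows one way to perform such a lookup on an excerpt of the list above. The nested-dictionary layout is an assumption made for this example; the actual structure of b2find_disciplines.json may differ, and the names index_paths and normalise_discipline are hypothetical.

```python
# Excerpt of the hierarchy listed above, as nested dictionaries (assumed layout).
DISCIPLINES = {
    "Humanities": {
        "Human History": {
            "Ancient History": {},
            "Modern History": {},
            "World History": {},
        },
        "Linguistics": {},
    },
    "Social Sciences": {},
    "Natural Sciences": {},
    "Formal Sciences": {},
    "Professions": {},
}


def index_paths(tree, prefix=()):
    """Map each lower-cased discipline name to its full path from the top level."""
    index = {}
    for name, children in tree.items():
        path = prefix + (name,)
        index[name.lower()] = path
        index.update(index_paths(children, path))
    return index


PATHS = index_paths(DISCIPLINES)


def normalise_discipline(term):
    """Return the path of a known term, or None if it is not in the vocabulary."""
    return PATHS.get(term.strip().lower())


print(normalise_discipline("ancient history"))
# ('Humanities', 'Human History', 'Ancient History')
print(normalise_discipline("Alchemy"))
# None
```

Returning the full path rather than only the matched name keeps the parent disciplines available, so a record tagged with a sub-discipline can also be found under its top-level discipline.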

This section is under construction.

Browse by subject

  1. Humanities and Social Sciences
    1. Humanities
      1. Ancient Cultures
        1. Prehistory
        2. Classical Philology
        3. Ancient History
        4. Classical Archaeology
        5. Egyptology and Ancient Near Eastern Studies
      2. History
        1. Medieval History
        2. Early Modern History
        3. Modern and Current History
        4. History of Science
      3. Fine Arts, Music, Theatre and Media Studies
        1. Art History
        2. Musicology
        3. Theatre and Media Studies
      4. Linguistics
        1. General and Applied Linguistics
        2. Individual Linguistics
        3. Typology, Non-European Languages, Historical Linguistics
      5. Literary Studies
        1. Medieval German Literature
        2. Modern German Literature
        3. European and American Literature
        4. General and Comparative Literature and Cultural Studies
      6. Non-European Languages and Cultures, Social and Cultural Anthropology, Jewish Studies and Religious Studies
        1. Social and Cultural Anthropology and Ethnology/Folklore
        2. Asian Studies
        3. African, American and Oceania Studies
        4. Islamic Studies, Arabian Studies, Semitic Studies
        5. Religious Studies and Jewish Studies
      7. Theology
        1. Protestant Theology
        2. Roman Catholic Theology
      8. Philosophy
        1. History of Philosophy
        2. Theoretical Philosophy
        3. Practical Philosophy
    2. Social and Behavioural Sciences
      1. Education Sciences
        1. General Education and History of Education
        2. Research on Teaching, Learning and Training
        3. Research on Socialization and Educational Institutions and Professions
      2. Psychology
        1. General, Biological and Mathematical Psychology
        2. Developmental and Educational Psychology
        3. Social Psychology, Industrial and Organisational Psychology
        4. Differential Psychology, Clinical Psychology, Medical Psychology, Methodology
      3. Social Sciences
        1. Sociological Theory
        2. Empirical Social Research
        3. Communication Science
        4. Political Science
      4. Economics
        1. Economic Theory
        2. Economic and Social Policy
        3. Public Finance
        4. Business Administration
        5. Statistics and Econometrics
        6. Economic and Social History
      5. Jurisprudence
        1. Legal and Political Philosophy, Legal History, Legal Theory
        2. Private Law
        3. Public Law
        4. Criminal Law and Law of Criminal Procedure
        5. Criminology
  2. Life Sciences
    1. Biology
      1. Basic Biological and Medical Research
        1. Biochemistry
        2. Biophysics
        3. Cell Biology
        4. Structural Biology
        5. General Genetics
        6. Developmental Biology
        7. Bioinformatics and Theoretical Biology
        8. Anatomy
      2. Plant Sciences
        1. Plant Systematics and Evolution
        2. Plant Ecology and Ecosystem Analysis
        3. Inter-organismic Interactions of Plants
        4. Plant Physiology
        5. Plant Biochemistry and Biophysics
        6. Plant Cell and Developmental Biology
        7. Plant Genetics
      3. Zoology
        1. Systematics and Morphology
        2. Evolution, Anthropology
        3. Animal Ecology, Biodiversity and Ecosystem Research
        4. Sensory and Behavioural Biology
        5. Biochemistry and Animal Physiology
        6. Animal Genetics, Cell and Developmental Biology
    2. Medicine
      1. Microbiology, Virology and Immunology
        1. Metabolism, Biochemistry and Genetics of Microorganisms
        2. Microbial Ecology and Applied Microbiology
        3. Medical Microbiology, Molecular Infection Biology
        4. Virology
        5. Immunology
      2. Medicine
        1. Epidemiology, Medical Biometry, Medical Informatics
        2. Public Health, Health Services Research, Social Medicine
        3. Human Genetics
        4. Physiology
        5. Nutritional Sciences
        6. Pathology and Forensic Medicine
        7. Clinical Chemistry and Pathobiochemistry
        8. Pharmacy
        9. Pharmacology
        10. Toxicology and Occupational Medicine
        11. Anaesthesiology
        12. Cardiology, Angiology
        13. Pneumology, Clinical Infectiology Intensive Care Medicine
        14. Hematology, Oncology, Transfusion Medicine
        15. Gastroenterology, Metabolism
        16. Nephrology
        17. Endocrinology, Diabetology
        18. Rheumatology, Clinical Immunology, Allergology
        19. Dermatology
        20. Pediatric and Adolescent Medicine
        21. Gynaecology and Obstetrics
        22. Reproductive Medicine/Biology
        23. Urology
        24. Gerontology and Geriatric Medicine
        25. Vascular and Visceral Surgery
        26. Cardiothoracic Surgery
        27. Traumatology and Orthopaedics
        28. Dentistry, Oral Surgery
        29. Otolaryngology
        30. Radiology and Nuclear Medicine
        31. Radiation Oncology and Radiobiology
        32. Biomedical Technology and Medical Physics
      3. Neurosciences
        1. Molecular Neuroscience and Neurogenetics
        2. Cellular Neuroscience
        3. Developmental Neurobiology
        4. Systemic Neuroscience, Computational Neuroscience, Behaviour
        5. Comparative Neurobiology
        6. Cognitive Neuroscience and Neuroimaging
        7. Molecular Neurology
        8. Clinical Neurosciences I - Neurology, Neurosurgery
        9. Biological Psychiatry
        10. Clinical Neurosciences II - Psychotherapy, Psychosomatic Medicine
        11. Clinical Neurosciences III - Ophthalmology
    3. Agriculture, Forestry, Horticulture and Veterinary Medicine
      1. Agriculture, Forestry, Horticulture and Veterinary Medicine
        1. Soil Sciences
        2. Plant Cultivation
        3. Plant Nutrition
        4. Ecology of Agricultural Landscapes
        5. Plant Breeding
        6. Phytomedicine
        7. Agricultural and Food Process Engineering
        8. Agricultural Economics and Sociology
        9. Inventory Control and Use of Forest Resources
        10. Basic Forest Research
        11. Animal Husbandry, Breeding and Hygiene
        12. Animal Nutrition and Nutrition Physiology
        13. Basic Veterinary Medical Science
        14. Basic Research on Pathogenesis, Diagnostics and Therapy and Clinical Veterinary Medicine
  3. Natural Sciences
    1. Chemistry
      1. Molecular Chemistry
        1. Inorganic Molecular Chemistry
        2. Organic Molecular Chemistry
      2. Chemical Solid State and Surface Research
        1. Solid State and Surface Chemistry, Material Synthesis
        2. Physical Chemistry of Solids and Surfaces, Material Characterisation
        3. Theory and Modelling
      3. Physical and Theoretical Chemistry
        1. Physical Chemistry of Molecules, Interfaces and Liquids - Spectroscopy, Kinetics
        2. General Theoretical Chemistry
      4. Analytical Chemistry, Method Development (Chemistry)
        1. Analytical Chemistry, Method Development (Chemistry)
      5. Biological Chemistry and Food Chemistry
        1. Biological and Biomimetic Chemistry
        2. Food Chemistry
      6. Polymer Research
        1. Preparatory and Physical Chemistry of Polymers
        2. Experimental and Theoretical Physics of Polymers
        3. Polymer Materials
    2. Physics
      1. Condensed Matter Physics
        1. Experimental Condensed Matter Physics
        2. Theoretical Condensed Matter Physics
      2. Optics, Quantum Optics and Physics of Atoms, Molecules and Plasmas
        1. Optics, Quantum Optics, Atoms, Molecules, Plasmas
      3. Particles, Nuclei and Fields
        1. Particles, Nuclei and Fields
      4. Statistical Physics, Soft Matter, Biological Physics, Nonlinear Dynamics
        1. Statistical Physics, Soft Matter, Biological Physics, Nonlinear Dynamics
      5. Astrophysics and Astronomy
        1. Astrophysics and Astronomy
    3. Mathematics
      1. Mathematics
        1. Mathematics
    4. Geosciences (including Geography)
      1. Atmospheric Science and Oceanography
        1. Atmospheric Science
        2. Oceanography
      2. Geology and Palaeontology
        1. Geology and Palaeontology
      3. Geophysics and Geodesy
        1. Geophysics
        2. Geodesy, Photogrammetry, Remote Sensing, Geoinformatics, Cartography
      4. Geochemistry, Mineralogy and Crystallography
        1. Geochemistry, Mineralogy and Crystallography
      5. Geography
        1. Physical Geography
        2. Human Geography
      6. Water Research
        1. Hydrogeology, Hydrology, Limnology, Urban Water Management, Water Chemistry, Integrated Water Resources Management
  4. Engineering Sciences
    1. Mechanical and Industrial Engineering
      1. Production Technology
        1. Metal-Cutting Manufacturing Engineering
        2. Primary Shaping and Reshaping Technology
        3. Micro-, Precision, Mounting, Joining, Separation Technology
        4. Plastics Engineering
        5. Production Automation, Factory Operation, Operations Management
      2. Mechanics and Constructive Mechanical Engineering
        1. Construction, Machine Elements
        2. Mechanics
        3. Lightweight Construction, Textile Technology
        4. Acoustics
    2. Thermal Engineering/Process Engineering
      1. Process Engineering, Technical Chemistry
        1. Chemical and Thermal Process Engineering
        2. Technical Chemistry
        3. Mechanical Process Engineering
        4. Biological Process Engineering
      2. Heat Energy Technology, Thermal Machines, Fluid Mechanics
        1. Energy Process Engineering
        2. Technical Thermodynamics
        3. Fluid Mechanics
        4. Hydraulic and Turbo Engines and Piston Engines
    3. Materials Science and Engineering
      1. Materials Engineering
        1. Metallurgical and Thermal Processes, Thermomechanical Treatment of Materials
        2. Sintered Metallic and Ceramic Materials
        3. Composite Materials
        4. Mechanical Behaviour of Construction Materials
        5. Coating and Surface Technology
      2. Materials Science
        1. Thermodynamics and Kinetics of Materials
        2. Synthesis and Properties of Functional Materials
        3. Microstructural Mechanical Properties of Materials
        4. Structuring and Functionalisation
        5. Biomaterials
    4. Computer Science, Electrical and System Engineering
      1. Systems Engineering
        1. Automation, Control Systems, Robotics, Mechatronics
        2. Measurement Systems
        3. Microsystems
        4. Traffic and Transport Systems, Logistics
        5. Human Factors, Ergonomics, Human-Machine Systems
      2. Electrical Engineering
        1. Electronic Semiconductors, Components, Circuits, Systems
        2. Communication, High-Frequency and Network Technology, Theoretical Electrical Engineering
        3. Electrical Energy Generation, Distribution, Application
      3. Computer Science
        1. Theoretical Computer Science
        2. Software Technology
        3. Operating, Communication and Information Systems
        4. Artificial Intelligence, Image and Language Processing
        5. Computer Architecture and Embedded Systems
    5. Construction Engineering and Architecture
      1. Construction Engineering and Architecture
        1. Architecture, Building and Construction History, Sustainable Building Technology, Building Design
        2. Urbanism, Spatial Planning, Transportation and Infrastructure Planning, Landscape Planning
        3. Construction Material Sciences, Chemistry, Building Physics
        4. Structural Engineering, Building Informatics, Construction Operation
        5. Applied Mechanics, Statics and Dynamics
        6. Geotechnics, Hydraulic Engineering