Data for Upscaling mineralogy with hyperspectral data: a benchmark dataset and machine learning framework to enable hyperspectral geometallurgy

DOI

Mineral liberation analysis (MLA) dataset accompanying the paper: Upscaling mineralogy with hyperspectral data: a benchmark dataset and machine learning framework to enable hyperspectral geometallurgy. This describes the mineralogy of 204 thick-sections prepared from 49 drillholes sampled across 7 different locations and coregistered with VNIR-SWIR-MWIR-LWIR hyperspectral data. It is intended to help develop, test and benchmark methods for predicting mineralogy from hyperspectral data. 

The data are stored as hycore (https://github.com/samthiele/hycore) Shed directories for easy loading, although individual MLA sections and corresponding hyperspectral images are all in ENVI format (so can be loaded by any hyperspectral analysis code or software). MLA outputs are also stored in their original (high-resolution) form as indexed bitmaps. The AbundanceMapping.xlsx file can be used to translate these MLA class indices into modal mineral abundances.

Finally, jupyter notebooks used to derive the benchmarks presented in the paper are also included, in the Code folder. These illustrate how the data can be loaded and manipulated using hycore and hklearn (https://github.com/samthiele/hklearn), and used to train machine learning models that predict modal mineralogy given hyperspectral data.

Identifier
DOI https://doi.org/10.14278/rodare.4582
Related Identifier IsIdenticalTo https://www.hzdr.de/publications/Publ-43223
Related Identifier IsPartOf https://doi.org/10.14278/rodare.4581
Related Identifier IsPartOf https://rodare.hzdr.de/communities/energy
Related Identifier IsPartOf https://rodare.hzdr.de/communities/hzdr
Related Identifier IsPartOf https://rodare.hzdr.de/communities/rodare
Metadata Access https://rodare.hzdr.de/oai2d?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:rodare.hzdr.de:4582
Provenance
Creator Thiele, Samuel Thomas ORCID logo; Kirsch, Moritz ORCID logo; Frenzel, Max; Tolosana Delgado, Raimon ORCID logo; Kamath, Akshay Vijay ORCID logo; Guy, Bradley Martin ORCID logo; Kim, Yongwhi; Laura, Tusa; Járóka, Tom; Gloaguen, Richard (ORCID: 0000-0002-4383-473X)
Publisher Rodare
Publication Year 2026
Rights Creative Commons Attribution 4.0 International; Open Access; https://creativecommons.org/licenses/by/4.0/legalcode; info:eu-repo/semantics/openAccess
OpenAccess true
Contact https://rodare.hzdr.de/support
Representation
Language English
Resource Type Dataset
Version 1.0
Discipline Life Sciences; Natural Sciences; Engineering Sciences