-
Training sets based on uncertainty estimates in the cluster-expansion method
Cluster expansion (CE) has gained an increasing level of popularity in recent years, and many strategies have been proposed for training and fitting the CE models to... -
A data-science approach to predict the heat capacity of nanoporous materials
The heat capacity of a material is a fundamental property that is of significant practical importance. For example, in a carbon capture process, the heat required to regenerate... -
Shadow-light images of simulated 25 classes of surface roughness for automati...
Many relationships important to civil engineering depend on surface roughness (morphology). Examples are the bond strength between concrete layers, the adhesion of a wheel to... -
Machine learning of superconducting critical temperature from Eliashberg theory
The Eliashberg theory of superconductivity accounts for the fundamental physics of conventional electron-phonon superconductors, including the retardation of the interaction and... -
Physics-inspired equivariant descriptors of non-bonded interactions
One essential ingredient in many machine learning (ML) based methods for atomistic modeling of materials and molecules is the use of locality. While allowing better system-size... -
Ligand optimization of exchange interaction in Co(II) dimer single molecule m...
This record contains data structures used in the manuscript titled Ligand optimization of exchange interaction in Co(II) dimer single molecule magnet by machine learning.... -
Electronic excited states from physically-constrained machine learning
Data-driven techniques are increasingly used to replace electronic-structure calculations of matter. In this context, a relevant question is whether machine learning (ML) should... -
Electronic excited states from physically-constrained machine learning
Data-driven techniques are increasingly used to replace electronic-structure calculations of matter. In this context, a relevant question is whether machine learning (ML) should... -
Predicting hot-electron free energies from ground-state data
Machine-learning potentials are usually trained on the ground-state, Born-Oppenheimer energy surface, which depends exclusively on the atomic positions and not on the simulation... -
Adsorbate chemical environment-based machine learning framework for heterogen...
Heterogeneous catalytic reactions are influenced by a subtle interplay of atomic-scale factors, ranging from the catalysts' local morphology to the presence of high adsorbate... -
Dynamics of the charge transfer to solvent process in aqueous iodide
Charge-transfer-to-solvent states in aqueous halides are ideal systems for studying the electron-transfer dynamics to the solvent involving a complex interplay between... -
Exploring the design space of machine-learning models for quantum chemistry w...
Traditional atomistic machine learning (ML) models serve as surrogates for quantum mechanical (QM) properties, predicting quantities such as dipole moments and polarizabilities,... -
A dataset for beta-glycine with Wannier centers
The beta-glycine dataset is created with the purpose of validating the electron machine learning potential (eMLP) on crystalline beta glycine. It contains 25,676 configurations... -
Pure isotropic proton NMR spectra in solids using deep learning
The resolution of proton solid-state NMR spectra is usually limited by broadening arising from dipolar interactions between spins. Magic-angle spinning alleviates this... -
Digital soil mapping predicted on mid-infrared (MIR) spectroscopy measurement...
Soil information is valuable for many disciplines (e.g. agriculture, geomorphology, geology, archaeology) and can be used to produce maps or statistics on soil productivity. As... -
Predicted Antarctic Heat Flow and Uncertainties using Machine Learning
We predicted Antarctic Geothermal Heat Flow (GHF) using a machine learning approach. The adopted approach estimates GHF from multiple geophysical and geological data sets,... -
The MSC Data Set
From this page you can download resources we created for modal sense classification as reported in Zhou et al. (2015), Marasović et al. (2016) and Marasović and Frank (2015)... -
Training and development dataset for information extraction in plant epidemio...
The “Training and development dataset for information extraction in plant epidemiomonitoring” is the annotation set of the “Corpus for the epidemiomonitoring of plant”. The... -
Multiple exposure speckle imaging contrast database obtained in vitro on micr...
The dataset is composed of speckle contrast images of microfluidic channels with different flows and different exposure times. Speckle contrast images are provided for two... -
Data from Hackathon "GenHack 3 Generative Modeling Challenge": Predicting mai...
Hackathon Overview The GenHack 3 is a data challenge organized by École Polytechnique in 2024. The task was to construct generative models for predicting maize crop...