Comparing the latent features of universal machine-learning interatomic potentials

DOI

<p>The past few years have seen the development of ``universal'' machine-learning interatomic potentials (uMLIPs) capable of approximating the ground-state potential energy surface across a wide range of chemical structures and compositions with reasonable accuracy. While these models differ in the architecture and the dataset used, they share the ability to compress a staggering amount of chemical information into descriptive latent features. Herein, we systematically analyze what the different uMLIPs have learned by quantitatively assessing the relative information content of their latent features with feature reconstruction errors as metrics, and observing how the trends are affected by the choice of training set and training protocol. We find that the uMLIPs encode chemical space in significantly distinct ways, with substantial cross-model feature reconstruction errors. When variants of the same model architecture are considered, trends become dependent on the dataset, target, and training protocol of choice. We also observe that fine-tuning of a uMLIP retains a strong pre-training bias in the latent features. Finally, we discuss how atom-level features, which are directly output by MLIPs, can be compressed into global structure-level features via concatenation of progressive cumulants, each adding significantly new information about the variability across the atomic environments within a given system.</p>

<div> <p>This repository contains the scripts to reproduce results presented in the associated paper/preprint.</p> </div>

Identifier
DOI https://doi.org/10.24435/materialscloud:gb-5z
Related Identifier https://arxiv.org/abs/2512.05717
Related Identifier https://archive.materialscloud.org/communities/mcarchive
Related Identifier https://doi.org/10.24435/materialscloud:a5-zz
Metadata Access https://archive.materialscloud.org/oai2d?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:materialscloud.org:6ngsf-rtg98
Provenance
Creator Chorna, Sofiia; Tisi, Davide; Malosso, Cesare; How, Wei Bin; Ceriotti, Michele; Chong, Sanggyu
Publisher Materials Cloud
Contributor Ceriotti, Michele; Chong, Sanggyu
Publication Year 2025
Rights info:eu-repo/semantics/openAccess; Creative Commons Attribution 4.0 International; https://creativecommons.org/licenses/by/4.0/legalcode
OpenAccess true
Contact archive(at)materialscloud.org
Representation
Language English
Resource Type info:eu-repo/semantics/other
Format text/markdown; application/zip
Discipline Materials Science and Engineering