Maximum volume simplex method for automatic selection and classification of atomic environments and environment descriptor compression


Fingerprint distances, which measure the similarity of atomic environments, are commonly calculated from atomic environment fingerprint vectors. In this work, we present the simplex method that can perform the inverse operation, i.e., calculating fingerprint vectors from fingerprint distances. The fingerprint vectors found in this way point to the corners of a simplex. For a large dataset of fingerprints, we can find a particular largest simplex, whose dimension gives the effective dimension of the fingerprint vector space. We show that the corners of this simplex correspond to landmark environments that can be used in a fully automatic way to analyze structures. In this way, we can, for instance, detect atoms in grain boundaries or on edges of carbon flakes without any human input about the expected environment. By projecting fingerprints on the largest simplex, we can also obtain fingerprint vectors that are considerably shorter than the original ones but whose information content is not significantly reduced.

Metadata Access
Creator Parsaeifard, Behnam; Tomerini, Daniele; De, Deb Sankar; Goedecker, Stefan
Publisher Materials Cloud
Publication Year 2021
Rights info:eu-repo/semantics/openAccess; Creative Commons Attribution 4.0 International
OpenAccess true
Contact archive(at)
Language English
Resource Type Dataset
Discipline Materials Science and Engineering