Using collective knowledge to assign oxidation states

DOI

Knowledge of the oxidation state of a metal centre in a material is essential to understand its properties. Chemists have developed several theories to predict the oxidation state on the basis of the chemical formula. These methods are quite successful for simple compounds but often fail to describe the oxidation states of more complex systems, such as metal-organic frameworks. In this work, we present a data-driven approach to automatically assign oxidation states, using a machine learning algorithm trained on the assignments by chemists encoded in the chemical names in the Cambridge Crystallographic Database. Our approach only considers the immediate local chemical environment around a metal centre and, in this way, is robust to most of the experimental uncertainties in these structures (like incorrect protonation or unbound solvents). We find such excellent accuracy (>98%) in our predictions that we can use our method to identify a large number of incorrect assignments in the database. The predictions of our model follow chemical intuition, without explicitly having taught the model those heuristics. This work nicely illustrates how powerful the collective knowledge of chemists actually is. Machine learning can harvest this knowledge and convert it into a useful tool for chemists.

Identifier
DOI https://doi.org/10.24435/materialscloud:2019.0085/v1
Related Identifier https://doi.org/10.5281/zenodo.3567274
Related Identifier https://doi.org/10.5281/zenodo.3567011
Related Identifier https://archive.materialscloud.org/communities/mcarchive
Related Identifier https://doi.org/10.24435/materialscloud:p2-k0
Metadata Access https://archive.materialscloud.org/oai2d?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:materialscloud.org:274
Provenance
Creator Jablonka, Kevin Maik; Ongari, Daniele; Moosavi, Seyed Mohamad; Smit, Berend
Publisher Materials Cloud
Contributor Smit, Berend
Publication Year 2019
Rights info:eu-repo/semantics/openAccess; Creative Commons Attribution 4.0 International; https://creativecommons.org/licenses/by/4.0/legalcode
OpenAccess true
Contact archive(at)materialscloud.org
Representation
Language English
Resource Type info:eu-repo/semantics/other
Format application/zip; text/plain; text/markdown
Discipline Materials Science and Engineering