Determining the optimal structural resolution of proteins through an information-theoretic analysis of their conformational ensemble

DOI

<p>The choice of structural resolution is a fundamental aspect of protein modelling, determining the balance between descriptive power and interpretability. Although atomistic simulations provide maximal detail, much of this information is redundant to understand the relevant large-scale motions and conformational states. Here, we introduce an unsupervised, information-theoretic framework that determines the minimal number of atoms required to retain a maximally informative description of the configurational space sampled by a protein. This framework quantifies the informativeness of coarse-grained representations obtained by systematically decimating atomic degrees of freedom and evaluating the resulting clustering of sampled conformations. Application to molecular dynamics trajectories of dynamically  diverse proteins shows that the optimal number of retained atoms scales linearly with system size, averaging about four heavy atoms per residue--remarkably consistent with the resolution of well-established coarse-grained models, such as MARTINI and SIRAH. Furthermore, the analysis shows that the optimal retained atoms number depends not only on molecular size but also on the extent of conformational exploration, decreasing for systems dominated by collective motions. The proposed method establishes a general criterion to identify the minimal structural detail that preserves the essential configurational information, thereby offering a new viewpoint on the structure-dynamics-function relationship in proteins and guiding the construction of parsimonious yet informative multiscale models.</p>

Identifier
DOI https://doi.org/10.24435/materialscloud:gh-yf
Related Identifier https://doi.org/10.48550/arXiv.2311.08076
Related Identifier https://archive.materialscloud.org/communities/mcarchive
Related Identifier https://doi.org/10.24435/materialscloud:sr-9q
Metadata Access https://archive.materialscloud.org/oai2d?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:materialscloud.org:czba5-18919
Provenance
Creator Mele, Margherita; Fiorentini, Raffaele; Tarenzi, Thomas; Mattiotti, Giovanni; Potestio, Raffaello
Publisher Materials Cloud
Contributor Mele, Margherita; Tarenzi, Thomas; Potestio, Raffaello
Publication Year 2025
Rights info:eu-repo/semantics/openAccess; Creative Commons Attribution 4.0 International; https://creativecommons.org/licenses/by/4.0/legalcode
OpenAccess true
Contact archive(at)materialscloud.org
Representation
Language English
Resource Type info:eu-repo/semantics/other
Format text/plain; application/zip
Discipline Materials Science and Engineering