Are Energy and Forces Really Enough? Using Structure to Evaluate the Accuracy and Transferability of Machine Learning Potentials of Biomolecules
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Molecular simulation is a powerful tool to describe chemical and physical processes across different length scales. Fundamental tradeoffs between the accuracy of quantum mechanics and speed of classical models makes simulating biosystems very difficult. The introduction of machine learning interatomic potentials (MLPs) has offered a potential solution which combines the accuracy of electronic structure theory with the computational scaling of classical approaches. Applying MLPs to biosystems could unlock new developments in drug discovery, biological mechanisms and interactions. Despite this promise, rigorous benchmarks are needed to determine whether MLPs reliably reproduce their first-principles counterparts. It is commonplace to use energies and forces as the primary performance metrics for MLPs. While essential, questions remain about the extent to which models minimized on these losses capture the fidelity of molecular dynamics. Through simulations of two isomeric biomolecules, this work shows that structural information must be added as a metric to assess both the overall accuracy and transferability of an MLP. Here, we show that when structural quantities are included alongside traditional energy- and force-based metrics, MLPs can achieve high fidelity to first-principles reference data and remain transferable across related chemical motifs. These findings underscore the importance of structure-based validation and support the continued use of MLPs for biomolecular simulations.