Search papers, labs, and topics across Lattice.
This paper investigates the scaling challenges of machine-learned potentials for coarse-grained molecular dynamics, focusing on data demands for "bottom-up" coarse-graining objectives. They demonstrate that mean force matching requires significantly fewer training samples (50x) and less atomistic simulation time (87%) compared to other objectives. The resulting models exhibit improved accuracy on the potential of mean force for unseen proteins, highlighting the benefits of noise reduction in the objective function.
Mean force matching slashes the data requirements for training coarse-grained molecular dynamics potentials by 50x while boosting accuracy on unseen proteins.
Coarse-grained molecular dynamics often sacrifices accuracy and transferability for computational efficiency, but the use of machine learned potentials is helping coarse-grained models attain performance on par with atomistic molecular dynamics. Nevertheless, developing representations of the coarse-grained potential energy surface faces severe scaling challenges due to the extreme data demands of widely used "bottom-up" coarse-graining objectives. In this work, we show that mean force matching, a strategy for training thermodynamically consistent coarse-grained models, requires 50x fewer training samples and 87% less total atomistic simulation time, while obtaining better accuracy on the potential of mean force for unseen proteins compared to other commonly used objectives. By systematically removing noise from the objective function, we demonstrate that it is possible to scale machine learning architectures for coarse-graining, enabling highly accurate and transferable models. We show the advantages of mean force matching both theoretically and through exhaustive benchmarking using thermodynamic consistency as the primary metric of accuracy.