Indian Institute of Information Technology Allahabad (IIITA)National Institute of Electronics and Information Technology (NIELIT)Mar 12, 2026arXiv:2603.12183

Proof-Carrying Materials: Falsifiable Safety Certificates for Machine-Learned Interatomic Potentials

Abhinaba Basu, Pavan Chakraborty

AI Summary

The paper introduces Proof-Carrying Materials (PCM), a three-stage framework for generating falsifiable safety certificates for machine-learned interatomic potentials (MLIPs) used in materials screening. PCM employs adversarial falsification, bootstrap envelope refinement, and Lean 4 formal certification to identify and mitigate architecture-specific blind spots in MLIPs like CHGNet, TensorNet, and MACE. Applied to a thermoelectric screening case study, PCM improves the discovery yield of stable materials by 25% compared to single-MLIP screening.

Key Contribution

Machine-learned interatomic potentials miss 93% of stable materials, but a new auditing framework closes this gap and boosts discovery yield by 25%.

Abstract

Machine-learned interatomic potentials (MLIPs) are deployed for high-throughput materials screening without formal reliability guarantees. We show that a single MLIP used as a stability filter misses 93% of density functional theory (DFT)-stable materials (recall 0.07) on a 25,000-material benchmark. Proof-Carrying Materials (PCM) closes this gap through three stages: adversarial falsification across compositional space, bootstrap envelope refinement with 95% confidence intervals, and Lean 4 formal certification. Auditing CHGNet, TensorNet and MACE reveals architecture-specific blind spots with near-zero pairwise error correlations (r<= 0.13; n = 5,000), confirmed by independent Quantum ESPRESSO validation (20/20 converged; median DFT/CHGNet force ratio 12x). A risk model trained on PCM-discovered features predicts failures on unseen materials (AUC-ROC = 0.938 +/- 0.004) and transfers across architectures (cross-MLIP AUC-ROC ~ 0.70; feature importance r = 0.877). In a thermoelectric screening case study, PCM-audited protocols discover 62 additional stable materials missed by single-MLIP screening - a 25% improvement in discovery yield.

Eval Frameworks & Benchmarks Red-Teaming & Adversarial Robustness Scientific Discovery & Drug Design

Citation Metrics

Citations0

Influential citations0

References51

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Proof-Carrying Materials: Falsifiable Safety Certificates for Machine-Learned Interatomic Potentials

Related Papers