Apr 27, 2026arXiv:2604.24076

An Information-Geometric Framework for Stability Analysis of Large Language Models under Entropic Stress

AI Summary

This paper introduces a thermodynamic-inspired framework to analyze the stability of LLM outputs under uncertainty, using a composite stability score that integrates task utility, entropy, internal integration, and aligned reflective capacity. They analyze 80 model-scenario observations across four LLMs using the IST-20 benchmark, demonstrating that the proposed formulation yields higher stability scores compared to a utility-entropy baseline, particularly under high entropy conditions. The framework provides a compact and interpretable perspective connecting uncertainty, performance, and internal structure for LLM evaluation.

Key Contribution

LLM stability under uncertainty isn't just about accuracy – a new information-geometric framework reveals how internal model structure non-linearly attenuates the impact of disorder.

Abstract

As large language models (LLMs) are increasingly deployed in high-stakes and operational settings, evaluation strategies based solely on aggregate accuracy are often insucient to characterize system reliability. This study proposes a thermodynamic inspired modeling framework for analyzing the stability of LLM outputs under conditions of uncertainty and perturbation. The framework introduces a composite stability score that integrates task utility, entropy as a measure of external uncertainty, and two internal structural proxies: internal integration and aligned reective capacity. Rather than interpreting these quantities as physical variables, the formulation is intended as an interpretable abstraction that captures how internal structure may modulate the impact of disorder on model behavior. Using the IST-20 benchmarking protocol and associated metadata, we analyze 80 modelscenario observations across four contemporary LLMs. The proposed formulation consistently yields higher stability scores than a reduced utilityentropy baseline, with a mean improvement of 0.0299 (95% CI: 0.02470.0351). The observed gain is more pronounced under higher entropy conditions, suggesting that the framework captures a form of nonlinear attenuation of uncertainty. We do not claim a fundamental physical law or a complete theory of machine ethics. Instead, the contribution of this work is a compact and interpretable modeling perspective that connects uncertainty, performance, and internal structure within a unied evaluation lens. The framework is intended to complement existing benchmarking approaches and to support ongoing discussions in AI safety, reliability, and governance.

Eval Frameworks & Benchmarks Natural Language Processing Red-Teaming & Adversarial Robustness

Citation Metrics

Citations0

Influential citations0

References24

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

An Information-Geometric Framework for Stability Analysis of Large Language Models under Entropic Stress

Related Papers