CMU ML | Mar 3, 2026 | arXiv:2603.02491

What Capable Agents Must Know: Selection Theorems for Robust Decision-Making under Uncertainty

AI Summary

This paper proves quantitative "selection theorems" showing that an agent achieving low average-case regret on structured families of action-conditioned prediction tasks *must* implement a predictive, structured internal state. The theorems cover stochastic policies and partial observability, and they do not assume optimality, determinism, or access to an explicit model. By reducing predictive modeling to binary betting decisions, the authors show that regret bounds limit the probability mass an agent can place on suboptimal bets, which enforces the predictive distinctions needed to separate high-margin outcomes. In fully observed settings this yields approximate recovery of the interventional transition kernel; under partial observability it implies that belief-like memory and predictive state are necessary, addressing an open question in prior world-model recovery work.

Key Contribution

Rather than assuming a hand-engineered world model, this work proves that any agent with low average-case regret under uncertainty *must* internally represent the world in a structured, predictive way.

Abstract

As artificial agents become increasingly capable, what internal structure is *necessary* for an agent to act competently under uncertainty? Classical results show that optimal control can be *implemented* using belief states or world models, but not that such representations are required. We prove quantitative "selection theorems" showing that low *average-case regret* on structured families of action-conditioned prediction tasks forces an agent to implement a predictive, structured internal state. Our results cover stochastic policies, partial observability, and evaluation under task distributions, without assuming optimality, determinism, or access to an explicit model. Technically, we reduce predictive modeling to binary "betting" decisions and show that regret bounds limit probability mass on suboptimal bets, enforcing the predictive distinctions needed to separate high-margin outcomes. In fully observed settings, this yields approximate recovery of the interventional transition kernel; under partial observability, it implies necessity of belief-like memory and predictive state, addressing an open question in prior world-model recovery work.
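The betting reduction described in the abstract can be illustrated with a toy calculation. The sketch below is not the paper's construction; it assumes a single hypothetical binary bet in which betting "yes" pays the outcome minus a threshold `q`, betting "no" pays zero, and the event has true probability `p`. Under those assumptions the margin is `|p - q|`, a stochastic policy's expected regret is its probability mass on the suboptimal bet times the margin, and so a regret bound `eps` caps that mass at `eps / margin`. All function names and parameters here are illustrative.

```python
def expected_regret(p: float, q: float, prob_yes: float) -> float:
    """Expected regret of a stochastic policy on one toy binary bet.

    Assumed payoff model (illustrative, not from the paper):
    betting "yes" has expected payoff p - q, betting "no" pays 0.
    """
    margin = abs(p - q)            # payoff gap between the two bets
    optimal_is_yes = p > q         # "yes" is optimal when p exceeds the threshold
    # Probability mass the policy places on the suboptimal bet.
    mass_suboptimal = (1.0 - prob_yes) if optimal_is_yes else prob_yes
    return mass_suboptimal * margin


def max_suboptimal_mass(p: float, q: float, eps: float) -> float:
    """Largest mass on the suboptimal bet compatible with regret <= eps."""
    margin = abs(p - q)
    if margin == 0.0:
        return 1.0                 # both bets are equally good; no constraint
    return min(1.0, eps / margin)


if __name__ == "__main__":
    # High-margin bet: p = 0.8 against threshold q = 0.5, margin about 0.3.
    print(expected_regret(0.8, 0.5, prob_yes=0.9))
    print(max_suboptimal_mass(0.8, 0.5, eps=0.03))
```

In this toy setting, the larger the margin, the less mass a low-regret policy can place on the wrong side of the threshold, which is the sense in which regret bounds force the agent to distinguish high-margin outcomes.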

Scalable Oversight & Alignment Theory | Tool Use & Agents | World Models & Planning

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
