USC | Apr 20, 2026 | arXiv:2604.17819

PDDL-Mind: Large Language Models are Capable on Belief Reasoning with Reliable State Tracking

Wang Bill Zhu, Qiutong Tony Yi, Jesse Thomason

AI Summary

This paper introduces PDDL-Mind, a neuro-symbolic framework that improves large language models' (LLMs) performance on theory-of-mind (ToM) tasks by making state tracking explicit. The authors argue that earlier LLM failures stem from unreliable implicit state tracking rather than from deficient reasoning. By using Planning Domain Definition Language (PDDL) to represent world states explicitly, PDDL-Mind achieves an absolute accuracy gain of more than 5% over the best existing method on ToM benchmarks.

Key Contribution

LLMs can achieve significant gains in theory-of-mind reasoning when given explicit state representations, which suggests that their failures stem primarily from unreliable state tracking rather than from limited reasoning ability.

Abstract

Large language models (LLMs) perform substantially below human level on existing theory-of-mind (ToM) benchmarks, even when augmented with chain-of-thought prompting or probabilistic belief updates. We argue that these failures primarily arise from unreliable implicit state tracking rather than limitations in high-level reasoning. We introduce PDDL-Mind, a neuro-symbolic framework that decouples environment state evolution from belief inference. By translating narrative descriptions into explicit states and actions expressed in Planning Domain Definition Language (PDDL), and by verifying action-induced state transitions against a predefined domain, PDDL-Mind provides LLMs with a logically consistent and explicit representation of world states for ToM tasks. Experiments on MMToM-QA, MuMA and FanToM show that PDDL-Mind achieves over 5% absolute accuracy gain over the best existing state-of-the-art method on ToM benchmark questions.
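The idea of verifying action-induced state transitions against a predefined domain can be illustrated with a minimal sketch. This is not the authors' implementation and does not parse real PDDL: the `Action` class, the `move` and `relocate` action schemas, and the Sally-Anne style story are all hypothetical, chosen only to show how preconditions and add/delete effects yield an explicit, checkable trace of world states that could then be passed to an LLM for belief inference.

```python
from dataclasses import dataclass

# A world state is a frozenset of ground facts such as ("at", "sally", "room").

@dataclass(frozen=True)
class Action:
    """A ground action in the style of a PDDL operator (hypothetical schema)."""
    name: str
    preconditions: frozenset  # facts that must hold before the action
    add_effects: frozenset    # facts the action makes true
    del_effects: frozenset    # facts the action makes false


def apply_action(state, action):
    """Return the successor state, or raise if a precondition is unmet."""
    missing = action.preconditions - state
    if missing:
        raise ValueError(f"{action.name}: unmet preconditions {sorted(missing)}")
    return (state - action.del_effects) | action.add_effects


def move(agent, src, dst):
    """Hypothetical action: an agent walks from one location to another."""
    return Action(
        name=f"move({agent}, {src}, {dst})",
        preconditions=frozenset({("at", agent, src)}),
        add_effects=frozenset({("at", agent, dst)}),
        del_effects=frozenset({("at", agent, src)}),
    )


def relocate(agent, obj, src, dst, room):
    """Hypothetical action: an agent in `room` moves an object between containers."""
    return Action(
        name=f"relocate({agent}, {obj}, {src}, {dst})",
        preconditions=frozenset({("at", agent, room), ("in", obj, src)}),
        add_effects=frozenset({("in", obj, dst)}),
        del_effects=frozenset({("in", obj, src)}),
    )


def track(initial, actions):
    """Apply actions in order and return every intermediate state."""
    trace = [initial]
    for action in actions:
        trace.append(apply_action(trace[-1], action))
    return trace


initial = frozenset({
    ("at", "sally", "room"),
    ("at", "anne", "room"),
    ("in", "marble", "basket"),
})

trace = track(initial, [
    move("sally", "room", "hall"),
    relocate("anne", "marble", "basket", "box", "room"),
])
final_state = trace[-1]
```

Because each step is checked against its preconditions, an inconsistent translation of the narrative (for example, Sally leaving a room she is no longer in) raises an error instead of silently corrupting later states. The sketch covers only environment state evolution; belief inference over the resulting trace is left out.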

Eval Frameworks & Benchmarks | Reasoning & Chain-of-Thought | Tool Use & Agents

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
