This paper investigates how misaligned Theory of Mind (ToM) orders affect LLM-based multi-agent coordination, finding that mismatches in ToM reasoning depth can hinder performance. To address this, the authors introduce an Adaptive ToM (A-ToM) agent that estimates a partner's ToM order from prior interactions and uses that estimate to predict the partner's actions. Empirical evaluations across four multi-agent tasks demonstrate that A-ToM improves coordination by aligning ToM orders.
Mismatched levels of "mind-reading" between AI agents tank their ability to collaborate, but a simple adaptive strategy can fix it.
Theory of Mind (ToM) refers to the ability to reason about others' mental states, and higher-order ToM involves considering that others also possess their own ToM. Equipping large language model (LLM)-driven agents with ToM has long been thought to improve their coordination in multi-agent collaborative tasks. However, we find that misaligned ToM orders (mismatches in the depth of ToM reasoning between agents) can lead to insufficient or excessive reasoning about others, thereby impairing their coordination. To address this issue, we design an adaptive ToM (A-ToM) agent, which can align its ToM order with its partner's. Based on prior interactions, the agent estimates the partner's likely ToM order and leverages this estimate to predict the partner's action, thereby facilitating behavioral coordination. We conduct empirical evaluations on four multi-agent coordination tasks: a repeated matrix game, two grid navigation tasks, and an Overcooked task. The results validate our findings on ToM alignment and demonstrate the effectiveness of our A-ToM agent. Furthermore, we discuss the generalizability of A-ToM to non-LLM-based agents, as well as the conditions that would diminish the importance of ToM alignment.
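The idea of estimating a partner's ToM order from past play can be illustrated with a toy sketch. This is not the paper's implementation: it assumes a two-action pure coordination game, a hypothetical level-k rule (`predict_action`) in which an order-0 agent repeats its own last action and an order-k agent matches the action it predicts for an order-(k-1) partner, and a simple hit-counting estimator (`AdaptiveAgent`). All names and the scoring rule are illustrative assumptions.

```python
def predict_action(order, own_prev, other_prev):
    """Action of an agent with the given ToM order (toy level-k rule, an assumption).

    Order 0 repeats its own previous action. Order k assumes the other agent
    is order k-1 and, since this is a coordination game, plays the action it
    predicts for them.
    """
    if order == 0:
        return own_prev
    # Swap perspectives: simulate the other agent one order lower.
    return predict_action(order - 1, other_prev, own_prev)


def play_fixed(order_a, order_b, rounds=10, start=(0, 1)):
    """Count coordinated rounds between two agents with fixed ToM orders."""
    a_prev, b_prev = start
    matches = 0
    for _ in range(rounds):
        a = predict_action(order_a, a_prev, b_prev)
        b = predict_action(order_b, b_prev, a_prev)
        matches += int(a == b)
        a_prev, b_prev = a, b
    return matches


class AdaptiveAgent:
    """Hypothetical adaptive agent: scores each candidate partner order by
    how often it would have predicted the partner's observed action."""

    def __init__(self, max_order=2):
        self.hits = [0] * (max_order + 1)

    def estimate_order(self):
        # Most hits wins; ties go to the lowest order.
        return max(range(len(self.hits)), key=lambda k: (self.hits[k], -k))

    def act(self, own_prev, partner_prev):
        k = self.estimate_order()
        # Predict the partner's action from the partner's perspective, then match it.
        return predict_action(k, partner_prev, own_prev)

    def observe(self, own_prev, partner_prev, partner_action):
        for k in range(len(self.hits)):
            if predict_action(k, partner_prev, own_prev) == partner_action:
                self.hits[k] += 1


def play(partner_order, rounds=10, start=(0, 1)):
    """Adaptive agent versus a fixed-order partner.

    Returns (coordinated rounds, final estimate of the partner's order).
    """
    agent = AdaptiveAgent()
    a_prev, p_prev = start
    matches = 0
    for _ in range(rounds):
        a = agent.act(a_prev, p_prev)
        p = predict_action(partner_order, p_prev, a_prev)
        agent.observe(a_prev, p_prev, p)
        matches += int(a == p)
        a_prev, p_prev = a, p
    return matches, agent.estimate_order()


if __name__ == "__main__":
    print(play_fixed(1, 1))  # two fixed order-1 agents
    print(play_fixed(0, 1))  # order-0 with order-1
    print(play(1))           # adaptive agent with an order-1 partner
```

In this toy setting, two fixed order-1 agents that start on different actions keep swapping and never coordinate, while the adaptive agent mis-coordinates once against an order-1 partner and then locks in. The toy rule also makes an order-2 agent behave exactly like an order-0 agent, so the estimator cannot tell those two apart; a richer task would be needed to separate them.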