Search papers, labs, and topics across Lattice.
This study investigates the alignment between large language models (LLMs) and human brain mechanisms related to reasoning, specifically through the lens of deductive reasoning. By employing a neural-predictivity metric, the authors reveal that LLM internal representations can be enhanced by neural signals from reasoning-related brain regions, leading to significant improvements in reasoning accuracy across multiple models. The proposed brain-guided framework not only demonstrates that task-evoked brain signals can enhance LLM reasoning but also achieves up to a 13% absolute accuracy gain, showcasing a novel pathway for aligning AI with human cognitive processes.
Task-evoked brain signals can boost LLM reasoning accuracy by up to 13%, revealing a powerful new avenue for cognitive alignment in AI.
The correspondence between large language models (LLMs) and the neural mechanisms underlying human higher-order cognition remains insufficiently characterized. Given that language and reasoning in the human brain appear dissociable, an open question is whether LLMs align with neural signals from reasoning-related regions and whether such signals can improve them. Here, focusing on deductive reasoning, we show that LLM internal representations are not only partially aligned with task-fMRI activity but can also be directly enhanced by these signals. Using a neural-predictivity metric, we find that LLMs explain a substantial fraction of the explainable variance in reasoning-related regions at the aggregate level, whereas predictivity within specific reasoning types is lower, indicating both alignment and divergence. Building on this, we propose a brain-guided framework: we steer model representations along directions induced by the joint structure of model and brain representations, applying intervention at inference and fine-tuning during training. We demonstrate that task-evoked brain signals can directly enhance LLM reasoning, yielding gains orthogonal to language-only supervision across 10 LLMs (1.5B-72B), with transfer across reasoning types and up to 13\% absolute accuracy gain. Our results advance LLM-brain correspondences from correlation to guidance, establishing a brain-signal-driven pathway toward more robust and cognitively aligned AI.