Search papers, labs, and topics across Lattice.
The paper introduces ComplLLM, a post-training framework that fine-tunes LLMs to generate complementary signals for multi-agent decision-making by using decision-theoretic rewards based on information gain. This approach addresses the challenge of leveraging diverse agent perspectives effectively in decision pipelines. Experiments on synthetic and real-world tasks demonstrate that ComplLLM recovers known complementary information and provides plausible explanations, improving overall decision quality.
Unlock the power of LLMs to boost multi-agent decision pipelines by fine-tuning them to surface hidden, complementary signals that improve overall performance.
Multi-agent decision pipelines can outperform single agent workflows when complementarity holds, i.e., different agents bring unique information to the table to inform a final decision. We propose ComplLLM, a post-training framework based on decision theory that fine-tunes a decision-assistant LLM using complementary information as reward to output signals that complement existing agent decisions. We validate ComplLLM on synthetic and real-world tasks involving domain experts, demonstrating how the approach recovers known complementary information and produces plausible explanations of complementary signals to support downstream decision-makers.