HKUFeb 23, 2026arXiv:2602.19458

ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

Ziyang Guo, Yifan Wu, Jason Hartline, Kenneth Holstein, Jessica Hullman

AI Summary

The paper introduces ComplLLM, a post-training framework that fine-tunes LLMs to generate complementary signals for multi-agent decision-making by using decision-theoretic rewards based on information gain. This approach addresses the challenge of leveraging diverse agent perspectives effectively in decision pipelines. Experiments on synthetic and real-world tasks demonstrate that ComplLLM recovers known complementary information and provides plausible explanations, improving overall decision quality.

Key Contribution

Unlock the power of LLMs to boost multi-agent decision pipelines by fine-tuning them to surface hidden, complementary signals that improve overall performance.

Abstract

Multi-agent decision pipelines can outperform single agent workflows when complementarity holds, i.e., different agents bring unique information to the table to inform a final decision. We propose ComplLLM, a post-training framework based on decision theory that fine-tunes a decision-assistant LLM using complementary information as reward to output signals that complement existing agent decisions. We validate ComplLLM on synthetic and real-world tasks involving domain experts, demonstrating how the approach recovers known complementary information and produces plausible explanations of complementary signals to support downstream decision-makers.

Natural Language Processing RLHF & Preference Learning Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

Related Papers