CMU MLIIT DelhiMax PlanckFeb 17, 2026arXiv:2602.15456

In Agents We Trust, but Who Do Agents Trust? Latent Source Preferences Steer LLM Generations

Mahsa Amani, Soumi Das, Bishwamittra Ghosh, Qinyuan Wu, Krishna P. Gummadi, Abhilasha Ravichander

AI Summary

This paper investigates latent source preferences in LLMs when used as agents that filter and present information from attributed sources. Through controlled experiments on twelve LLMs, the authors demonstrate that LLMs exhibit systematic biases, prioritizing information from certain sources over others, even when content is held constant and despite explicit prompting. These source preferences are sensitive to contextual framing, can outweigh content relevance, and help explain observed biases in downstream tasks like news recommendation.

Key Contribution

LLMs exhibit surprisingly strong and predictable biases towards specific information sources, even overriding content relevance and explicit instructions.

Abstract

Agents based on Large Language Models (LLMs) are increasingly being deployed as interfaces to information on online platforms. These agents filter, prioritize, and synthesize information retrieved from the platforms' back-end databases or via web search. In these scenarios, LLM agents govern the information users receive, by drawing users' attention to particular instances of retrieved information at the expense of others. While much prior work has focused on biases in the information LLMs themselves generate, less attention has been paid to the factors that influence what information LLMs select and present to users. We hypothesize that when information is attributed to specific sources (e.g., particular publishers, journals, or platforms), current LLMs exhibit systematic latent source preferences- that is, they prioritize information from some sources over others. Through controlled experiments on twelve LLMs from six model providers, spanning both synthetic and real-world tasks, we find that several models consistently exhibit strong and predictable source preferences. These preferences are sensitive to contextual framing, can outweigh the influence of content itself, and persist despite explicit prompting to avoid them. They also help explain phenomena such as the observed left-leaning skew in news recommendations in prior work. Our findings advocate for deeper investigation into the origins of these preferences, as well as for mechanisms that provide users with transparency and control over the biases guiding LLM-powered agents.

Constitutional AI & AI Ethics Recommendation & Information Retrieval Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

In Agents We Trust, but Who Do Agents Trust? Latent Source Preferences Steer LLM Generations

Related Papers