Search papers, labs, and topics across Lattice.
This paper introduces FRANZ, an automated framework designed to evaluate how large language models (LLMs) frame their responses to subjective questions, emphasizing the importance of communication style over mere factual correctness. By analyzing responses across four dimensions鈥攃ultural positioning, generalizing language, anthropomorphic cues, and conversational maxims鈥攖he study reveals significant differences in response characteristics among various LLMs. The findings highlight a notable coupling between insider positioning and anthropomorphism, which varies by country, underscoring the nuanced ways LLMs engage with cultural contexts in their responses.
LLMs not only answer questions but also shape user perception through their framing, with insider positioning and anthropomorphism linked in unexpected ways across cultures.
Large language models (LLMs) are being increasingly used to answer subjective, information-seeking questions, where users are sensitive to how responses are communicated, not just whether the answers are correct. Existing LLM evaluations for subjective cultural queries largely focus on factual correctness, ignoring how the response is framed. To this end, we introduce FRANZ, an automated FRAmework for respoNse characteriZation to conduct communicative audit of LLM responses along four dimensions: cultural positioning, use of generalizing language, anthropomorphic cues, and adherence to conversational maxims. To enable this evaluation, we contribute SQUARE - a corpus of 376k subjective questions sourced from 57 subreddits, and mapped to 7 countries and 19 question categories. We demonstrate FRANZ's applicability by scoring responses from three open-weight LLMs. We observe that LLMs show statistically significant differences in the frequency with which they employ each response characteristic. Unlike single-dimensional audits, FRANZ reveals that insider positioning and anthropomorphism are positively coupled, with the degree of coupling varying by country, providing a diagnostic lens for identifying framing divergences.