Search papers, labs, and topics across Lattice.
2
0
5
LLM judges in multi-stakeholder settings suffer from "weighting noise" that gets *worse* as you add more stakeholders, but fixing weights upfront can stabilize the process.
Frontier LLMs still struggle with preference coverage and group fairness when planning travel for multiple users, revealing a critical gap in real-world agent capabilities.