LLMs stubbornly stick to task-appropriate reasoning even when explicitly instructed to use conflicting logic, but targeted interventions can nudge them towards better instruction following.
Output diversity in post-trained models collapses due to training data composition, not just post-training methods, challenging assumptions about inference-time fixes.
VLMs are more easily swayed by misleading text than you think, and their impressive reasoning chains can mask, rather than reveal, this over-reliance on language.