Search papers, labs, and topics across Lattice.
University of Maryland College
2
0
5
Reasoning models may boost performance but often sacrifice critical alignment behaviors, revealing a hidden trade-off in AI safety.
Instead of imitating reflections, LLM agents can be trained to reason about action quality by rewarding correct judgments between alternative actions, leading to improved performance and generalization.