Lattice
Lattice

Search

Search papers, labs, and topics across Lattice.

Directional Alignment Mitigates Reward Hacking in Reinforcement Learning for Language Models | Lattice