RL fine-tuning can make your role-playing agent *worse* at embodying its character, unless you carefully balance task rewards with stylistic constraints.