Search papers, labs, and topics across Lattice.
The University of Texas at Austin
1
0
2
LLMs are twice as likely as humans to repeat the same support tactic in a conversation, but a simple RL reward for tactic novelty can fix it.