University of Pittsburgh
Seemingly strong NLI checkers can actually *hurt* medical RAG training by collapsing the RL gradient or triggering reward-hacking cascades like ultra-short answers and search avoidance.