Search papers, labs, and topics across Lattice.
1
0
2
15
Standard RL critics for LLMs are basically useless, but these two simple methods can fix them.