Search papers, labs, and topics across Lattice.
1
0
2
3
LLMs can reason better when they're not forced to answer in English, and a new RL method leverages this quirk to boost performance across reasoning tasks.