LLMs can reason better when they're not forced to answer in English, and a new RL method leverages this quirk to boost performance across reasoning tasks.