Search papers, labs, and topics across Lattice.
Corresponding Author
1
0
3
LLMs can reason more accurately and concisely when RL is guided by token-level entropy, pinpointing and exploring "forks in the road" during the reasoning process.