Search papers, labs, and topics across Lattice.
2
0
5
Decomposing complex reasoning problems into verifiable subproblems unlocks significant performance gains in LLM reasoning, especially on hard problems previously stuck in gradient dead zones.
Fine-tuning LLMs on datasets filtered at the token level, rather than the sentence level, can boost performance by up to 13.7%.