Search papers, labs, and topics across Lattice.
University of New South Wales
2
0
5
0
Turns out, telling LLMs *not* to use the answer when generating reverse chain-of-thought reasoning can actually make them *more* reliant on it—but a skeleton-guided approach breaks the cycle.
A 3B parameter model now rivals models 10x its size in reasoning, alignment, and agentic tasks, challenging the assumption that bigger is always better.