Search papers, labs, and topics across Lattice.
2
0
6
10
A unified assessment framework reveals hidden insights about agent performance, transforming how we evaluate AI systems.
Achieve state-of-the-art reasoning performance with a 15B parameter model that produces 30-50% shorter reasoning traces, demonstrating that efficient reasoning doesn't require massive model size.