Search papers, labs, and topics across Lattice.
3
0
6
ZPPO reveals that embedding teacher responses in prompts rather than gradients can dramatically boost the performance of small student models on challenging tasks.
Achieving six times the inference throughput of current LLMs while maintaining accuracy, Nemotron 3 Ultra redefines performance benchmarks for agentic reasoning tasks.
Nemotron 3 Super proves you can achieve comparable accuracy to existing 120B models, but with significantly higher inference throughput, by combining Mamba, Attention, and Mixture-of-Experts.