Search papers, labs, and topics across Lattice.
2
0
4
Achieving six times the inference throughput of current LLMs while maintaining accuracy, Nemotron 3 Ultra redefines performance benchmarks for agentic reasoning tasks.
Riemannian optimization for low-rank architectures reveals intriguing design choices, yet struggles to surpass traditional AdamW performance.