Search papers, labs, and topics across Lattice.
UC Berkeley 2 UT Austin 3 Stanford University 4 Princeton University 5 Together AI
Berkeley AI Research (BAIR)2
10
3
14
Verifier-free evolution can now match or exceed the performance of verifier-based methods, while slashing API costs by 3x and boosting throughput by 10x, thanks to a clever model orchestration strategy.
Forget sparse KV caches – QuantSpec's hierarchical 4-bit quantization unlocks 2.5x speedups in long-context LLM inference with >90% acceptance rates.