Search papers, labs, and topics across Lattice.
5
19
10
29
K-means gets a 17.9x speed boost on modern GPUs thanks to a clever redesign that avoids memory bottlenecks and atomic write contention.
Models are substantially better at pairwise self-verification than independent scoring, unlocking a more efficient and accurate approach to test-time scaling for complex reasoning.
Uncertainty-driven dynamic compute allocation lets web agents outperform naive test-time scaling by 9.1% while using 2.3x fewer tokens.
By explicitly encoding 3D geometry, GeoDrive achieves more realistic and controllable autonomous driving scene modeling, outperforming prior world models in action accuracy and spatial awareness.
Forget sparse KV caches – QuantSpec's hierarchical 4-bit quantization unlocks 2.5x speedups in long-context LLM inference with >90% acceptance rates.