Search papers, labs, and topics across Lattice.
2
0
4
10
LLMs can reason better if you force them to explore *different* ways of being right, not just be more random.
Forget fixed uniform intervals: BPDQ unlocks high-fidelity 2-bit quantization for LLMs by adaptively shaping the quantization grid.