Search papers, labs, and topics across Lattice.
University of Science and Technology of China
1
0
2
Forget fixed uniform intervals: BPDQ unlocks high-fidelity 2-bit quantization for LLMs by adaptively shaping the quantization grid.