Hong Kong University of Science and Technology (Guangzhou)
Forget fixed uniform intervals: BPDQ unlocks high-fidelity 2-bit quantization for LLMs by adaptively shaping the quantization grid.