Z. Lin, T. Wang, and S. Zhang are with the College of Electronics and Information Engineering, Shenzhen University, Shenzhen 518052, China (e-mail: linaacc9595@gmail.com; ttwang@szu.edu.cn; zsl@szu.edu.cn). L. Shi is with the School of Electronic and Optical Engineering, Nanjing University of Science and Technology, Nanjing 210094, China (e-mail: longshi@njust.edu.cn). Shui Yu is with the School of Computer Science, University of Technology Sydney, Sydney, NSW 2007, Australia (e-mail: shui.yu@uts.edu.au). Part of this work was previously published in the Proceedings of the ACM Web Conference 2025 (WWW’25). Corresponding author: T. Wang.
Forget fixed uniform intervals: BPDQ unlocks high-fidelity 2-bit quantization for LLMs by adaptively shaping the quantization grid.