Dalian University of Technology
Achieve near-lossless 4-bit quantization for LLMs in under a minute, without full fine-tuning, by correcting for non-uniform activation distributions.