Search papers, labs, and topics across Lattice.
Xidian University
1
0
3
Forget fixed-precision quantization: STQuant slashes optimizer memory by 84% in large model training by dynamically adapting bit-widths across layers and training steps.