Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences
Squeezing intermediate tensors with FP8 quantization and adaptive transforms can nearly double the throughput of tensor-parallel LLM training without sacrificing accuracy.