Chinese University of Hong Kong, Hong Kong, China
Naive quantization can paradoxically *slow down* LLM inference, but Quantix flips the script with 11x speedups via hardware-aware data layout and kernel fusion.