Search papers, labs, and topics across Lattice.
School of Software Engineering, University of Science and Technology of China
2
0
4
OpenPangu-7B inference on NPUs gets a serious speed boost via a custom-tailored speculative decoding scheme.
Forget slow NTTs: Hermes' hybrid dataflow architecture delivers up to 13.6x faster throughput for homomorphic encryption than GPUs.