Search papers, labs, and topics across Lattice.
National University of Defense Technology
2
4
6
6
21
Denoising diffusion models can disentangle user preferences from noisy micro-video interactions, leading to better recommendations.
Speculative decoding for Llama just got 10% faster, thanks to production-scale optimizations that unlock new levels of inference efficiency.