dMoE slashes the memory footprint of Mixture-of-Experts Diffusion LLMs by up to 80% without sacrificing performance, finally making them practical.