Search papers, labs, and topics across Lattice.
2
0
5
0
Evolutionary search beats hand-tuned heuristics to find optimal, stage-wise pruning schedules for diffusion models, achieving better speed/quality tradeoffs.
Attention sinks, considered essential in autoregressive language models, turn out to be surprisingly prunable in diffusion language models, leading to better efficiency.