Search papers, labs, and topics across Lattice.
1
0
3
Sparsity isn't just for efficiency in LLMs; it's a secret weapon against the "curse of depth," boosting layer utilization and downstream accuracy by taming variance.