Search papers, labs, and topics across Lattice.
1
0
2
By selectively amplifying updates to highly-utilized experts, Excitation rescues deep Mixture-of-Experts models from "structural confusion," enabling stable training where standard optimizers fail.