Search papers, labs, and topics across Lattice.
Institute of High Performance Computing, Agency for Science, Technology and Research, Singapore
1
0
1
Adjusting gate sharpness based on routing confidence leads to significant performance gains in Mixture-of-Experts models without the burden of extra parameters.