Search papers, labs, and topics across Lattice.
KAIST
1
0
4
2
Diversity-aware scoring transforms MoE models into dense architectures, boosting downstream accuracy by over 6% while speeding up training.