Search papers, labs, and topics across Lattice.
University of Maryland
2
0
5
SPRI achieves a remarkable 3.39 BLEU point improvement over the best existing MoE upcycling method, demonstrating that pretrained weight structures can be effectively leveraged for better expert diversity.
Reinforcement learning can significantly enhance adaptive sampling in large language models, leading to better performance with fewer resources.