Search papers, labs, and topics across Lattice.
Ant Group
2
0
4
GraphPO slashes redundancy in reasoning model training, enabling more efficient exploration and improved performance on complex tasks.
SPRI achieves a remarkable 3.39 BLEU point improvement over the best existing MoE upcycling method, demonstrating that pretrained weight structures can be effectively leveraged for better expert diversity.