Instead of training experts sequentially, Co-Evolving Policy Distillation (CoPD) trains them in parallel with mutual teaching, integrating diverse reasoning capabilities all in one and outperforming even domain-specific experts.