Search papers, labs, and topics across Lattice.
Shandong University
1
0
3
Cross-domain RL can actually *boost* reasoning in large models, if you use contrastive learning to transform harmful interference into beneficial knowledge transfer.