University of Windsor
DPO-based post-training can significantly boost the translation quality of pre-trained NMT models like gemma3-1b, even without additional parallel data.