Search papers, labs, and topics across Lattice.
Monash University
2
0
4
Forget dataset-specific selection: a single, lightweight selector trained on one dataset can boost VLM instruction tuning performance across diverse datasets and model scales, even outperforming full data training.
Multi-level preference alignment in SignDPO significantly reduces semantic drift, outperforming traditional gloss-free models and challenging gloss-based benchmarks.