evolvinglmms-lab.github.io/ParaVT
RL fine-tuning of LMMs for tool use can collapse structural formats because of strong pretrained tool priors, but a surprisingly simple fix, targeted format rewards combined with frame-budget randomization, can restore stability and boost performance.