University of British Columbia, Vancouver, BC, Canada; Vector Institute, Toronto, ON, Canada
Current vision-language models falter in ultra-resolution reasoning, with errors primarily stemming from evidence grounding and local perception.
Reward hacking isn't just about incentives; it's also about wild directional swings in your model's parameter space, and constraining those swings can keep your LM on the straight and narrow.