Search papers, labs, and topics across Lattice.
2
0
3
0
Steer LVLMs' attention with caption guidance and watch object hallucinations drop by 6%鈥攏o training required.
LVLMs can be boosted by 18.7% simply by focusing RLHF training on the few tokens that actually depend on visual input.