Steer LVLMs' attention with caption guidance and watch object hallucinations drop by 6%, no training required.
Adversarial training of large vision models doesn't have to break the bank: CAAT achieves comparable robustness to standard methods by tuning just 6% of the parameters.