Search papers, labs, and topics across Lattice.
2
0
5
3
Adversarially perturbing model weights during fine-tuning dramatically improves generalization in composed image retrieval, a counterintuitive approach that yields state-of-the-art results.
Multimodal models are surprisingly vulnerable to subtle text-based backdoor attacks using common words, making them far more practical and stealthy than previous visual trigger methods.