Provable Responsible AI and Data Analytics (PRADA) Lab, King Abdullah University of Science and Technology
Fine-tuning VLMs on threat-related images alone can significantly improve safety without any explicit safety labels, revealing a surprising visual pathway for alignment.