Search papers, labs, and topics across Lattice.
Wadhwani AI, IIIT Hyderabad
1
0
2
24
Forget salient cues – now you can *steer* visual representations in ViTs with language, focusing on any object you want without hurting overall performance.