1
0
2
5
Forget salient cues – now you can *steer* visual representations in ViTs with language, focusing on any object you want without hurting overall performance.