Search papers, labs, and topics across Lattice.
1
0
2
Forget salient cues – now you can *steer* visual representations in ViTs with language, focusing on any object you want without hurting overall performance.