Search papers, labs, and topics across Lattice.
1
0
3
Mimicking the human visual system's "glance and gaze" strategy lets video transformers capture long-range dependencies and achieve state-of-the-art action recognition.