Search papers, labs, and topics across Lattice.
1
0
3
Turns out, just showing a vision-language model the last few frames of a video stream beats fancy architectures designed for long-term memory.