Search papers, labs, and topics across Lattice.
1
0
3
7
Even the best multimodal LLMs are surprisingly bad at understanding and remembering the "self" in egocentric video, lagging human performance by 40-50% on personalized question answering.