Search papers, labs, and topics across Lattice.
Xiamen University
2
0
6
Autoregressive video generation gets a 6x speed boost without sacrificing quality, thanks to a motion-aware caching strategy that finally respects the fact that not all pixels are created equal.
Current Omni-modal LLMs can ace perception tasks but still fail at basic social interactions like knowing when and how to jump into a conversation.