Search papers, labs, and topics across Lattice.
3
0
5
0
Finally, a blind face restoration method that doesn't just hallucinate details, but lets you precisely control facial attributes via text prompts while maintaining high fidelity.
Interactive avatars can now exhibit more emotionally appropriate and contextually aware facial behaviors thanks to a novel architecture that disentangles audio-driven lip movements from user-driven non-lip facial expressions.
Forget end-to-end video understanding: RieMind shows that explicitly grounding LLMs in 3D scene graphs unlocks a 16% jump in spatial reasoning, suggesting structured representations are the key.