Search papers, labs, and topics across Lattice.
7
0
12
Achieving photorealistic 3D human avatars from a single image in under a second could revolutionize virtual reality and gaming applications.
Forget object-centric prompts: Function2Scene designs 3D indoor scenes directly from natural language descriptions of *how* the space will be used, not just *what* furniture to put there.
A 440MB multilingual translation model now rivals commercial APIs, opening the door for performant on-device translation.
LLMs can achieve state-of-the-art unsupervised multimodal entity linking by reasoning over diverse evidence types, including graph-based neighborhood information.
ControlFoley lets you generate audio from video with unprecedented control over text descriptions and reference audio, even when those inputs conflict.
Forget complex memory architectures: simple retrieval and generation, when carefully tuned for signal density, can outperform sophisticated methods in conversational agents.
LLMs can be jailbroken with 90% success by subtly "salami slicing" harmful intent across multiple turns, even against state-of-the-art models like GPT-4o and Gemini.