MiLM Plus, Xiaomi Inc., Xiaomi AI Lab
ControlFoley generates audio from video and lets you steer the output with text descriptions and reference audio, even when those inputs conflict.
Open LLMs can now rival proprietary systems in multilingual translation, as demonstrated by MiLMMT-46's competitive performance against Google Translate and Gemini 3 Pro.
Mobile GUI agents, despite benchmark successes, stumble badly when faced with the messy reality of third-party content, failing almost half the time.