Lattice AI Research

Research focus

Architecture Design (Transformers, SSMs, MoE) (2)Multimodal Models (2)Speech & Audio (1)Computer Vision (1)

Frequent co-authors

Nvidia Amala Sanjay Deshmukh (1)K. Chumachenko (1)Tuomas Rintamaki (1)Matthieu Le (1)

Papers (2)

Apr 27, 2026

NVIDIAApr 27, 2026·also Amazon Science, Microsoft Research, UW, Music X Lab +1

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Multimodal models can now achieve state-of-the-art performance in real-world tasks like document understanding and audio-video comprehension with significantly reduced inference latency thanks to novel token-reduction techniques.

Nvidia Amala Sanjay Deshmukh, K. Chumachenko, Tuomas Rintamaki +209

Architecture Design (Transformers, SSMs, MoE)Multimodal Models Speech & Audio

Mar 12, 2026

NVIDIAMar 12, 2026·also BAIR, MIT CSAIL, Clarifai, K-frame

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

MLLMs can now handle 4K videos up to 100x faster thanks to AutoGaze, which selectively attends to only the most informative patches.

Baifeng Shi, Stephanie Fu, Long Lian +12

Architecture Design (Transformers, SSMs, MoE)Computer Vision Multimodal Models

Search

Hanrong Ye

Research focus

Frequent co-authors

Papers (2)