Lattice AI Research

Research focus

Tool Use & Agents (3)Architecture Design (Transformers, SSMs, MoE) (2)Scaling Laws & Emergent Abilities (1)Inference & Quantization (1)

Frequent co-authors

Mohammad Shoeybi (3)Aaron Blakeman (2)Abhibha Gupta (2)Abhinav Khattar (2)

Papers (3)

Jun 12, 2026

AI2Jun 12, 2026·also NVIDIA, Gusu Laboratory of Materials, HKUST, NJU +5

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Achieving six times the inference throughput of current LLMs while maintaining accuracy, Nemotron 3 Ultra redefines performance benchmarks for agentic reasoning tasks.

NVIDIA, Aaron Blakeman, Aaron Thomas +538

Architecture Design (Transformers, SSMs, MoE)Scaling Laws & Emergent Abilities Tool Use & Agents

Apr 14, 2026

AI2Apr 14, 2026·also NVIDIA, BIT, Gusu Laboratory of Materials, NJU +4

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Nemotron 3 Super proves you can achieve comparable accuracy to existing 120B models, but with significantly higher inference throughput, by combining Mamba, Attention, and Mixture-of-Experts.

Aakshita Chandiramani, Aaron Blakeman, Abdullahi Olaoye +448

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Tool Use & Agents

Feb 24, 2026

NVIDIAFeb 24, 2026

On Data Engineering for Scaling LLM Terminal Capabilities

Forget hand-crafted datasets: a new synthetic data pipeline lets smaller LLMs beat giants at terminal tasks.

Renjie Pi, Grace Lam, Mohammad Shoeybi +5

Data Curation & Synthetic Data Tool Use & Agents Training Efficiency & Optimization