Lattice AI Research

Research focus

Architecture Design (Transformers, SSMs, MoE) (2)Scaling Laws & Emergent Abilities (1)Tool Use & Agents (1)Distributed Systems & Hardware (1)

Frequent co-authors

Abhinav Khattar (2)Evan Wu (2)Michael Andersch (2)Mohammad Shoeybi (2)

Papers (2)

Jun 12, 2026

AI21w ago·also NVIDIA, HKUST, Institute of Medical Technology, Motional +3

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Achieving six times the inference throughput of current LLMs while maintaining accuracy, Nemotron 3 Ultra redefines performance benchmarks for agentic reasoning tasks.

NVIDIA, Aaron Blakeman, Aaron Thomas +570

Architecture Design (Transformers, SSMs, MoE)Scaling Laws & Emergent Abilities Tool Use & Agents

Mar 8, 2026

NVIDIAMar 8, 2026·also Tongji

Scalable Training of Mixture-of-Experts Models with Megatron Core

Training trillion-parameter Mixture-of-Experts models just got a whole lot faster: Megatron Core now achieves >1 PFLOP/GPU on NVIDIA's latest hardware.

Zijie Yan, Hongxiao Bai, Dennis Liu +32

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Training Efficiency & Optimization

Sangkug Lym

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)

Search

Sangkug Lym

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)