Nemotron 3 Super shows that you can match the accuracy of existing 120B-class models while delivering significantly higher inference throughput by combining Mamba, Attention, and Mixture-of-Experts layers in one hybrid architecture.
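To make the hybrid design concrete, here is a minimal PyTorch sketch of how Mamba-style mixers, attention layers, and MoE feed-forward blocks can be interleaved in a single stack. Everything in it is an illustrative assumption: the layer pattern, dimensions, the simplified SSM stand-in (real Mamba uses a selective state-space scan), and the tiny top-2 router are hypothetical, not Nemotron 3 Super's actual configuration.

```python
# Illustrative hybrid Mamba/Attention/MoE stack. All sizes, the layer
# pattern, and the simplified mixers are hypothetical placeholders.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SSMBlock(nn.Module):
    """Stand-in for a Mamba-style mixer: gated causal depthwise conv.
    Real Mamba uses a selective state-space scan instead."""
    def __init__(self, d_model: int):
        super().__init__()
        self.in_proj = nn.Linear(d_model, 2 * d_model)
        self.conv = nn.Conv1d(d_model, d_model, kernel_size=4,
                              padding=3, groups=d_model)  # causal depthwise conv
        self.out_proj = nn.Linear(d_model, d_model)

    def forward(self, x):                                  # x: (B, T, d_model)
        h, gate = self.in_proj(x).chunk(2, dim=-1)
        h = self.conv(h.transpose(1, 2))[..., : x.size(1)].transpose(1, 2)
        return self.out_proj(F.silu(gate) * h)

class AttentionBlock(nn.Module):
    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, x):  # causal masking omitted for brevity
        out, _ = self.attn(x, x, x, need_weights=False)
        return out

class MoEFFN(nn.Module):
    """Tiny top-2 mixture-of-experts feed-forward layer."""
    def __init__(self, d_model: int, n_experts: int = 4, top_k: int = 2):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                          nn.Linear(4 * d_model, d_model))
            for _ in range(n_experts)])
        self.top_k = top_k

    def forward(self, x):
        weights = self.router(x).softmax(dim=-1)           # (B, T, E)
        topw, topi = weights.topk(self.top_k, dim=-1)      # route each token
        out = torch.zeros_like(x)
        for k in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = topi[..., k] == e                   # tokens sent to e
                if mask.any():
                    out[mask] += topw[..., k][mask].unsqueeze(-1) * expert(x[mask])
        return out

class HybridBlock(nn.Module):
    def __init__(self, d_model: int, n_heads: int, mixer: str):
        super().__init__()
        self.norm1 = nn.LayerNorm(d_model)
        self.mixer = (SSMBlock(d_model) if mixer == "mamba"
                      else AttentionBlock(d_model, n_heads))
        self.norm2 = nn.LayerNorm(d_model)
        self.ffn = MoEFFN(d_model)

    def forward(self, x):
        x = x + self.mixer(self.norm1(x))                  # sequence mixing
        return x + self.ffn(self.norm2(x))                 # sparse FFN

# Mostly-Mamba pattern with sparse attention layers (hypothetical ratio).
pattern = ["mamba", "mamba", "mamba", "attention"] * 2
model = nn.Sequential(*[HybridBlock(256, 8, m) for m in pattern])
print(model(torch.randn(2, 16, 256)).shape)                # (2, 16, 256)
```

The throughput argument hinges on this mix: the Mamba-style layers avoid attention's quadratic cost and its per-token KV cache, while the sparse MoE layers keep per-token compute low relative to total parameter count.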
Training trillion-parameter Mixture-of-Experts models just got a whole lot faster: Megatron Core now sustains over 1 PFLOP/s per GPU on NVIDIA's latest hardware.
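For a sense of what 1 PFLOP/s per GPU means in practice, here is a back-of-the-envelope sketch using the standard ~6 x parameters x tokens approximation for a combined forward and backward training pass. The token rate, GPU count, and active-parameter count below are made-up illustrative numbers, not NVIDIA's benchmark figures.

```python
# Rough meaning of ">1 PFLOP/s per GPU": achieved training throughput,
# estimated via the standard ~6 * params * tokens rule of thumb for a
# forward + backward pass.

def achieved_flops_per_gpu(active_params: float, tokens_per_sec: float,
                           n_gpus: int) -> float:
    """Approximate training FLOP/s per GPU.

    For MoE models, `active_params` counts only the parameters each token
    actually uses (shared layers + routed experts), not the total
    trillion-parameter count.
    """
    return 6 * active_params * tokens_per_sec / n_gpus

# Hypothetical setup: a trillion-parameter MoE with ~50B active params per
# token, sustaining 3.5M tokens/s across 1,024 GPUs.
rate = achieved_flops_per_gpu(active_params=50e9,
                              tokens_per_sec=3.5e6,
                              n_gpus=1024)
print(f"{rate / 1e15:.2f} PFLOP/s per GPU")  # -> 1.03 PFLOP/s per GPU
```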