Lattice AI Research

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Inference & Quantization (1)Tool Use & Agents (1)Code Generation & Program Synthesis (1)Distributed Systems & Hardware (1)

Frequent co-authors

Aakshita Chandiramani (1)Aaron Blakeman (1)Abdullahi Olaoye (1)Abhibha Gupta (1)

Papers (2)

Apr 14, 2026

AI2Apr 14, 2026·also NVIDIA, BIT, Gusu Laboratory of Materials, NJU +5

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Nemotron 3 Super proves you can achieve comparable accuracy to existing 120B models, but with significantly higher inference throughput, by combining Mamba, Attention, and Mixture-of-Experts.

Aakshita Chandiramani, Aaron Blakeman, Abdullahi Olaoye +448

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Tool Use & Agents

Mar 19, 2026

Project LeadMar 19, 2026

SOL-ExecBench: Speed-of-Light Benchmarking for Real-World GPU Kernels Against Hardware Limits

Agentic AI systems are still far from maximizing hardware potential: SOL-ExecBench reveals a significant gap between current GPU kernel performance and analytically derived Speed-of-Light bounds across a wide range of AI models.

Edward Lin, Sahil Modi, S. Hari +37

Code Generation & Program Synthesis Distributed Systems & Hardware Eval Frameworks & Benchmarks

Search

Nestor Qin

Research focus

Frequent co-authors

Papers (2)