Slash LLM inference costs by 61% without sacrificing accuracy: dynamically escalate queries to larger models only when smaller models express uncertainty.
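The routing idea above can be sketched as a small/large model cascade. This is a minimal illustration, not the source's implementation: the model functions are stubs, the confidence values are invented, and the 0.8 threshold and per-call costs are assumed placeholders. In a real system, confidence might come from mean token log-probability or a calibrated verifier.

```python
# Hypothetical per-call costs for each tier (illustrative, not from the source).
SMALL_COST, LARGE_COST = 0.001, 0.03

def small_model(query):
    # Stub for a small, cheap model: returns (answer, confidence in [0, 1]).
    # Here confidence is faked from the query text for demonstration.
    if "easy" in query:
        return f"small-answer({query})", 0.95
    return f"small-answer({query})", 0.40

def large_model(query):
    # Stub for a large, expensive model.
    return f"large-answer({query})", 0.99

def cascade(query, threshold=0.8):
    """Answer with the small model; escalate only when it is uncertain."""
    answer, confidence = small_model(query)
    if confidence >= threshold:
        return answer, "small"
    # Below the confidence threshold: pay for the larger model.
    answer, _ = large_model(query)
    return answer, "large"

if __name__ == "__main__":
    for q in ["easy question", "hard question"]:
        ans, tier = cascade(q)
        print(f"{q!r} routed to {tier} model")
```

Savings depend on how often the small model clears the threshold: if a fraction p of queries stay on the small tier, expected cost per query is roughly p * SMALL_COST + (1 - p) * (SMALL_COST + LARGE_COST).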