Sudarshan Srinivasan

Research focus

Distributed Systems & Hardware (2) · Architecture Design (Transformers, SSMs, MoE) (1) · Inference & Quantization (1)

Frequent co-authors

Deepak Gangadharan (1) · Dip Goswami (1) · Hanjiang Wu (1) · Abhimanyu Rajeshkumar Bambhaniya (1)

Papers (2)

Jun 22, 2026

LMS-AR: LMS Prediction-based Adaptive Regulator for Memory Bandwidth in Multicore Systems

Memory bandwidth regulation can be effectively managed by a non-dedicated master core, leading to substantial performance improvements in multi-core systems.

Sudarshan Srinivasan, Deepak Gangadharan, Dip Goswami

Distributed Systems & Hardware

May 27, 2026

Also DeepMind, Google Research, AMD Research and Advanced Development, Intel Labs

How Far Can Disaggregation Go? A Design-Space Exploration of Attention-FFN Disaggregation for Efficient MoE LLM Serving

Splitting attention and feedforward networks onto separate GPUs can unlock 4x higher MoE LLM throughput, but only if the GPU partitioning strategy is carefully tuned to the workload.

Hanjiang Wu, Abhimanyu Rajeshkumar Bambhaniya, Sarbartha Banerjee +9

Architecture Design (Transformers, SSMs, MoE) · Distributed Systems & Hardware · Inference & Quantization
