Tejaswini Pedapati

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Inference & Quantization (1)Reasoning & Chain-of-Thought (1)Natural Language Processing (1)

Frequent co-authors

David H. Yang (1)Yuxuan Zhu (1)Mohammad Mohammadi Amiri (1)Keerthiram Murugesan (1)

Papers (2)

Apr 13, 2026

2w ago·also IBM Research

ZoomR: Memory Efficient Reasoning through Multi-Granularity Key Value Retrieval

LLMs can slash memory use by 4x during reasoning without sacrificing accuracy, simply by "zooming in" on relevant cached information instead of attending to everything.

David H. Yang, Yuxuan Zhu, Mohammad Mohammadi Amiri +4

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Reasoning & Chain-of-Thought

Mar 8, 2026

AI Steerability 360: A Toolkit for Steering Large Language Models

Steer LLMs like never before with AI Steerability 360, an open-source toolkit that unifies input, structural, state, and output steering methods under a common pipeline.

Erik Miehling, Karthikeyan Natesan Ramamurthy, Praveen Venkateswaran +10

Natural Language Processing Open-Source Models & Weights Tool Use & Agents

Search

Tejaswini Pedapati

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)