Shikhar Shukla

University of Kentucky

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Inference & Quantization (1)Natural Language Processing (1)

Papers (1)

May 4, 2026

2w ago

SpecKV: Adaptive Speculative Decoding with Compression-Aware Gamma Selection

Pushing speculative decoding to new heights, SpecKV adaptively tunes speculation length based on draft model confidence, achieving a 56% speedup compared to fixed-length speculation, especially crucial for compressed models.

Shikhar Shukla

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Natural Language Processing

Search

Shikhar Shukla

Publication activitypapers/week, last 8 weeks

Research focus

Papers (1)