Hybrid Mamba-Transformer LLMs achieve up to 4x faster time-to-first-token and 1.4x higher throughput with a new disaggregated accelerator architecture that serves the prefill and decode phases on separate, specialized hardware.
By exploiting the low entropy of BF16 exponents with Huffman coding, LEXI slashes inter-chiplet communication latency in LLMs by up to 45% without sacrificing accuracy.
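The core observation behind LEXI is that BF16 exponent bits are far from uniformly distributed, so an entropy code can shrink them well below their fixed 8 bits. The following sketch is illustrative only (it is not LEXI's implementation and its function names are invented): it extracts the exponent field from BF16-truncated Gaussian "weights," builds a Huffman code over the observed exponents, and reports the average code length versus the fixed 8 bits.

```python
# Illustrative sketch, NOT LEXI's implementation: Huffman-coding the
# 8-bit exponent field of BF16 values drawn from a Gaussian, to show
# why the exponent's low entropy makes it compressible.
import heapq
from collections import Counter
import numpy as np

def bf16_exponents(x: np.ndarray) -> np.ndarray:
    """Extract the 8-bit exponent field from float32 values truncated to BF16."""
    bits = x.astype(np.float32).view(np.uint32)
    bf16 = (bits >> 16).astype(np.uint16)         # BF16 = top 16 bits of float32
    return ((bf16 >> 7) & 0xFF).astype(np.uint8)  # layout: sign(1)|exponent(8)|mantissa(7)

def huffman_lengths(freqs: Counter) -> dict:
    """Return the Huffman code length (in bits) for each symbol in `freqs`."""
    heap = [(f, i, {s: 0}) for i, (s, f) in enumerate(freqs.items())]
    heapq.heapify(heap)
    if len(heap) == 1:  # degenerate case: a single symbol still needs 1 bit
        return {s: 1 for s in freqs}
    tie = len(heap)
    while len(heap) > 1:
        f1, _, d1 = heapq.heappop(heap)
        f2, _, d2 = heapq.heappop(heap)
        merged = {s: depth + 1 for s, depth in {**d1, **d2}.items()}
        heapq.heappush(heap, (f1 + f2, tie, merged))
        tie += 1
    return heap[0][2]

rng = np.random.default_rng(0)
weights = rng.normal(0, 0.02, size=1 << 16).astype(np.float32)  # toy "model weights"
exps = bf16_exponents(weights)
freqs = Counter(exps.tolist())
lengths = huffman_lengths(freqs)
avg_bits = sum(freqs[s] * lengths[s] for s in freqs) / len(exps)
print(f"distinct exponents: {len(freqs)}, avg code length: {avg_bits:.2f} bits vs 8 fixed")
```

On data like this the exponents cluster around a narrow range of magnitudes, so the average Huffman code length comes out well under 8 bits; shorter exponent codes mean fewer bits crossing the chiplet interconnect, which is the latency lever the summary describes.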