Running LLMs privately on your laptop without sacrificing speed is now practical: split inference and lookahead decoding can deliver near-native throughput even over high-latency networks.
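The core idea behind lookahead-style decoding can be sketched with toy stand-in models (not a real LLM API; `full_model`, `draft_model`, and the token rule are invented for illustration): a cheap draft model guesses several tokens ahead, and the expensive full model verifies them in one batched pass, accepting the longest matching prefix so each expensive pass can yield multiple tokens.

```python
def full_model(ctx):
    # Toy "expensive" model: next token is the running sum mod 7.
    return sum(ctx) % 7

def draft_model(ctx):
    # Toy "cheap" model: agrees with full_model except when the last
    # token is 3, to force occasional rejections.
    guess = sum(ctx) % 7
    return (guess + 1) % 7 if ctx[-1] == 3 else guess

def generate(ctx, n_tokens, lookahead=4):
    """Speculatively decode n_tokens; return (tokens, full-model passes)."""
    out = list(ctx)
    passes = 0
    while len(out) - len(ctx) < n_tokens:
        # 1) Draft `lookahead` tokens with the cheap model.
        cur, drafted = list(out), []
        for _ in range(lookahead):
            t = draft_model(cur)
            drafted.append(t)
            cur.append(t)
        # 2) Verify all drafts; on real hardware this is one batched
        #    forward pass, so count it as a single expensive call.
        passes += 1
        cur, accepted = list(out), []
        for t in drafted:
            true_t = full_model(cur)
            if t == true_t:
                accepted.append(t)
                cur.append(t)
            else:
                # Rejected draft: keep the full model's token, so the
                # pass still makes progress, then stop accepting.
                accepted.append(true_t)
                break
        out.extend(accepted)
    return out[len(ctx):][:n_tokens], passes
```

Because every accepted token is checked (or supplied) by the full model, the output is identical to plain greedy decoding; the win is that one verification pass can commit several tokens, which is what keeps throughput high when each round trip to a remote shard is slow.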