By sharing the KV cache across models, PrefillShare slashes redundant prefill work without sacrificing accuracy, squeezing 4.5x lower latency and 3.9x higher throughput out of multi-LLM systems.
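The core idea can be sketched in a few lines: when several models process the same prompt prefix, the prefill KV tensors are computed once and reused. This is a minimal illustrative sketch, not the PrefillShare API; the cache class, `prefill` function, and model names are all hypothetical, and it assumes the models share a tokenizer and attention layout so their KV entries are interchangeable.

```python
# Hypothetical sketch of cross-model KV-cache sharing. All names here are
# illustrative assumptions, not PrefillShare's real interface.

prefill_calls = 0  # counts how often the expensive prefill actually runs

class SharedKVCache:
    """Maps a prompt prefix to its prefill KV entries, shared by all models."""
    def __init__(self):
        self.store = {}

    def get_or_compute(self, prompt, compute):
        if prompt not in self.store:
            self.store[prompt] = compute(prompt)  # cache miss: run prefill
        return self.store[prompt]                 # cache hit: reuse KV entries

def prefill(prompt):
    """Stand-in for the expensive prefill pass producing per-token KV entries."""
    global prefill_calls
    prefill_calls += 1
    return [f"kv({tok})" for tok in prompt.split()]

def run_model(name, prompt, cache):
    kv = cache.get_or_compute(prompt, prefill)
    return f"{name}: decoding with {len(kv)} cached KV entries"

cache = SharedKVCache()
system_prompt = "You are a helpful assistant"

print(run_model("router-llm", system_prompt, cache))
print(run_model("answer-llm", system_prompt, cache))
print(prefill_calls)  # prefill ran once; the second model reused the shared KV
```

Without sharing, each model would rerun prefill on the identical prefix; here the second call is a pure cache hit, which is where the latency and throughput gains come from.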