On-device LLM inference with processing-in-memory (PIM) is now more practical: PIM-SHERPA resolves memory inconsistencies, cutting memory capacity requirements by roughly 50% without sacrificing performance.