Woosang Lim

Papers on Lattice

Total citations

Topics

h-index

Research focus

Architecture Design (Transformers, SSMs, MoE) (2)Inference & Quantization (2)Training Efficiency & Optimization (2)

Frequent co-authors

Minkyu Kim (1)Vincent-Daniel Yun (1)Youngrae Kim (1)Y. Heo (1)

Papers (2)

Apr 27, 2026

Minkyu Kim +7Apr 27, 2026·also SNU, University, USC

Rethinking Layer Redundancy in Large Language Models: Calibration Objectives and Search for Depth Pruning

The secret to effectively pruning LLMs might not be *how* you search for redundant layers, but *what* you're optimizing for.

Minkyu Kim, Vincent-Daniel Yun, Youngrae Kim +5

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Training Efficiency & Optimization

Mar 19, 2026

Minsoo Cheong +6Mar 19, 2026

EntropyCache: Decoded Token Entropy Guided KV Caching for Diffusion Language Models

Diffusion language models can achieve up to 26x inference speedups with almost no accuracy loss, thanks to a clever entropy-based KV caching strategy that avoids costly full forward passes.

Minsoo Cheong, Minsoo Cheong, Donghyun Son +4

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Training Efficiency & Optimization