Oregon State University
Achieve up to 4.79x higher throughput in LLM serving by dynamically switching between data and tensor parallelism at runtime, without restarting workers.