Feiyi Wang

Oak Ridge National Laboratory

Papers on Lattice

Total citations

Topics

h-index

Publication activity (papers/week, last 8 weeks)

Research focus

Distributed Systems & Hardware (1), Inference & Quantization (1)

Frequent co-authors

Shouwei Gao (1), Junqi Yin (1), Wenqian Dong (1)

Papers (1)

Feb 26, 2026

2w ago · also ORNL

FLYING SERVING: On-the-Fly Parallelism Switching for Large Language Model Serving

Achieve up to 4.79x higher throughput in LLM serving by dynamically switching between data and tensor parallelism on the fly, without restarting workers.

Shouwei Gao, Junqi Yin, Feiyi Wang +1

Distributed Systems & Hardware, Inference & Quantization
