Dong Yuan

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Distributed Systems & Hardware (3)Inference & Quantization (2)Data Curation & Synthetic Data (1)Tool Use & Agents (1)

Frequent co-authors

Xuanyu Chen (1)Nan Yang (1)Yan Yan (1)Nan Yang (1)

Papers (3)

Jul 2, 2026

Xuanyu Chen +21w ago

Understanding the Robustness of Distributed Self-Supervised Learning Frameworks Against Non-IID Data

MIM's superior robustness against non-IID data could redefine the benchmarks for distributed self-supervised learning frameworks.

Xuanyu Chen, Nan Yang, Dong Yuan

Data Curation & Synthetic Data Distributed Systems & Hardware

Mar 11, 2026

AgentServe: Algorithm-System Co-Design for Efficient Agentic AI Serving on a Consumer-Grade GPU

AgentServe achieves up to 2.8x improvement in time-to-first-token and 2.7x in tokens-per-output-token for agentic workloads on a single GPU by strategically isolating prefills and decodes.

Yan Yan, Nan Yang, Dong Yuan

Distributed Systems & Hardware Inference & Quantization Tool Use & Agents

Oct 31, 2025

Joint Optimization of Resource Allocation and Request Batching for Multi-Tenant Inference Serving on GPU

Stop leaving performance on the table: jointly optimizing resource allocation and request batching with reinforcement learning can yield up to 24x speedups for multi-tenant GPU inference.

Yuning Zhang, Nan Yang, Chen Pan +1

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization

Search

Dong Yuan

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)