Zhiyong Wang

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Distributed Systems & Hardware (2)Recommendation & Information Retrieval (2)Inference & Quantization (2)Multimodal Models (1)

Frequent co-authors

Yuchen Huang (1)Baiteng Ma (1)Yiping Sun (1)Yang Shi (1)

Papers (4)

Jun 11, 2026

B) and hour-level builds for ultra-large-scale4d ago·also Shanghai Jiaotong University

The Clustering Strikes Back: Building Cost-Effective and High-Performance ANNS at Scale with Helmsman

HELMSMAN slashes hardware costs by over 90% while enabling billion-scale index rebuilds in mere hours, revolutionizing ANNS for large-scale applications.

Yuchen Huang, Baiteng Ma, Yiping Sun +7

Distributed Systems & Hardware Recommendation & Information Retrieval

Jun 4, 2026

Tsinghua AI1w ago·also Huawei, PKU, Xiaohongshu

RedKnot: Efficient Long-Context LLM Serving with Head-Aware KV Reuse and SegPagedAttention

Transforming the KV cache from a monolithic structure into a dynamic, head-aware system could revolutionize LLM serving efficiency and scalability.

Yang Liu, ZhaoKai Luo, HuaYi Jin +5

Distributed Systems & Hardware Inference & Quantization

May 26, 2026

INFIFORCE2w ago·also EIT, SUSTech

Can VLA Models Learn from Real-World Data Continually without Forgetting?

Real-world robots forget how to fold towels after learning to pick-and-place, but this work shows experience replay can help, if you do it right.

Jiarun Zhu, Yijun Hong, Xiaoquan Sun +5

Multimodal Models Robotics & Embodied AI Training Efficiency & Optimization

Feb 12, 2026

LASER: An Efficient Target-Aware Segmented Attention Framework for End-to-End Long Sequence Modeling

Xiaohongshu's LASER framework slashes latency and boosts revenue by 2% in real-world recommendation systems via a novel segmented attention mechanism and a hybrid DRAM-SSD indexing strategy.

Tianhe Lin, Baoyuan Ou, Yingjie Qin +4

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Recommendation & Information Retrieval

Search

Zhiyong Wang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (4)