Lattice: the structure behind the noise

Created by Flynn Lachendro

Junhao Hu | Lattice

Peking University

Papers on Lattice

2

Total citations

0

Topics

4

Publication activity (papers/week, last 8 weeks)

Research focus

Inference & Quantization (2) · Architecture Design (Transformers, SSMs, MoE) (1) · Recommendation & Information Retrieval (1) · Distributed Systems & Hardware (1)

Frequent co-authors

Yang Liu (2) · Yifei Liu (1) · Juntong Wu (1) · Xiaoxu Chen (1)

Papers (2)

Jul 1, 2026

3w ago·also Tsinghua AI, PKU, SJTU, Xiaohongshu

HYPIC: Accelerating Hybrid-Attention LLM Serving with Position-Independent Caching

HYPIC cuts time-to-first-token by 2.45x and doubles throughput for hybrid-attention LLMs while preserving near-full accuracy.

Yifei Liu, Juntong Wu, Yang Liu +3

Architecture Design (Transformers, SSMs, MoE) · Inference & Quantization · Recommendation & Information Retrieval

Jun 4, 2026

Tsinghua AI · Jun 4, 2026 · also Huawei, HUST, PKU, Xiaohongshu

RedKnot: Efficient Long-Context LLM Serving with Head-Aware KV Reuse and SegPagedAttention

Transforming the KV cache from a monolithic structure into a dynamic, head-aware system could revolutionize LLM serving efficiency and scalability.

Yang Liu, Zhaokai Luo, Huayi Jin +4

Distributed Systems & Hardware · Inference & Quantization

Weihang Chen (1) · Zhaokai Luo (1) · Zhiyong Wang (1)