School of Computer Science and Engineering, University of Electronic Science and Technology of China
Attention's quadratic complexity is no longer a bottleneck: DASH-KV achieves linear O(N) inference without sacrificing accuracy by reformulating attention as an approximate nearest-neighbor search.
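The summary does not describe DASH-KV's index, but the core idea of treating attention as a nearest-neighbor search can be sketched: instead of softmax-attending over all N keys, score and attend over only the top-k keys for each query (an ANN index would find these without scoring every key; the exact structure here is an assumption). A minimal NumPy illustration:

```python
import numpy as np

def topk_attention(q, K, V, k=4):
    # Score all keys, then keep only the top-k (a stand-in for an ANN
    # index, which would locate these keys without scanning them all;
    # DASH-KV's actual index is not described in the summary above).
    scores = K @ q / np.sqrt(q.shape[0])
    idx = np.argpartition(scores, -k)[-k:]
    # Softmax over the retained keys only, then aggregate their values.
    w = np.exp(scores[idx] - scores[idx].max())
    w /= w.sum()
    return w @ V[idx]
```

With k fixed, each decoding step touches O(k) keys instead of O(N); when k equals the number of keys, the result matches full softmax attention exactly.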
Ditch the haystack: Tri-RAG structures external knowledge into logical triplets, slashing irrelevant context and boosting RAG's reasoning power.
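The triplet idea can be illustrated with a toy retriever: knowledge is stored as (subject, relation, object) tuples, and retrieval returns only the triplets that mention an entity from the query, rather than whole passages of mostly irrelevant text. Tri-RAG's actual retrieval mechanism is not specified in the summary; the entity-matching rule below is an assumption for illustration.

```python
# A tiny knowledge store of (subject, relation, object) triplets.
triplets = [
    ("Paris", "capital_of", "France"),
    ("Berlin", "capital_of", "Germany"),
    ("France", "located_in", "Europe"),
]

def retrieve(query_entities, triplets):
    # Return only triplets whose subject or object appears in the
    # query, keeping the context passed to the LLM minimal and
    # logically structured.
    return [t for t in triplets
            if t[0] in query_entities or t[2] in query_entities]
```

For a query mentioning "France", only the two France triplets are returned, so the generator never sees the unrelated Berlin fact.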
Video-LLMs hallucinate because they fixate on a single "anchor frame," but a simple decoder-side attention fix can dramatically improve grounding without retraining.
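One generic way to implement such a decoder-side fix is to flatten the decoder's attention over video frames at inference time, so that no single anchor frame absorbs almost all of the mass. The temperature-smoothing below is an assumed illustration, not the paper's specific method:

```python
import numpy as np

def smooth_frame_attention(attn, tau=2.0):
    # Re-normalize decoder attention over frames with a higher softmax
    # temperature, redistributing mass away from a dominant "anchor
    # frame". Applied post hoc at decode time, so no retraining is
    # needed. (Generic sketch; the paper's exact fix isn't given above.)
    logits = np.log(attn + 1e-9) / tau
    w = np.exp(logits - logits.max())
    return w / w.sum()
```

For a peaked distribution like [0.9, 0.05, 0.05], the smoothed weights still favor the anchor frame but give the other frames a meaningfully larger share, which is the intuition behind improved temporal grounding.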