Wangyang Hong

Papers on Lattice

Total citations

Topics

h-index

Publication activity (papers/week, last 8 weeks)

Research focus

Architecture Design (Transformers, SSMs, MoE) (1), Inference & Quantization (1), Multimodal Models (1)

Frequent co-authors

Libo Zhang (1), Zhaoning Zhang (1), Peng Qiao (1)

Papers (1)

Feb 17, 2026

Libo Zhang +3, 3w ago

Sparrow: Text-Anchored Window Attention with Visual-Semantic Glimpsing for Speculative Decoding in Video LLMs

Sparrow unlocks 2.8x faster inference for Video LLMs on long videos by cleverly offloading visual computation to the target model using text-anchored attention and semantic-rich intermediate states.

Libo Zhang, Zhaoning Zhang, Wangyang Hong +1

Architecture Design (Transformers, SSMs, MoE), Inference & Quantization, Multimodal Models
