Jie Wu

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Inference & Quantization (3)Natural Language Processing (2)Speech & Audio (2)Scaling Laws & Emergent Abilities (1)

Frequent co-authors

Ming Lei (2)Jie Gao (2)Jiaqi Song (1)Guang Qiu (1)

Papers (3)

Apr 20, 2026

Jiaqi Song +11Apr 20, 2026·also UC Santa Cruz

NIM4-ASR: Towards Efficient, Robust, and Customizable Real-Time LLM-Based ASR

LLM-based ASR can be shrunk to 2.3B parameters and still beat larger models in real-world scenarios by carefully delineating encoder and LLM roles and using a multi-stage training approach.

Jiaqi Song, Guang Qiu, Guanghui Qiu +9

Inference & Quantization Natural Language Processing Scaling Laws & Emergent Abilities+1

Apr 9, 2026

Rethinking Entropy Allocation in LLM-based ASR: Understanding the Dynamics between Speech Encoders and LLMs

LLM-based ASR models can achieve state-of-the-art performance and reduce hallucinations by strategically allocating entropy reduction between the speech encoder and LLM during training.

Yuankun Xie, Jiaqi Song, Guang Qiu +4

Inference & Quantization Natural Language Processing Speech & Audio

Mar 30, 2026

Yufei Xu +14Mar 30, 2026

HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention

Scanning every token to focus attention is now passé: HISA prunes irrelevant context blocks *before* token-level scoring, slashing compute without sacrificing selection fidelity.

Yufei Xu, Fanxu Meng, Fan Jiang +12

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Training Efficiency & Optimization

Search

Jie Wu

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)