Jing Peng

X-LANCE Lab, School of Computer Science, Shanghai Jiao Tong University, China

Research focus

Speech & Audio (4) · Multimodal Models (2) · Architecture Design (Transformers, SSMs, MoE) (1) · Interpretability & Mechanistic Interp (1)

Frequent co-authors

Chenghao Wang (2), Kai Yu (2), Zhisheng Zhang (1), Guoyang Zeng (1)

Papers (4)

May 27, 2026

Tsinghua AI · 2w ago · also SJTU

LoSATok: Low-dimensional Semantic-Acoustic Tokenizer for Cross-Domain Audio Understanding and Generation

Compressing audio semantics into just 128 dimensions doesn't just reduce DiT modeling burden; it actually *improves* audio generation quality across diverse domains.

Zhisheng Zhang, Jing Peng, Guoyang Zeng

Architecture Design (Transformers, SSMs, MoE) · Multimodal Models · Speech & Audio

ETH · 2w ago · also Hunyuan Team, NJU, Northwestern, NTU +4

Audio-Mind: An Auditable Agentic Framework for Audio Understanding

Over-reliance on agentic decomposition can actually *hurt* audio understanding when a strong audio frontend already provides sufficient information, highlighting the importance of conditional evidence acquisition.

Yucheng Wang, Jing Peng, Hanqi Li +6

Interpretability & Mechanistic Interp · Speech & Audio · Tool Use & Agents

Apr 27, 2026

Apr 27, 2026 · also SJTU

RAS: a Reliability Oriented Metric for Automatic Speech Recognition

ASR systems can now be more trustworthy: this work shows how to train them to abstain from transcribing uncertain segments, leading to more reliable outputs.

Wen-Chin Huang, Yuhang Qiu, Bohan Li +4

Eval Frameworks & Benchmarks · Natural Language Processing · Speech & Audio

Apr 9, 2026

Apr 9, 2026 · also AI Lab, XJTU

TASU2: Controllable CTC Simulation for Alignment and Low-Resource Adaptation of Speech LLMs

Forget expensive audio-text data collection: TASU2 lets you dial in the perfect amount of noise for training your speech LLM, all from text.

Jing Peng, Jing Peng, Chenghao Wang +8

Data Curation & Synthetic Data · Multimodal Models · Speech & Audio
