Yiwen Shao

Tencent Hunyuan

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (2)Speech & Audio (2)Robotics & Embodied AI (1)

Frequent co-authors

Zhiyuan Zhu (1)Yixuan Chen (1)Wenxiang Guo (1)Changhao Pan (1)

Papers (2)

Jun 9, 2026

Jun 9, 2026·also Tencent AI

Spatial-Omni: Spatial Audio Understanding Integration in Multimodal LLMs via FOA Encoding

Spatial-Omni achieves superior spatial audio understanding by seamlessly integrating FOA encoding into existing LLMs, outperforming traditional models without compromising general audio processing.

Zhiyuan Zhu, Yixuan Chen, Yiwen Shao +10

Multimodal Models Speech & Audio

Feb 20, 2026

Zhan Liu +12Feb 20, 2026·also CUHK, National Technology Innovation Center, Tencent AI

JAEGER: Joint 3D Audio-Visual Grounding and Reasoning in Simulated Physical Environments

By explicitly modeling 3D space with learned spatial audio representations, JAEGER enables AV-LLMs to perform joint spatial grounding and reasoning far beyond the capabilities of 2D-centric models.

Zhan Liu, Changli Tang, Changli Tang +10

Multimodal Models Robotics & Embodied AI Speech & Audio

Search

Yiwen Shao

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)