Yanghai Wang

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (3)Eval Frameworks & Benchmarks (2)Speech & Audio (1)Tool Use & Agents (1)

Frequent co-authors

Zijie Zhang (3)Jiafu Tang (2)Zhe Cao (2)Yuanxing Zhang (1)

Papers (3)

Jul 14, 2026

Yanghai Wang +91w ago

AVSCap: Orchestrating Audio-Visual Synergy for Omni-modal Video Captioning

AVSCap-7B achieves superior audio-visual synergy, outperforming existing models by effectively linking non-speech sounds to visual actions.

Yanghai Wang, Jiafu Tang, Yuanxing Zhang +7

Multimodal Models Speech & Audio

Jun 2, 2026

OmniHalluc-L: Counterfactual Benchmarking and Modality-Perturbation Reliability Calibration for Long-Form Omni Hallucination

Open-weight Omni models struggle with binding accuracy, achieving only 41.55% on a new counterfactual benchmark, highlighting a critical gap in long-video comprehension.

Zixuan Dong, Jiafu Tang, Zhide Lei +9

Eval Frameworks & Benchmarks Multimodal Models

Apr 16, 2026

Apr 16, 2026·also JIUTIAN Research, Kling Team

DR$^{3}$-Eval: Towards Realistic and Reproducible Deep Research Evaluation

Current research agents still struggle with retrieval robustness and hallucination control, even when evaluated in a static, verifiable research environment.

Qianqian Xie, Qing Xiong, He Zhu +16

Eval Frameworks & Benchmarks Multimodal Models Tool Use & Agents

Search

Yanghai Wang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)