The Chinese University of Hong Kong
Omni-modal LLMs can ace captioning and QA, but the AVID benchmark reveals they are surprisingly poor at spotting audio-visual inconsistencies in videos, a crucial skill for trustworthy AI.
CLIP can now understand "no dog" without any fine-tuning, thanks to a plug-and-play module that disentangles negated semantics and penalizes false-positive matches.