Yifan Hou

ETH Zurich

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Multimodal Models (2)Computer Vision (1)Interpretability & Mechanistic Interp (1)Eval Frameworks & Benchmarks (1)

Frequent co-authors

Mrinmaya Sachan (2)Xingzhou Pang (1)Junling Wang (1)Yucheng Wang (1)

Papers (2)

May 28, 2026

ETH2w ago

Unveiling the Visual Counting Bottleneck in Vision-Language Models

VLMs don't lack visual understanding of quantity, they just can't connect what they see to symbolic number representations, revealing a fractured magnitude space.

Xingzhou Pang, Yifan Hou, Junling Wang +1

Computer Vision Interpretability & Mechanistic Interp Multimodal Models

Sep 28, 2025

M-A-PSep 28, 2025·also ETH

Compose and Fuse: Revisiting the Foundational Bottlenecks in Multimodal Reasoning

Multimodal LLMs often perform worse with more modalities because they struggle to jointly recognize and reason across modalities, a problem solvable with simple prompting strategies.

Yucheng Wang, Yifan Hou, Aydin Javadov +2

Eval Frameworks & Benchmarks Multimodal Models Reasoning & Chain-of-Thought

Search

Yifan Hou

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)