Ziyu Yao

Model internals, not just outputs, hold the key to predicting generalization: circuit-based metrics beat standard proxies by up to 34% in assessing ViT performance under distribution shift.

Yunxiang Peng, Mengmeng Ma, Ziyu Yao +1

Architecture Design (Transformers, SSMs, MoE)Computer Vision Interpretability & Mechanistic Interp

Mar 15, 2026

Mohamed Aghzal +1Mar 15, 2026·also George Mason University

Why Do LLM-based Web Agents Fail? A Hierarchical Planning Perspective

LLM web agents struggle more with perceptual grounding and low-level execution than high-level reasoning, challenging the assumption that better reasoning alone will solve web navigation.

Mohamed Aghzal, Ziyu Yao

Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought Tool Use & Agents

Search

Ziyu Yao

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)