Yue Zhang

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Eval Frameworks & Benchmarks (1)Multimodal Models (1)Tool Use & Agents (1)

Frequent co-authors

Zhaochen Su (1)Jincheng Gao (1)Hangyu Guo (1)Zhenhua Liu (1)

Papers (1)

Feb 26, 2026

May2w ago·also UNC

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Even the best multimodal agents struggle with realistic visual scenarios, achieving only 27% accuracy on the new AgentVista benchmark that demands long-horizon tool use across web search, image search, and code.

Zhaochen Su, Jincheng Gao, Hangyu Guo +12

Eval Frameworks & Benchmarks Multimodal Models Tool Use & Agents

Search

Yue Zhang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (1)