Yuxin Hu

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (1), Multimodal Models (1), Tool Use & Agents (1)

Frequent co-authors

Fangda Ye (1), Peng Zhu (1), Pengxiang Zhu (1), Yibo Li (1)

Papers (1)

Mar 30, 2026

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

Current research-agent benchmarks miss critical flaws: MiroEval shows that process quality reliably predicts research outcome, and that multimodal tasks expose weaknesses invisible to output-level metrics.

Fangda Ye, Yuxin Hu, Peng Zhu +23

Eval Frameworks & Benchmarks, Multimodal Models, Tool Use & Agents
