Hangyu Guo

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Eval Frameworks & Benchmarks (2)Multimodal Models (2)Reasoning & Chain-of-Thought (1)RLHF & Preference Learning (1)

Frequent co-authors

Artyom Gadetsky (1)M. Kodryan (1)Siba Smarak Panigrahi (1)Maria Brbic (1)

Papers (3)

May 11, 2026

Unsupervised Process Reward Models

Forget expensive human annotations: this unsupervised method trains reward models that steer LLM reasoning just as well as, or even better than, their supervised counterparts.

Artyom Gadetsky, M. Kodryan, Siba Smarak Panigrahi +2

Reasoning & Chain-of-Thought RLHF & Preference Learning

Mar 11, 2026

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Multimodal LLMs still struggle to faithfully recreate webpages from videos, particularly in capturing fine-grained style and motion, despite advances in other areas.

Yuhong Dai, Yanlin Lai, Mitt Huang +6

Computer Vision Eval Frameworks & Benchmarks Multimodal Models

Feb 26, 2026

Feb 26, 2026·also HKUST, Qian Xuesen Laboratory of Space Technology, Soochow, UESTC +1

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Even the best multimodal agents struggle with realistic visual scenarios, achieving only 27% accuracy on the new AgentVista benchmark that demands long-horizon tool use across web search, image search, and code.

Zhaochen Su, Jincheng Gao, Hangyu Guo +12

Eval Frameworks & Benchmarks Multimodal Models Tool Use & Agents

Search

Hangyu Guo

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)