Adobe Research
MLLMs that ace standard Referring Expression Comprehension benchmarks still stumble when faced with images designed to eliminate shortcuts, revealing a surprising lack of robust visual reasoning.
You can drastically improve text-to-image retrieval from short, ambiguous queries by using a language model to generate richer, quality-aware descriptions.
Forget handcrafted metrics: RetouchIQ uses an RL-tuned MLLM to generate its own reward signals for instruction-based image editing, leading to more semantically consistent and perceptually pleasing results.