Yi R. Fung

OPD's unique update geometry reveals that it operates in a low-dimensional channel, fundamentally altering our understanding of model training dynamics.

Zhennan Shen, Yanshu Li, Qingyu Yin +6

Inference & Quantization Natural Language Processing Training Efficiency & Optimization

Jun 4, 2026

1w ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

LLMs struggle with adaptive planning, achieving only 67.75% accuracy when faced with progressively revealed world and user constraints.

Jiayu Liu, Cheng Qian, Zhenhailong Wang +7

Eval Frameworks & Benchmarks Tool Use & Agents World Models & Planning

May 25, 2026

3w ago·also Ant Group, HKUST

Reinforcement Learning from Denoising Feedback

Forget RLHF, denoising feedback offers a surprisingly effective and scalable alternative for training diffusion language models.

Qi He, Huan Chen, Ya Guo +3

Natural Language Processing RLHF & Preference Learning Training Efficiency & Optimization

Apr 5, 2026

Xinyu Geng +5Apr 5, 2026·also UPenn

GeoBrowse: A Geolocation Benchmark for Agentic Tool Use with Expert-Annotated Reasoning Traces

Current multimodal agents still struggle to combine ambiguous visual cues with open-web verification, highlighting a critical gap in their ability to perform complex geolocation tasks.

Xinyu Geng, Yanjing Xiao, Yuyang Zhang +3

Eval Frameworks & Benchmarks Multimodal Models Tool Use & Agents

Mar 12, 2026

Guanyu Jiang +5Mar 12, 2026·also HKUST

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Multimodal agents can now continually improve their tool use and orchestration in open-ended settings without parameter updates, thanks to a novel dual-stream framework that learns from both past experiences and structured skills.

Guanyu Jiang, Zhaochen Su, Xiaoye Qu +3

Multimodal Models Tool Use & Agents Training Efficiency & Optimization

Feb 26, 2026

Feb 26, 2026·also HKUST, Qian Xuesen Laboratory of Space Technology, Soochow, UESTC +1

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Even the best multimodal agents struggle with realistic visual scenarios, achieving only 27% accuracy on the new AgentVista benchmark that demands long-horizon tool use across web search, image search, and code.

Zhaochen Su, Jincheng Gao, Hangyu Guo +12

Eval Frameworks & Benchmarks Multimodal Models Tool Use & Agents

Search

Yi R. Fung

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (7)