Xiaoye Qu

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Tool Use & Agents (4)Multimodal Models (4)Eval Frameworks & Benchmarks (2)Computer Vision (2)

Frequent co-authors

Yafu Li (2)Siyuan Huang (2)T. Zhu (2)Zefeng He (2)

Papers (6)

May 14, 2026

Haoran Zhang +13May 14, 2026·also PKU

$\pi$-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Current personal assistant agents struggle to anticipate and act on unstated user needs in long, complex workflows, revealing a critical gap between task completion and genuine proactivity.

Haoran Zhang, Luxin Xu, Zhilin Wang +11

Eval Frameworks & Benchmarks Natural Language Processing Tool Use & Agents

May 1, 2026

Siyuan Huang +8May 1, 2026·also WHU

Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs

LVLMs can maintain sharper visual focus during long-form generation by adding a lightweight, learnable memory module that bypasses attention dilution.

Siyuan Huang, Xiaoye Qu, Yafu Li +6

Architecture Design (Transformers, SSMs, MoE)Computer Vision Multimodal Models

Apr 21, 2026

Apr 21, 2026·also Telecom

VCE: A zero-cost hallucination mitigation method of LVLMs via visual contrastive editing

Object hallucination in LVLMs can be significantly reduced *after* training, without any extra data or compute.

Yanbin Huang, Yisen Li, Xiaoye Qu +3

Computer Vision Eval Frameworks & Benchmarks Multimodal Models

Mar 30, 2026

Zefeng He +4Mar 30, 2026

GEMS: Agent-Native Multimodal Generation with Memory and Skills

A lightweight 6B model, when harnessed within the GEMS agent framework, leapfrogs state-of-the-art models in multimodal generation, suggesting architectural innovations in agents can compensate for raw parameter count.

Zefeng He, Siyuan Huang, Xiaoye Qu +2

Code Generation & Program Synthesis Multimodal Models Tool Use & Agents

Mar 12, 2026

Guanyu Jiang +5Mar 12, 2026·also HKUST

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Multimodal agents can now continually improve their tool use and orchestration in open-ended settings without parameter updates, thanks to a novel dual-stream framework that learns from both past experiences and structured skills.

Guanyu Jiang, Zhaochen Su, Xiaoye Qu +3

Multimodal Models Tool Use & Agents Training Efficiency & Optimization

Feb 12, 2026

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

LLMs can be taught to "think longer" and explore more diverse reasoning paths in-context via a simple length-incentivized reward, leading to improved generalization.

Futing Wang, Jianhao Yan, Yun Luo +5

Reasoning & Chain-of-Thought RLHF & Preference Learning Tool Use & Agents

Search

Xiaoye Qu

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (6)