Guanbin Li

Sun Yat-sen University, China

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Tool Use & Agents (3)Computer Vision (3)Multimodal Models (3)Robotics & Embodied AI (2)

Frequent co-authors

Yang Liu (2)Liang Lin (2)Sibei Yang (2)Tianyue Jiang (1)

Papers (8)

Jul 21, 2026

5d ago·also Huawei, HUST, ZJU

PhoenixRepair: Rethinking Repair Strategy Exploration in Software Agents

PhoenixRepair redefines how software agents explore repair strategies, achieving a 76% resolution rate by leveraging multi-location sampling and iterative refinement.

Tianyue Jiang, Yanlin Wang, Xinabang He +7

Code Generation & Program Synthesis Tool Use & Agents

Jul 14, 2026

1w ago·also SYSU, USTC

ARDepth: Auto-regressive Monocular Depth Estimation with Progressive Visual Conditioning

ARDepth reveals that structured auto-regressive generation can significantly enhance monocular depth estimation by capturing local details without sacrificing global coherence.

Zijie Wang, Weiming Zhang, Xiao Tan +3

Computer Vision Multimodal Models

Jun 25, 2026

Tsinghua AIJun 25, 2026·also HIT, Peng Cheng Laboratory, SYSU

In-Context Model Predictive Generation: Open-Vocabulary Motion Synthesis from Language Models to Physics

ICMPG achieves a groundbreaking balance between semantic fidelity and physical realism in motion synthesis, outperforming traditional methods in both standard and zero-shot scenarios.

Xiaomeng Fu, Junfan Lin, Yang Liu +3

Natural Language Processing Robotics & Embodied AI

Jun 15, 2026

Jun 15, 2026·also Louisiana State University, Shanghai Innovation

RealityBridge: Bridging Editable 3D Gaussian Splatting Driving Simulations and Real-World Videos

RealityBridge closes the Sim-to-Real gap in 3D driving simulations, achieving superior visual fidelity and temporal stability that existing methods fail to deliver.

Zhenhua Wu, Yun Pang, Mingkun Chang +4

Computer Vision Robotics & Embodied AI

Jun 8, 2026

Tsinghua AIJun 8, 2026·also SYSU

Bridging the Agent-World Gap: Text World Models for LLM-based Agents

Text world models can transform LLM-based agents from reactive responders into proactive planners, enhancing their performance in complex interactive tasks.

Yixia Li, Peng Lai, Zhiwen Ruan +10

Tool Use & Agents World Models & Planning

May 27, 2026

Zhendong He +4May 27, 2026·also SYSU

Self-Prophetic Decoding to Unlock Visual Search in LVLMs

LVLMs can now perform visual search far more effectively thanks to a clever decoding strategy that harmonizes pre- and post-training capabilities.

Zhendong He, Qiyuan Dai, Guanbin Li +2

Multimodal Models Reasoning & Chain-of-Thought Recommendation & Information Retrieval

May 25, 2026

May 25, 2026·also SYSU

SP-MoMamba: Superpixel-driven Mixture of State Space Experts for Efficient Image Super-Resolution

Ditch the rigid grid: SP-MoMamba uses superpixels to let Mamba-based super-resolution models "see" images like humans do, boosting performance and efficiency.

Wenbin Zou, Yawen Cui, Lap-Pui Chau +2

Architecture Design (Transformers, SSMs, MoE)Computer Vision Training Efficiency & Optimization

Apr 13, 2026

Apr 13, 2026·also CAS, CUHK

Collaborative Multi-Agent Scripts Generation for Enhancing Imperfect-Information Reasoning in Murder Mystery Games

Training VLMs on collaboratively generated Murder Mystery scripts dramatically improves their ability to reason about hidden facts and deception in complex, multi-agent scenarios.

Keyang Zhong, Junlin Xie, Hefeng Wu +2

Multimodal Models Reasoning & Chain-of-Thought Tool Use & Agents