Lidong Bing

Papers on Lattice

Total citations

Topics

Research focus

Multimodal Models (3)Tool Use & Agents (2)Robotics & Embodied AI (1)Computer Vision (1)World Models & Planning (1)

Frequent co-authors

Sudong Wang (2)Xiaojuan Qi (2)Shijian Lu (2)L. Bing (2)

Papers (3)

May 19, 2026

May 19, 2026·also evolvinglmms-lab.github.io/ParaVT, HKUST

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

RL fine-tuning LMMs for tool use can collapse structural formats due to strong pretrained tool priors, but a surprisingly simple fix of targeted format rewards and frame-budget randomization can restore stability and boost performance.

Zuhao Yang, Kaichen Zhang, Sudong Wang +6

Multimodal Models Robotics & Embodied AI Tool Use & Agents

Apr 30, 2026

DAMOApr 30, 2026·also HKUST, NTU

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Today's visual generation models are often evaluated on the wrong things, leading to inflated performance claims that mask critical failures in spatial reasoning, temporal consistency, and causal understanding.

Keming Wu, Zuhao Yang, Kaichen Zhang +28

Computer Vision Multimodal Models World Models & Planning

Mar 30, 2026

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

Current research agent benchmarks miss critical flaws, as MiroEval reveals that process quality is a reliable predictor of research outcome, and multimodal tasks expose weaknesses invisible to output-level metrics.

Fangda Ye, Yuxin Hu, Pengxiang Zhu +23

Eval Frameworks & Benchmarks Multimodal Models Tool Use & Agents

Search

Lidong Bing

Research focus

Frequent co-authors

Papers (3)