MLLMs can "think" with images, but their actions often fail to match their reasoning; this paper closes that gap with a training method that forces models to explain what they see.
Correcting a vision-language model's "hallucinations" can be as simple as pinpointing and editing the right intermediate representation, sidestepping costly retraining or dual-pass inference.
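The teaser doesn't spell out the mechanism, but the general idea it gestures at, editing a hidden activation at inference time instead of retraining, can be sketched in a few lines of PyTorch. Everything below is a hypothetical illustration of that generic recipe, not the paper's method: the toy model, the chosen layer, `steering_vector`, and `alpha` are all assumptions.

```python
import torch
import torch.nn as nn

# Illustrative stand-in "model" with addressable intermediate layers.
model = nn.Sequential(
    nn.Linear(16, 16),  # layer 0
    nn.ReLU(),
    nn.Linear(16, 16),  # layer 2: the intermediate representation we edit
    nn.ReLU(),
    nn.Linear(16, 4),   # output head
)

# Hypothetical correction direction; in practice such a vector would be
# derived from the model's own activations (e.g., contrasting faithful
# and hallucinated responses), not sampled at random.
steering_vector = torch.randn(16)
alpha = 0.5  # edit strength

def edit_activation(module, inputs, output):
    # Shift the layer's output along the correction direction at
    # inference time; no weights change and no second forward pass runs.
    return output + alpha * steering_vector

# Attach the edit to the chosen layer, then run one normal forward pass.
handle = model[2].register_forward_hook(edit_activation)
with torch.no_grad():
    logits = model(torch.randn(1, 16))
handle.remove()  # detach the edit when done
print(logits)
```

Returning a value from a PyTorch forward hook replaces the layer's output, which is what makes this a pure inference-time intervention: remove the hook and the original model is untouched.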