Wenshuo Peng

Papers on Lattice

Total citations

Topics

h-index

Research focus

Tool Use & Agents (2)Data Curation & Synthetic Data (1)Reasoning & Chain-of-Thought (1)Multimodal Models (1)RLHF & Preference Learning (1)

Frequent co-authors

Shuwen Xu (1)Jiaxiang Liu (1)Jun Zhao (1)Shitian Zhao (1)

Papers (3)

Mar 30, 2026

Shuwen Xu +3Mar 30, 2026

GraphWalker: Agentic Knowledge Graph Question Answering via Synthetic Trajectory Curriculum

Forget hand-crafted KG traversal policies: GraphWalker uses automatically synthesized trajectories to train agents that achieve SOTA performance and generalize to unseen reasoning paths.

Shuwen Xu, Jiaxiang Liu, Wenshuo Peng +1

Data Curation & Synthetic Data Reasoning & Chain-of-Thought Tool Use & Agents

Feb 24, 2026

Tsinghua AIFeb 24, 2026·also Shanghai AI Lab

PyVision-RL: Forging Open Agentic Vision Models via RL

Reinforcement learning for multimodal agents doesn't have to collapse into uselessness: PyVision-RL shows how to stabilize training and encourage multi-turn tool use.

Shitian Zhao, Shitian Zhao, Shaoheng Lin +5

Multimodal Models RLHF & Preference Learning Tool Use & Agents

Jan 21, 2026

HarmoniDPO: Video-guided Audio Generation via Preference-Optimized Diffusion

HarmoniDPO is proposed, a novel framework that integrates preference-based optimization into diffusion-based V2A generation and outperforms state-of-the-art methods in audio-video synchronization and subjective audio quality, offering a robust solution for generating realistic, human-preferred audio from video.

Wenshuo Peng, Kaipeng Zhang

Search

Wenshuo Peng

Research focus

Frequent co-authors

Papers (3)