Jianhong Tu

No single AI model dominates across all professional industries, revealing distinct occupational capability profiles and highlighting the need for specialized AI development.

Jianhong Tu, Yang Su

Eval Frameworks & Benchmarks Tool Use & Agents World Models & Planning

Feb 16, 2026

DAMOFeb 16, 2026·also Tsinghua AI

WebWorld: A Large-Scale World Model for Web Agent Training

Training web agents in a simulator can now match real-world performance: Qwen3-14B, fine-tuned with WebWorld-synthesized trajectories, rivals GPT-4o on WebArena.

Zikai Xiao, Jianhong Tu, Chuhang Zou +1

Data Curation & Synthetic Data Tool Use & Agents World Models & Planning

Oct 30, 2025

Tsinghua AIOct 30, 2025·also DAMO, CAS, Shenzhen University of Advanced Technology

ToolRM: Towards Agentic Tool-Use Reward Modeling

ToolRMs drastically improve tool-use accuracy in LLMs, outperforming existing models by up to 17.94%, while also reducing output token usage by over 66% through efficient inference-time scaling.

Renhao Li, Jianhong Tu, Yang Su +6

RLHF & Preference Learning Tool Use & Agents

Search

Jianhong Tu

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (5)