Pengyu Zou

Nanjing University

Papers on Lattice

Total citations

Topics

Research focus

Code Generation & Program Synthesis (2)Tool Use & Agents (2)Eval Frameworks & Benchmarks (1)Interpretability & Mechanistic Interp (1)

Frequent co-authors

He Ye (2)Zhaoyang Chu (1)Jiarui Hu (1)Chao Peng (1)

Papers (2)

May 21, 2026

May 21, 2026·also NJU, Tencent AI

TerminalWorld: Benchmarking Agents on Real-World Terminal Tasks

Today's agents are surprisingly bad at real-world terminal tasks, with even frontier models failing nearly 40% of the time on everyday workflows.

Zhaoyang Chu, Jiarui Hu, Pengyu Zou +6

Code Generation & Program Synthesis Eval Frameworks & Benchmarks Tool Use & Agents

Apr 13, 2026

Yifan Yao +12Apr 13, 2026·also NJU, UCL

CodeTracer: Towards Traceable Agent States

Debugging complex code agents just got easier: CodeTracer reconstructs full state transition histories, pinpointing failure origins and enabling recovery of failed runs.

Yifan Yao, Letian Zhu, Rili Feng +10

Code Generation & Program Synthesis Interpretability & Mechanistic Interp Tool Use & Agents

Search

Pengyu Zou

Research focus

Frequent co-authors

Papers (2)