Search papers, labs, and topics across Lattice.
LLMs can be pushed to generalize beyond their initial problem constraints by actively searching for adversarial test cases that expose logical divergences in generated code.
LLM agent progress increasingly hinges on better external cognitive infrastructure, not just stronger models.
Synergy's architecture lets agents evolve through experience by proactively recalling rewarded trajectories, hinting at a new way to build agents that learn and adapt in open, collaborative environments.
LMM-based GUI agents stick out like a sore thumb in human-centric mobile environments, but simple techniques can make them blend in without sacrificing their utility.
By steering token selection at the logit level, LogitsCoder produces more efficient, higher-quality reasoning chains for code generation, outperforming existing methods.
GUI agents learn faster and generalize better with a new reward shaping technique that dynamically adapts to successful exploration trajectories, outperforming fixed reward schemes.