Licheng Liu

Papers on Lattice

Total citations

Topics

Research focus

Tool Use & Agents (2)Code Generation & Program Synthesis (1)Eval Frameworks & Benchmarks (1)Reasoning & Chain-of-Thought (1)RLHF & Preference Learning (1)

Frequent co-authors

CocoaBench Team (1)Shibo Hao (1)Zhiqi Liang (1)Yuheng Zha (1)

Papers (2)

Apr 13, 2026

CocoaBench Team +19Apr 13, 2026·also Microsoft Research, Hubei University, Hubei University), Institute of Foundation Models +3

CocoaBench: Evaluating Unified Digital Agents in the Wild

Today's best AI agents still fail more than half the time on real-world tasks combining vision, search, and coding, revealing critical gaps in reasoning and tool use.

CocoaBench Team, Shibo Hao, Zhiqi Liang +17

Code Generation & Program Synthesis Eval Frameworks & Benchmarks Tool Use & Agents

Apr 7, 2026

AI2Apr 7, 2026·also Stanford HAI, Oxford

RAGEN-2: Reasoning Collapse in Agentic RL

LLM agents can appear to reason well (high entropy) while completely ignoring the input, and mutual information is a far better metric for catching this failure.

Chi Gui, Chi Gui, Xing Jin +9

Reasoning & Chain-of-Thought RLHF & Preference Learning Tool Use & Agents

Search

Licheng Liu

Research focus

Frequent co-authors

Papers (2)