Qineng Wang

Publication activity (papers/week, last 8 weeks)

Research focus

RLHF & Preference Learning (2), Reasoning & Chain-of-Thought (1), Tool Use & Agents (1), Constitutional AI & AI Ethics (1)

Frequent co-authors

Chi Gui (1), Chi Gui (1), Xing Jin (1), Licheng Liu (1)

Papers (2)

AI2 · Apr 7, 2026 · also BUPT

RAGEN-2: Reasoning Collapse in Agentic RL

LLM agents can appear to reason well (high entropy) while completely ignoring the input, and mutual information is a far better metric for catching this failure.

Chi Gui, Chi Gui, Xing Jin +13

Reasoning & Chain-of-Thought, RLHF & Preference Learning, Tool Use & Agents
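
The contrast in the RAGEN-2 summary above, between high entropy and actual dependence on the input, can be illustrated with a toy calculation. The following is a minimal sketch, not the paper's estimator: the helper names `entropy` and `mutual_information` and the toy inputs and trace labels are all hypothetical, chosen only to show that two agents can have the same trace entropy while sharing very different amounts of information with the input.

```python
import math
from collections import Counter


def entropy(samples):
    """Shannon entropy (in bits) of the empirical distribution of samples."""
    counts = Counter(samples)
    n = len(samples)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def mutual_information(pairs):
    """I(X; Z) in bits from (x, z) pairs, via H(X) + H(Z) - H(X, Z)."""
    xs = [x for x, _ in pairs]
    zs = [z for _, z in pairs]
    return entropy(xs) + entropy(zs) - entropy(pairs)


# Hypothetical toy data: x is an input, z is a label for a reasoning trace.
inputs = ["a", "b"]
traces = ["t1", "t2", "t3", "t4"]

# Agent that ignores the input: every trace is equally likely for every input.
collapsed = [(x, z) for x in inputs for z in traces]

# Agent whose traces depend on the input: each input has its own two traces.
grounded = [("a", "t1"), ("a", "t2"), ("b", "t3"), ("b", "t4")] * 2

h_collapsed = entropy([z for _, z in collapsed])
h_grounded = entropy([z for _, z in grounded])
mi_collapsed = mutual_information(collapsed)
mi_grounded = mutual_information(grounded)

print("trace entropy:", h_collapsed, h_grounded)
print("mutual information:", mi_collapsed, mi_grounded)
```

In this toy setup both agents have 2 bits of trace entropy, so entropy alone cannot tell them apart, while the mutual information is 0 bits for the input-ignoring agent and 1 bit for the input-dependent one.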

Feb 19, 2026 · also AI2, UW, Northwestern

ODESteer: A Unified ODE-Based Steering Framework for LLM Alignment

LLMs can be steered more effectively by viewing activation manipulation through the lens of ordinary differential equations and control theory, leading to significant gains on alignment benchmarks.

Hongjue Zhao, Haosen Sun, Jiangtao Kong +4

Constitutional AI & AI Ethics, RLHF & Preference Learning, Scalable Oversight & Alignment Theory
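
The ODE view of steering mentioned in the ODESteer summary above can be sketched generically. This is an illustration under stated assumptions, not the paper's method: `euler_steer`, the steering direction, the target, and the step sizes are all hypothetical. It shows only the general observation that adding a scaled direction to an activation is one forward-Euler step of dh/dt = v, and that a state-dependent field can be integrated over several smaller steps.

```python
def euler_steer(h, field, step, n_steps):
    """Integrate dh/dt = field(h) with forward Euler, starting from h."""
    h = list(h)
    for _ in range(n_steps):
        v = field(h)
        h = [hi + step * vi for hi, vi in zip(h, v)]
    return h


# Hypothetical activation and constant steering direction (illustrative values).
h0 = [0.0, 2.0]
direction = [1.0, -0.5]
alpha = 0.8


def constant_field(h):
    """Vector field that ignores the state and always returns `direction`."""
    return direction


# One Euler step of size alpha with a constant field is plain activation addition.
one_step = euler_steer(h0, constant_field, alpha, 1)
added = [hi + alpha * di for hi, di in zip(h0, direction)]

# A state-dependent field: pull the activation toward a hypothetical target.
target = [1.0, 1.0]


def pull_field(h):
    """Vector field pointing from the current state toward `target`."""
    return [ti - hi for ti, hi in zip(target, h)]


steered = euler_steer(h0, pull_field, 0.1, 50)

print("one Euler step:", one_step)
print("activation addition:", added)
print("multi-step steering:", steered)
```

With the constant field, the single Euler step and the direct addition give the same vector; with the state-dependent field, the multi-step integration moves the activation close to the chosen target.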
