Zhanqiu Zhang

Papers on Lattice


Publication activity (papers/week, last 8 weeks)

Research focus

Natural Language Processing (1), Reasoning & Chain-of-Thought (1), RLHF & Preference Learning (1)

Frequent co-authors

Xingwu Chen (1), Yiwen Guo (1), Difan Zou (1)

Papers (1)

Mar 5, 2026

Xingwu Chen +3, 1w ago

Breaking Contextual Inertia: Reinforcement Learning with Single-Turn Anchors for Stable Multi-Turn Interaction

LLMs get stuck in their ways: even explicit corrections can't break their rigid adherence to initial (incorrect) reasoning paths in multi-turn interactions, but a novel RL approach can fix it.

Xingwu Chen, Zhanqiu Zhang, Yiwen Guo +1

Natural Language Processing, Reasoning & Chain-of-Thought, RLHF & Preference Learning
