Stop reward hacking: disentangling causal and non-causal factors in reward models makes RLHF more robust.
An open-source ecosystem for agentic learning, complete with a trained agent and a novel policy-optimization method, promises to accelerate research by providing a standardized, scalable platform.