World models can now self-improve by identifying their own prediction errors, thanks to a clever decomposition of action-conditioned prediction into easier-to-verify components.
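To make the idea concrete, here is a minimal numpy sketch of that decomposition under strong simplifying assumptions: a linear world model whose next-state prediction is split into a passive-dynamics component (verifiable on no-op transitions) and an action-effect residual (verifiable given the first). The names `A_hat`/`B_hat`, the linear dynamics, and the error-driven update rule are all illustrative assumptions, not the paper's actual method.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical linear environment: next_state = A @ state + B @ action.
A_true, B_true = rng.normal(size=(4, 4)), rng.normal(size=(4, 2))
A_hat, B_hat = np.zeros((4, 4)), np.zeros((4, 2))   # the learned world model

def passive_error(s, s_next_passive):
    # Component 1: action-free dynamics, checked against no-op transitions.
    return s_next_passive - A_hat @ s

def action_effect_error(s, a, s_next):
    # Component 2: the action's residual effect, checked given component 1.
    return (s_next - A_hat @ s) - B_hat @ a

lr = 0.05
for step in range(2000):
    s, a = rng.normal(size=4), rng.normal(size=2)
    s_next = A_true @ s + B_true @ a      # observed transition
    s_next_passive = A_true @ s           # observed no-op transition
    # Each component's own prediction error drives its update:
    # the model improves itself by verifying the easier sub-predictions.
    A_hat += lr * np.outer(passive_error(s, s_next_passive), s)
    B_hat += lr * np.outer(action_effect_error(s, a, s_next), a)

print(np.abs(A_true - A_hat).max(), np.abs(B_true - B_hat).max())  # both -> ~0
```

The design point the sketch tries to capture: each component's error is identifiable in isolation, so the model never has to verify a full action-conditioned prediction at once.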
Stop reward hacking: disentangling causal and non-causal factors in reward models makes RLHF more robust.
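Below is a minimal sketch of the disentangling idea, again under stated assumptions: a linear reward model over a quality-bearing feature and a non-causal one (verbosity) that merely correlates with quality in the preference data. The invariance-penalty construction, the feature names, and all constants are illustrative assumptions, not a specific paper's recipe.

```python
import numpy as np

rng = np.random.default_rng(1)

n = 5000
quality = rng.normal(size=n)                      # latent true quality
proxy = quality + 0.7 * rng.normal(size=n)        # causal feature: noisy quality signal
verbosity = quality + 0.7 * rng.normal(size=n)    # non-causal factor, spuriously correlated
X = np.stack([proxy, verbosity], axis=1)
label = quality + 0.1 * rng.normal(size=n)        # preference score

def fit(X, y):
    w, *_ = np.linalg.lstsq(X, y, rcond=None)
    return w

# Naive reward model: leans on verbosity, since it predicts quality in this data.
w_naive = fit(X, label)

# Disentangling via an invariance penalty: intervene on verbosity alone and
# require the reward not to change, driving the non-causal weight toward zero.
verbosity_cf = rng.permutation(verbosity)
lam = 10.0
invariance_rows = np.sqrt(lam) * np.stack(
    [np.zeros(n), verbosity - verbosity_cf], axis=1)
w_causal = fit(np.vstack([X, invariance_rows]),
               np.concatenate([label, np.zeros(n)]))

# A reward-hacked response: no real quality gain, inflated verbosity.
hack = np.array([0.0, 3.0])
print("naive reward :", hack @ w_naive)    # clearly positive: hacking pays
print("causal reward:", hack @ w_causal)   # near zero: hacking no longer pays
```

The takeaway mirrors the headline: once the reward model is constrained to be invariant under interventions on non-causal factors, inflating those factors stops being a winning policy for RLHF.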