Shengtian Yang

Southeast University

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Tool Use & Agents (2)World Models & Planning (1)Architecture Design (Transformers, SSMs, MoE) (1)RLHF & Preference Learning (1)

Frequent co-authors

Shuo He (2)Guangfeng Cai (1)Kaibing Yang (1)Jiaqi Lv (1)

Papers (2)

Jun 24, 2026

Beyond Next-Observation Prediction: Agent-Authored World Modeling for Sequential Decision Making

Decision-aware training signals outperform traditional next-observation predictions, leading to more effective learning in LLM agents.

Guangfeng Cai, Kaibing Yang, Shuo He +3

Tool Use & Agents World Models & Planning

Feb 19, 2026

Feb 19, 2026·also NTU

Phase-Aware Mixture of Experts for Agentic Reinforcement Learning

Overcome simplicity bias in RL agents with PA-MoE, a mixture-of-experts architecture that learns task phases directly from the RL objective, leading to better expert specialization.

Shengtian Yang, Shuo He

Architecture Design (Transformers, SSMs, MoE)RLHF & Preference Learning Tool Use & Agents

Search

Shengtian Yang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)