Shuo He

Nanyang Technological University, Singapore

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

RLHF & Preference Learning (2)Tool Use & Agents (2)Constitutional AI & AI Ethics (1)Reasoning & Chain-of-Thought (1)

Frequent co-authors

Jianan Chen (1)Zhifang Zhang (1)Linan Yue (1)Lei Feng (1)

Papers (3)

Mar 18, 2026

Mar 18, 2026·also NTU, UQ

Towards Safer Large Reasoning Models by Promoting Safety Decision-Making before Chain-of-Thought Generation

Chain-of-thought prompting makes large language models smarter, but it also makes them less safe, a problem this paper tackles by forcing models to think about safety *before* reasoning.

Jianan Chen, Zhifang Zhang, Shuo He +2

Constitutional AI & AI Ethics Reasoning & Chain-of-Thought Red-Teaming & Adversarial Robustness

Feb 26, 2026

Feb 26, 2026·also NSFC, SEU

Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks

Context inconsistency in stepwise group-based RL can severely bias advantage estimation, but a hierarchical grouping strategy can fix it without extra compute.

Shuo He, Shuo He, Lang Feng +4

RLHF & Preference Learning Tool Use & Agents World Models & Planning

Feb 19, 2026

Phase-Aware Mixture of Experts for Agentic Reinforcement Learning

Overcome simplicity bias in RL agents with PA-MoE, a mixture-of-experts architecture that learns task phases directly from the RL objective, leading to better expert specialization.

Shengtian Yang, Shuo He

Architecture Design (Transformers, SSMs, MoE)RLHF & Preference Learning Tool Use & Agents

Search

Shuo He

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)