Bo An

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

RLHF & Preference Learning (1)Tool Use & Agents (1)Inference & Quantization (1)Robotics & Embodied AI (1)

Frequent co-authors

Xin Cheng (1)Shuo He (1)Lang Feng (1)HaiYang Xu (1)

Papers (3)

May 26, 2026

Xin Cheng +52w ago

Beyond Trajectory-Level Attribution: Graph-Based Credit Assignment for Agentic Reinforcement Learning

Don't let valuable steps in failed trajectories go unnoticed: GraphGPO leverages state-transition graphs for fine-grained credit assignment in agentic RL, boosting performance and efficiency.

Xin Cheng, Shuo He, Lang Feng +3

RLHF & Preference Learning Tool Use & Agents

NUS2w ago·also CUHK, NTU, UNC

Adversarial Dual On-Policy Distillation from Expressive Flow-based Teacher

Flow-based imitation learning can be significantly improved by distilling both rewards and actions on-policy, enabling more robust and generalizable policies, especially with limited or noisy demonstrations.

Zhenglin Wan, Jingxuan Wu, Xingrui Yu +4

Inference & Quantization Robotics & Embodied AI Training Efficiency & Optimization

Apr 12, 2026

Apr 12, 2026·also NTU

AdverMCTS: Combating Pseudo-Correctness in Code Generation via Adversarial Monte Carlo Tree Search

LLMs can be forced to generalize beyond initial constraints by actively searching for adversarial test cases that expose logical divergences in generated code.

Qingyao Li, Weiwen Liu, Weinan Zhang +1

Code Generation & Program Synthesis Eval Frameworks & Benchmarks Red-Teaming & Adversarial Robustness