Changsheng Zhao

Research focus

Tool Use & Agents (2); Eval Frameworks & Benchmarks (1); Inference & Quantization (1); Architecture Design (Transformers, SSMs, MoE) (1); World Models & Planning (1)

Frequent co-authors

Ernie Chang (4), Vikas Chandra (4), Mingchen Zhuge (3), Zechun Liu (3)

Papers (4)

Apr 14, 2026

RPRA: Predicting an LLM-Judge for Efficient but Performant Inference

Smaller LLMs can learn to predict when they'll fail, paving the way for efficient "ask for help" systems that rival the performance of much larger models.

Dylan R. Ashley, Gaël Le Lan, Changsheng Zhao +7

Eval Frameworks & Benchmarks; Inference & Quantization; Tool Use & Agents

Apr 7, 2026

Neural Computers

Forget agents and world models – the future of computing could be learned directly from I/O traces, turning the model itself into the computer.

Mingchen Zhuge, Changsheng Zhao, Haozhe Liu +14

Architecture Design (Transformers, SSMs, MoE); Tool Use & Agents; World Models & Planning

Mar 19, 2026

dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

Scale up offline policy training for diffusion LLMs without breaking the bank: dTRPO slashes trajectory computation costs while boosting performance up to 9.6% on STEM tasks.

Wenxuan Zhang, Lemeng Wu, Changsheng Zhao +11

Natural Language Processing; RLHF & Preference Learning; Training Efficiency & Optimization

Sep 29, 2025

Meta AI

MobileLLM-R1: Exploring the Limits of Sub-Billion Language Model Reasoners with Open Training Recipes

Forget scaling laws: this work shows you can get SOTA reasoning from sub-billion parameter models with *less* data, if you're smart about curation and resampling.

Changsheng Zhao, Ernie Chang, Zechun Liu +8

Open-Source Models & Weights; Reasoning & Chain-of-Thought; Scaling Laws & Emergent Abilities
