Shijin Gong

University of Science and Tech- nology

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Reasoning & Chain-of-Thought (3)RLHF & Preference Learning (2)Training Efficiency & Optimization (2)Eval Frameworks & Benchmarks (1)

Frequent co-authors

Erhan Xu (2)Kai Ye (2)Giulia Livieri (2)Chengchun Shi (2)

Papers (3)

May 26, 2026

University of Science and Tech- nology3w ago·also London School of Economics and Political, University of Ox- ford

BASIS: Batchwise Advantage Estimation from Single-Rollout Information Sharing for LLM Reasoning

Single-rollout RL can rival multi-rollout performance for LLM reasoning, thanks to a new batchwise advantage estimation technique that dramatically improves value function accuracy.

Shijin Gong, Erhan Xu, Kai Ye +3

Reasoning & Chain-of-Thought RLHF & Preference Learning Training Efficiency & Optimization

May 24, 2026

Pingfan Su +6May 24, 2026·also London School of Economics and Political, MiniMax, University of Science and Tech- nology

READER: Reasoning-Enhanced AI-Generated Text Detection

Reasoning beats scale: a 1.5B parameter model, READER, outperforms models 100-1000x larger in detecting AI-generated text by explicitly generating a rationale for its decision.

Pingfan Su, Kai Ye, Shijin Gong +4

Eval Frameworks & Benchmarks Natural Language Processing Reasoning & Chain-of-Thought

Apr 30, 2026

University of Science and Tech- nologyApr 30, 2026·also Tsinghua AI

Kernelized Advantage Estimation: From Nonparametric Statistics to LLM Reasoning

Kernel smoothing, a classic technique from nonparametric statistics, can make reinforcement learning with LLMs more sample efficient.

Shijin Gong, Kai Ye, Jin Zhu +1