Naibin Gu

EasyVideoR1 achieves a 1.47 times throughput improvement in video understanding tasks by eliminating redundant video decoding and leveraging a comprehensive task-aware reward system.

Chuanyu Qin, Chenxu Yang, Qingyi Si +4

Computer Vision Multimodal Models RLHF & Preference Learning

Apr 14, 2026

1w ago·also Baidu, CAS

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Forget brute-force hinting: KnowRL distills knowledge into atomic units, then uses subset selection to find the *least* amount of guidance needed to supercharge LLM reasoning.

Linhao Yu, Tianmeng Yang, Siyu Ding +9

Reasoning & Chain-of-Thought RLHF & Preference Learning Training Efficiency & Optimization

Apr 3, 2026

Chenxu Yang +93w ago

Self-Distilled RLVR

Self-distillation in LLMs can leak information and destabilize training, but combining it with verifiable rewards yields a sweet spot for improved convergence and stability.

Chenxu Yang, Chuanyu Qin, Qingyi Si +7

Inference & Quantization RLHF & Preference Learning Training Efficiency & Optimization

Mar 16, 2026

Mar 16, 2026·also JD.com

Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models

LLMs can fail to generalize knowledge edits to instruction-following scenarios due to a "Covariance Trap," but RoSE unlocks robust interactive parametric memory by aligning representations and smoothing the optimization landscape.

Xiyu Liu, Zhengxiao Liu, Naibin Gu

Eval Frameworks & Benchmarks Natural Language Processing

Mar 5, 2026

Mar 5, 2026·also Baidu

Mixture of Universal Experts: Scaling Virtual Width via Depth-Width Transformation

Forget scaling depth and width—MOUE unlocks a new "virtual width" dimension for Mixture-of-Experts by cleverly reusing a single expert pool across layers.

Naibin Gu, Junyuan Shang, Zhenyu Zhang +7

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Training Efficiency & Optimization

Search

Naibin Gu

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (6)