Chuanyu Qin

Papers on Lattice

Total citations

Topics

h-index

Research focus

Training Efficiency & Optimization (3)RLHF & Preference Learning (3)Inference & Quantization (2)Computer Vision (1)Multimodal Models (1)

Frequent co-authors

Naibin Gu (4)Chenxu Yang (4)Qingyi Si (4)Dingyu Yao (4)

Papers (4)

Apr 29, 2026

Co-Evolving Policy Distillation

By co-evolving experts through bidirectional policy distillation, CoPD achieves all-in-one integration of text, image, and video reasoning, outperforming domain-specific experts and suggesting a new training paradigm.

Naibin Gu, Chenxu Yang, Qingyi Si +7

Inference & Quantization Training Efficiency & Optimization

Apr 22, 2026

Apr 22, 2026·also BAAI

Near-Future Policy Optimization

Forget external teachers – the best way to boost your RL model's performance is to learn from its future self.

Chuanyu Qin, Chenxu Yang, Chen Yang +9

RLHF & Preference Learning Training Efficiency & Optimization

Apr 18, 2026

EasyVideoR1: Easier RL for Video Understanding

EasyVideoR1 achieves a 1.47 times throughput improvement in video understanding tasks by eliminating redundant video decoding and leveraging a comprehensive task-aware reward system.

Chuanyu Qin, Chenxu Yang, Qingyi Si +4

Computer Vision Multimodal Models RLHF & Preference Learning

Apr 3, 2026

Chenxu Yang +9Apr 3, 2026

Self-Distilled RLVR

Self-distillation in LLMs can leak information and destabilize training, but combining it with verifiable rewards yields a sweet spot for improved convergence and stability.

Chenxu Yang, Chuanyu Qin, Qingyi Si +7

Inference & Quantization RLHF & Preference Learning Training Efficiency & Optimization

Search

Chuanyu Qin

Research focus

Frequent co-authors

Papers (4)