College of Computer Science and Software Engineering, Shenzhen University, China; Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), China
Forget RLHF and massive datasets: SAGE co-evolves reasoning abilities in LLMs using only a small seed set and a clever quartet of self-improving agents.
LLM win rates in multi-agent games can double (from 25% to 50%) simply by optimizing the context provided during inference.
Ditch the likelihood approximations: LFPO directly optimizes denoising logits in diffusion LMs via contrastive updates, yielding faster inference and stronger performance on code and reasoning tasks.
LLMs learn faster and perform better when you optimize prompts and weights together, boosting performance by 30% and cutting interaction turns by 40%.