Pengjun Xie

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Natural Language Processing (2)RLHF & Preference Learning (1)Recommendation & Information Retrieval (1)Tool Use & Agents (1)

Frequent co-authors

Xin Guan (1)Xiaomeng Hu (1)Shen Huang (1)Zhenyi Wang (1)

Papers (2)

May 28, 2026

May 28, 2026·also DAMO, Guangdong University of Technology

EvoRubric: Self-Evolving Rubric-Driven RL for Open-Ended Generation

Forget static rubrics and expensive external models: EvoRubric co-evolves a single policy to generate both responses and the rubrics to evaluate them, outperforming traditional RLHF methods in open-ended generation tasks.

Xin Guan, Xiaomeng Hu, Shen Huang +6

Natural Language Processing RLHF & Preference Learning

Mar 29, 2026

Zhaopeng Feng +28Mar 29, 2026·also HKU, Tongyi Lab

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

LLM agents can achieve 3x faster web search and higher accuracy by dynamically routing between multiple context management strategies.

Zhaopeng Feng, Zhaopeng Feng, Liangcai Su +26

Natural Language Processing Recommendation & Information Retrieval Tool Use & Agents

Search

Pengjun Xie

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)