Miao Lu

Research focus

Natural Language Processing (1), RLHF & Preference Learning (1), Tool Use & Agents (1)

Frequent co-authors

Weiwei Sun (1), Weihua Du (1), Zhan Ling (1), Xuesong Yao (1)

Papers (1)

Oct 8, 2025

CMU ML

Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management

This RL method uses LLM-generated summaries to manage context while training agents on long-horizon tasks, achieving higher success rates with less context.

Miao Lu, Weiwei Sun, Weihua Du +48

Natural Language Processing, RLHF & Preference Learning, Tool Use & Agents
