This paper introduces the Agentic Learning Ecosystem (ALE), an open-source infrastructure comprising ROLL (a post-training framework), ROCK (a sandbox environment manager), and iFlow CLI (an agent framework), designed to streamline agentic model development. They release ROME, an agent trained within ALE on over a million trajectories, utilizing data composition protocols for complex behavior synthesis and a novel Interaction-Perceptive Agentic Policy Optimization (IPA) algorithm for improved long-horizon training. Empirical evaluations on benchmarks like SWE-bench Verified and Terminal Bench Pro demonstrate ROME's strong performance, validating the effectiveness of the ALE ecosystem.
An open-source ecosystem for agentic learning, complete with a trained agent and novel policy optimization, promises to accelerate research by providing a standardized, scalable platform.
Agentic crafting requires LLMs to operate in real-world environments over multiple turns by taking actions, observing outcomes, and iteratively refining artifacts. Despite its importance, the open-source community lacks a principled, end-to-end ecosystem to streamline agent development. We introduce the Agentic Learning Ecosystem (ALE), a foundational infrastructure that optimizes the production pipeline for agentic models. ALE consists of three components: ROLL, a post-training framework for weight optimization; ROCK, a sandbox environment manager for trajectory generation; and iFlow CLI, an agent framework for efficient context engineering. We release ROME, an open-source agent grounded by ALE and trained on over one million trajectories. Our approach includes data composition protocols for synthesizing complex behaviors and a novel policy optimization algorithm, Interaction-Perceptive Agentic Policy Optimization (IPA), which assigns credit over semantic interaction chunks rather than individual tokens to improve long-horizon training stability. Empirically, we evaluate ROME within a structured setting and introduce Terminal Bench Pro, a benchmark with improved scale and contamination control. ROME demonstrates strong performance across benchmarks like SWE-bench Verified and Terminal Bench, confirming the effectiveness of ALE.
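The abstract's core algorithmic idea in IPA is assigning credit at the granularity of semantic interaction chunks (e.g., whole agent turns) rather than individual tokens. The sketch below illustrates that general idea only; it is not the paper's actual IPA implementation, and the function name, reward representation, and chunk-boundary convention are all assumptions for illustration.

```python
# Hypothetical sketch of chunk-level credit assignment (NOT the paper's
# IPA algorithm): per-token rewards are aggregated into one return per
# interaction chunk, returns-to-go are computed over chunks, and each
# chunk's value is broadcast back to its tokens.

def chunk_level_advantages(token_rewards, chunk_bounds, gamma=1.0):
    """token_rewards: per-token scalar rewards for one trajectory.
    chunk_bounds: ordered (start, end) token-index pairs, one pair per
    interaction chunk, covering the trajectory without overlap."""
    # One return per chunk: the sum of that chunk's token rewards.
    chunk_returns = [sum(token_rewards[s:e]) for s, e in chunk_bounds]

    # Discounted return-to-go computed over chunks, not tokens, so
    # credit flows at the granularity of whole interactions.
    advantages = [0.0] * len(chunk_returns)
    running = 0.0
    for i in reversed(range(len(chunk_returns))):
        running = chunk_returns[i] + gamma * running
        advantages[i] = running

    # Broadcast each chunk's advantage to every token inside it.
    per_token = [0.0] * len(token_rewards)
    for (s, e), adv in zip(chunk_bounds, advantages):
        for t in range(s, e):
            per_token[t] = adv
    return per_token
```

With two chunks covering tokens 0–1 and 2–3 and rewards `[1, 0, 2, 0]`, all tokens in a chunk receive the same advantage, which is the intended contrast with per-token credit assignment.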