Mar 31, 2026 | arXiv:2603.29656

6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management

Jiao Chen, Jianhua Tang, Xiaotong Yang, Zuohong Lv

AI Summary

6GAgentGym is introduced as a closed-loop environment for training agents in autonomous 6G network management, featuring 42 typed tools and a learned Experiment Model calibrated on NS-3 simulation data. The framework uses 6G-Forge to bootstrap training trajectories from NS-3 seeds via iterative Self-Instruct generation, with execution verification against the Experiment Model. Supervised fine-tuning on this data, followed by reinforcement learning with online closed-loop interaction, allows an 8B open-source model to reach an overall success rate comparable to GPT-5 on 6GAgentBench, with stronger performance on long-horizon tasks.

Key Contribution

An 8B open-source model, trained in a new closed-loop environment for 6G network management, achieves an overall success rate comparable to GPT-5 on 6GAgentBench, suggesting a viable path toward autonomous network control.

Abstract

Autonomous 6G network management requires agents that can execute tools, observe the resulting state changes, and adapt their decisions accordingly. Existing benchmarks based on static questions or scripted episode replay, however, do not support such closed-loop interaction, limiting agents to passive evaluation without the ability to learn from environmental feedback. This paper presents 6GAgentGym to provide closed-loop capability. The framework provides an interactive environment with 42 typed tools whose effect classification distinguishes read-only observation from state-mutating configuration, backed by a learned Experiment Model calibrated on NS-3 simulation data. 6G-Forge bootstraps closed-loop training trajectories from NS-3 seeds via iterative Self-Instruct generation with execution verification against the Experiment Model. Supervised fine-tuning on the resulting corpus, followed by reinforcement learning with online closed-loop interaction, enables an 8B open-source model to achieve an overall success rate comparable to GPT-5 on the accompanying 6GAgentBench, with stronger performance on long-horizon tasks. Together, these components provide a viable path toward autonomous, closed-loop network management.
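To make the distinction between read-only observation and state-mutating configuration concrete, the following is a minimal Python sketch of a closed-loop tool environment. It is not the 6GAgentGym API: the names `Effect`, `Tool`, `ToyEnv`, `get_tx_power`, `set_tx_power`, and the `tx_power_dbm` state key are all hypothetical, and a plain dictionary stands in for the learned Experiment Model.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Dict, List

State = Dict[str, float]


class Effect(Enum):
    READ_ONLY = "read_only"  # observes state without changing it
    MUTATING = "mutating"    # changes network configuration


@dataclass(frozen=True)
class Tool:
    """A typed tool: a name, an effect class, and a callable (all hypothetical)."""
    name: str
    effect: Effect
    fn: Callable[[State, State], State]


class ToyEnv:
    """Toy closed-loop environment; a dict stands in for the Experiment Model."""

    def __init__(self, tools: List[Tool]) -> None:
        self.state: State = {"tx_power_dbm": 20.0}
        self.tools = {t.name: t for t in tools}
        self.mutations = 0  # counts state-changing calls

    def step(self, name: str, args: State) -> State:
        tool = self.tools[name]
        result = tool.fn(self.state, args)
        # Only tools classified as MUTATING are allowed to write to the state.
        if tool.effect is Effect.MUTATING:
            self.state.update(result)
            self.mutations += 1
        return result


def get_tx_power(state: State, args: State) -> State:
    return {"tx_power_dbm": state["tx_power_dbm"]}


def set_tx_power(state: State, args: State) -> State:
    return {"tx_power_dbm": float(args["tx_power_dbm"])}


TOOLS = [
    Tool("get_tx_power", Effect.READ_ONLY, get_tx_power),
    Tool("set_tx_power", Effect.MUTATING, set_tx_power),
]


def run_episode():
    """Observe, act, observe again: the closed loop in miniature."""
    env = ToyEnv(TOOLS)
    before = env.step("get_tx_power", {})
    env.step("set_tx_power", {"tx_power_dbm": 23.0})
    after = env.step("get_tx_power", {})
    return before, after, env.mutations
```

In this sketch the agent's second observation reflects its own earlier action, which is the property that static question benchmarks and scripted episode replay cannot provide.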

Data Curation & Synthetic Data, Eval Frameworks & Benchmarks, Tool Use & Agents

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
