Tsinghua AIHITHuaweiApr 13, 2026arXiv:2604.11188

MathAgent: Adversarial Evolution of Constraint Graphs for Mathematical Reasoning Data Synthesis

Zixiong Yu, Jun Rao, Guhan Chen, Songtao Tian, Bohan Li, Jiansheng Wei, Xiaojun Meng

AI Summary

MathAgent introduces a hierarchical framework for synthesizing mathematical reasoning data by adversarially evolving constraint graphs that represent problem specifications. A "Legislator" module optimizes these constraint graphs, while an "Executor" module instantiates them into natural language scenarios, decoupling logical structure design from linguistic realization. Fine-tuning models on just 1K samples synthesized with MathAgent outperforms models trained on existing datasets of comparable size (LIMO, s1K) across eight mathematical benchmarks, demonstrating improved out-of-distribution generalization.

Key Contribution

Forget human-annotated datasets: MathAgent synthesizes mathematical reasoning data so effectively that models trained on just 1K generated examples outperform those trained on existing datasets.

Abstract

Synthesizing high-quality mathematical reasoning data without human priors remains a significant challenge. Current approaches typically rely on seed data mutation or simple prompt engineering, often suffering from mode collapse and limited logical complexity. This paper proposes a hierarchical synthesis framework that formulates data synthesis as an unsupervised optimization problem over a constraint graph followed by semantic instantiation, rather than treating it as a direct text generation task. We introduce a Legislator-Executor paradigm: The Legislator adversarially evolves structured generation blueprints encoding the constraints of the problem, while the Executor instantiates these specifications into diverse natural language scenarios. This decoupling of skeleton design from linguistic realization enables a prioritized focus on constructing complex and diverse logical structures, thereby guiding high-quality data synthesis. Experiments conducted on a total of 10 models across the Qwen, Llama, Mistral, and Gemma series demonstrate that our method achieves notable results: models fine-tuned on 1K synthesized samples outperform widely-used datasets of comparable scale (LIMO, s1K) across eight mathematical benchmarks, exhibiting superior out-of-distribution generalization.

Data Curation & Synthetic Data Reasoning & Chain-of-Thought Red-Teaming & Adversarial Robustness

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

MathAgent: Adversarial Evolution of Constraint Graphs for Mathematical Reasoning Data Synthesis

Related Papers