Jun 14, 2026 · arXiv:2606.16038

Open-SWE-Traces: Advancing Dual-Mode Multilingual Distillation for Software Engineering Agents

Wasi Uddin Ahmad, Nikolai Ludwig, Somshubra Majumdar, Boris Ginsburg

AI Summary

This paper introduces Open-SWE-Traces, a comprehensive dataset comprising 207,489 agentic trajectories across nine programming languages, sourced from real-world pull requests. The dataset employs a hybrid-reasoning synthesis approach, combining Minimax-M2.5 for explicit reasoning and Qwen3.5-122B for high-quality non-reasoning traces, enabling the training of models that can perform long-horizon reasoning in software engineering tasks. Validation through fine-tuning the Qwen3-30B-A3B series demonstrates significant performance improvements, with resolve rates reaching up to 61.7% on SWE-bench Verified, highlighting the dataset's potential to enhance autonomous software engineering capabilities.

Key Contribution

Open-SWE-Traces reveals that a hybrid approach to trajectory data can significantly boost the reasoning capabilities of software engineering agents.

Abstract

The path toward autonomous software engineering is currently bottlenecked by a severe deficit of diverse, large-scale trajectory data. We address this by introducing Open-SWE-Traces, an expansive dataset of 207,489 agentic trajectories spanning nine programming languages (Python, Go, TS, JS, Rust, Java, PHP, C, C++). Sourced from 20,000 real-world PRs via OpenHands and SWE-agent harnesses, the dataset utilizes a hybrid-reasoning synthesis: Minimax-M2.5 generates trajectories with explicit "thinking" processes, while Qwen3.5-122B provides high-quality "non-thinking" traces. Filtered for permissive licenses (MIT, Apache, BSD) from SWE-rebench-V2, this data facilitates the training of models capable of long-horizon reasoning. We validate the dataset by fine-tuning the Qwen3-30B-A3B series (Thinking, Instruct, and Coder). The best performing model achieves resolve rates of 61.7% on SWE-bench Verified, 57.1% on SWE-bench Multilingual, and 36.8% on SWE-bench Pro. These results establish Open-SWE-Traces as a premier resource for distilling human-level software engineering capabilities into efficient, open-source agentic LLMs.
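The license filtering and the thinking/non-thinking split described in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the record fields (`repo`, `license`, `mode`), the license spellings, and the helper names are all hypothetical, chosen only to show the idea of keeping permissively licensed sources and counting trajectories per reasoning mode.

```python
from collections import Counter

# License families named in the abstract; matching on a lowercase prefix
# is an assumption made for this sketch, not the paper's stated method.
PERMISSIVE = ("mit", "apache", "bsd")


def is_permissive(license_name: str) -> bool:
    """Return True if the license string starts with a permissive family name."""
    return license_name.strip().lower().startswith(PERMISSIVE)


def filter_and_count(records):
    """Keep permissively licensed records and count them per reasoning mode."""
    kept = [r for r in records if is_permissive(r["license"])]
    return kept, Counter(r["mode"] for r in kept)


# Hypothetical records; field names and values are illustrative only.
records = [
    {"repo": "a/x", "license": "MIT", "mode": "thinking"},
    {"repo": "b/y", "license": "GPL-3.0", "mode": "thinking"},
    {"repo": "c/z", "license": "Apache-2.0", "mode": "non-thinking"},
    {"repo": "d/w", "license": "BSD-3-Clause", "mode": "non-thinking"},
]

kept, per_mode = filter_and_count(records)
```

In this toy input the GPL-licensed record is dropped, and the remaining records are tallied by mode, which is the kind of bookkeeping a dual-mode dataset would need before mixing the two trace types for fine-tuning.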

Code Generation & Program Synthesis · Data Curation & Synthetic Data · Tool Use & Agents

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
