SEUSYSUMar 10, 2026arXiv:2603.09290

ToolRosetta: Bridging Open-Source Repositories and Large Language Model Agents through Automated Tool Standardization

Xujie Yuan, Hanghui Guo, Zhangze Chen, Jia Zhu, Yong Rui

AI Summary

ToolRosetta automates the conversion of open-source code repositories into Model Context Protocol (MCP)-compatible tools usable by LLMs, addressing the scalability bottleneck of manual tool standardization. It autonomously plans toolchains, identifies relevant codebases, converts them into executable MCP services, and incorporates a security inspection layer. Experiments across scientific domains show ToolRosetta improves task completion performance by leveraging specialized open-source tools, outperforming commercial LLMs and existing agent systems.

Key Contribution

Automating the messy process of turning open-source code into LLM tools unlocks a new level of agent capabilities, outperforming even commercial LLMs.

Abstract

Reusing and invoking existing code remains costly and unreliable, as most practical tools are embedded in heterogeneous code repositories and lack standardized, executable interfaces. Although large language models (LLMs) and Model Context Protocol (MCP)-based tool invocation frameworks enable natural language task execution, current approaches rely heavily on manual tool curation and standardization, which fundamentally limits scalability. In this paper, we propose ToolRosetta, a unified framework that automatically translates open-source code repositories and APIs into MCP-compatible tools that can be reliably invoked by LLMs. Given a user task, ToolRosetta autonomously plans toolchains, identifies relevant codebases, and converts them into executable MCP services, enabling end-to-end task completion with minimal human intervention. In addition, ToolRosetta incorporates a security inspection layer to mitigate risks inherent in executing arbitrary code. Extensive experiments across diverse scientific domains demonstrate that ToolRosetta can automatically standardize a large number of open-source tools and reduce the human effort required for code reproduction and deployment. Notably, by seamlessly leveraging specialized open-source tools, ToolRosetta-powered agents consistently improve task completion performance compared to commercial LLMs and existing agent systems.

Code Generation & Program Synthesis Open-Source Models & Weights Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References29

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

ToolRosetta: Bridging Open-Source Repositories and Large Language Model Agents through Automated Tool Standardization

Related Papers