Apr 20, 2026arXiv:2604.17827

Learning to Seek Help: Dynamic Collaboration Between Small and Large Language Models

Hang Zeng, Xiangyu Liu, Yong Hu, Chaoyue Niu, Jiarui Zhang, Shaojie Tang, Fan Wu, Guihai Chen

AI Summary

This paper introduces a dynamic collaboration framework where a small language model (SLM) learns to strategically request assistance from a large language model (LLM) during multi-step reasoning tasks. The LLM provides adaptive feedback, enabling a more nuanced interaction than static pipelines. Experiments demonstrate that stronger SLMs become more independent, while stronger LLMs facilitate fewer but more informative interactions, leading to performance gains and robust transferability to unseen LLMs.

Key Contribution

Forget static pipelines: SLMs can learn to dynamically seek help from LLMs, leading to better performance and transferability.

Abstract

Large language models (LLMs) offer strong capabilities but raise cost and privacy concerns, whereas small language models (SLMs) facilitate efficient and private local inference yet suffer from limited capacity. To synergize the complementary strengths, we introduce a dynamic collaboration framework, where an SLM learns to proactively decide how to request an LLM during multi-step reasoning, while the LLM provides adaptive feedback instead of acting as a passive tool. We further systematically investigate how collaboration strategies are shaped by SLM and LLM capabilities as well as efficiency and privacy constraints. Evaluation results reveal a distinct scaling effect: stronger SLMs become more self-reliant, while stronger LLMs enable fewer and more informative interactions. In addition, the learned dynamic collaboration strategies significantly outperform static pipelines and standalone inference, and transfer robustly to unseen LLMs.

Natural Language Processing Reasoning & Chain-of-Thought Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Learning to Seek Help: Dynamic Collaboration Between Small and Large Language Models

Related Papers