Search papers, labs, and topics across Lattice.
This paper introduces a dynamic collaboration framework where a small language model (SLM) learns to strategically request assistance from a large language model (LLM) during multi-step reasoning tasks. The LLM provides adaptive feedback, enabling a more nuanced interaction than static pipelines. Experiments demonstrate that stronger SLMs become more independent, while stronger LLMs facilitate fewer but more informative interactions, leading to performance gains and robust transferability to unseen LLMs.
Forget static pipelines: SLMs can learn to dynamically seek help from LLMs, leading to better performance and transferability.
Large language models (LLMs) offer strong capabilities but raise cost and privacy concerns, whereas small language models (SLMs) facilitate efficient and private local inference yet suffer from limited capacity. To synergize the complementary strengths, we introduce a dynamic collaboration framework, where an SLM learns to proactively decide how to request an LLM during multi-step reasoning, while the LLM provides adaptive feedback instead of acting as a passive tool. We further systematically investigate how collaboration strategies are shaped by SLM and LLM capabilities as well as efficiency and privacy constraints. Evaluation results reveal a distinct scaling effect: stronger SLMs become more self-reliant, while stronger LLMs enable fewer and more informative interactions. In addition, the learned dynamic collaboration strategies significantly outperform static pipelines and standalone inference, and transfer robustly to unseen LLMs.