The paper introduces Supplement Generation Training (SGT), a method to train smaller LLMs to generate supplemental text that enhances the performance of larger, frozen LLMs on agentic tasks. SGT decouples task-specific optimization from the large foundation model, allowing for efficient adaptation to new tasks and domains. Experiments demonstrate that SGT improves performance by generating context-aware supplements, offering a more sustainable alternative to retraining or fine-tuning large models.
Forget fine-tuning behemoth LLMs for every new task – this paper shows how a tiny, nimble model generating smart supplements can unlock surprisingly strong agentic performance from frozen giants.
Training large foundation models for agentic tasks is increasingly impractical due to the high computational costs, long iteration cycles, and rapid obsolescence as new models are continuously released. Instead of post-training massive models for every new task or domain, we propose Supplement Generation Training (SGT), a more efficient and sustainable strategy. SGT trains a smaller LLM to generate useful supplemental text that, when appended to the original input, helps the larger LLM solve the task more effectively. These lightweight models can dynamically adapt supplements to task requirements, improving performance without modifying the underlying large models. This approach decouples task-specific optimization from large foundation models and enables more flexible, cost-effective deployment of LLM-powered agents in real-world applications.
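The inference-time flow described in the abstract (a small model writes a supplement, the supplement is appended to the original input, and the unchanged large model answers) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the `TextModel` interface, the function names, the blank-line separator, and the toy stand-in models are all assumptions made for the example, and the training procedure for the small model is not shown because the abstract does not specify it.

```python
from typing import Callable

# Hypothetical interface: any function mapping a prompt string to a completion string.
TextModel = Callable[[str], str]


def build_augmented_input(task_input: str, supplement: str) -> str:
    """Append the supplement to the original input.

    The blank-line separator is an assumption; the abstract only says
    the supplement is appended to the original input.
    """
    return f"{task_input}\n\n{supplement}"


def solve_with_supplement(
    task_input: str,
    supplement_model: TextModel,
    frozen_model: TextModel,
) -> str:
    """Run the two-stage pipeline; the large model's weights are never touched."""
    # Stage 1: the small, trainable model writes task-specific supplemental text.
    supplement = supplement_model(task_input)
    # Stage 2: the large, frozen model sees the original input plus the supplement.
    return frozen_model(build_augmented_input(task_input, supplement))


# Toy stand-ins so the sketch runs without any model backend (hypothetical).
def toy_supplement_model(prompt: str) -> str:
    return "Hint: break the task into steps before acting."


def toy_frozen_model(prompt: str) -> str:
    return f"[answer conditioned on {len(prompt)} characters of context]"


if __name__ == "__main__":
    print(solve_with_supplement("Book a table for two.", toy_supplement_model, toy_frozen_model))
```

Because the large model is only ever called through `frozen_model`, adapting to a new task in this sketch means swapping or retraining `supplement_model` alone, which mirrors the decoupling the abstract describes.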