UHApr 9, 2026arXiv:2604.08335

Dead Weights, Live Signals: Feedforward Graphs of Frozen Language Models

M. Armstrong, Marcus Armstrong, Navid Ayoobi, Arjun Mukherjee

AI Summary

This paper introduces a feedforward graph architecture composed of multiple frozen LLMs communicating through a shared latent space via learned linear projections. By optimizing projection matrices via backpropagation through residual stream injection hooks, the architecture effectively aggregates knowledge from smaller models (Llama-3.2-1B, Qwen2.5-1.5B, Gemma-2-2B) to enhance the performance of larger models (Phi-3-mini, Mistral-7B). The resulting model, with only 17.6M trainable parameters, outperforms its constituent models on ARC-Challenge, OpenBookQA, and MMLU, demonstrating the viability of knowledge aggregation through frozen LLM graphs.

Key Contribution

Forget fine-tuning: unlock surprisingly strong performance by stitching together frozen LLMs with lightweight, trainable projections that route information between models.

Abstract

We present a feedforward graph architecture in which heterogeneous frozen large language models serve as computational nodes, communicating through a shared continuous latent space via learned linear projections. Building on recent work demonstrating geometric compatibility between independently trained LLM latent spaces~\cite{armstrong2026thinking}, we extend this finding from static two-model steering to end-to-end trainable multi-node graphs, where projection matrices are optimized jointly via backpropagation through residual stream injection hooks. Three small frozen models (Llama-3.2-1B, Qwen2.5-1.5B, Gemma-2-2B) encode the input into a shared latent space whose aggregate signal is injected into two larger frozen models (Phi-3-mini, Mistral-7B), whose representations feed a lightweight cross-attention output node. With only 17.6M trainable parameters against approximately 12B frozen, the architecture achieves 87.3\% on ARC-Challenge, 82.8\% on OpenBookQA, and 67.2\% on MMLU, outperforming the best single constituent model by 11.4, 6.2, and 1.2 percentage points respectively, and outperforming parameter-matched learned classifiers on frozen single models by 9.1, 5.2, and 6.7 points. Gradient flow through multiple frozen model boundaries is empirically verified to be tractable, and the output node develops selective routing behavior across layer-2 nodes without explicit supervision.

Architecture Design (Transformers, SSMs, MoE)Natural Language Processing Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References21

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Dead Weights, Live Signals: Feedforward Graphs of Frozen Language Models

Related Papers