Mar 18, 2026 · arXiv:2603.17823

Discovering Decoupled Functional Modules in Large Language Models

Yanke Yu, Jin Li, Ying Sun, Ping Li, Zhefeng Wang, Yibo Zheng

AI Summary

This paper introduces ULCMOD, an unsupervised framework for discovering functional modules within LLMs by simultaneously partitioning neurons into modules and identifying the input topics related to each module. The method uses a novel objective function and an Iterative Decoupling (IterD) algorithm to achieve this. Experiments show that ULCMOD discovers high-quality, disentangled modules that are semantically coherent, have interpretable specializations, show a clear spatial and hierarchical organization, and achieve superior performance on downstream tasks.

Key Contribution

LLMs aren't monolithic black boxes: they contain spatially organized, functionally specialized modules that can be automatically discovered.

Abstract

Understanding the internal functional organization of Large Language Models (LLMs) is crucial for improving their trustworthiness and performance. However, how LLMs organize different functions into modules remains largely unexplored. To bridge this gap, we formulate a functional module discovery problem and propose an Unsupervised LLM Cross-layer MOdule Discovery (ULCMOD) framework that simultaneously disentangles the large set of neurons in the entire LLM into modules while discovering the topics of input samples related to these modules. Our framework introduces a novel objective function and an efficient Iterative Decoupling (IterD) algorithm. Extensive experiments show that our method discovers high-quality, disentangled modules that capture more meaningful semantic information and achieve superior performance in various downstream tasks. Moreover, our qualitative analysis reveals that the discovered modules show semantic coherence, correspond to interpretable specializations, and exhibit a clear spatial and hierarchical organization within the LLM. Our work provides a novel tool for interpreting the functional modules of LLMs, filling a critical gap in LLM interpretability research.
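
To make the joint problem in the abstract concrete (grouping neurons into modules while grouping input samples into topics), the following is a minimal illustrative sketch. It is not the ULCMOD objective or the IterD algorithm, neither of which is specified on this page. It is a generic alternating co-clustering heuristic run on a synthetic activation matrix, and every name in it (`make_toy_activations`, `farthest_point_seeds`, `co_cluster`) is hypothetical.

```python
import numpy as np

def make_toy_activations(n_topics=3, samples_per_topic=10,
                         neurons_per_module=4, noise=0.05, seed=0):
    """Synthetic (samples x neurons) matrix with planted topic/module blocks."""
    rng = np.random.default_rng(seed)
    true_topic = rng.permutation(np.repeat(np.arange(n_topics), samples_per_topic))
    true_module = rng.permutation(np.repeat(np.arange(n_topics), neurons_per_module))
    # A neuron fires strongly only for samples whose topic matches its module.
    acts = (true_topic[:, None] == true_module[None, :]).astype(float)
    acts += rng.normal(0.0, noise, size=acts.shape)
    return acts, true_topic, true_module

def farthest_point_seeds(acts, k):
    """Pick k mutually distant samples to initialise the topic centroids."""
    seeds = [0]
    dist = np.linalg.norm(acts - acts[0], axis=1)
    for _ in range(k - 1):
        nxt = int(np.argmax(dist))
        seeds.append(nxt)
        dist = np.minimum(dist, np.linalg.norm(acts - acts[nxt], axis=1))
    return seeds

def co_cluster(acts, k, n_iters=10):
    """Alternate between assigning neurons to modules and samples to topics."""
    centroids = acts[farthest_point_seeds(acts, k)]   # shape (k, n_neurons)
    for _ in range(n_iters):
        # Neuron step: each neuron joins the topic that activates it most.
        modules = centroids.argmax(axis=0)
        # Sample step: each sample joins the module it activates most on average.
        module_means = np.full((acts.shape[0], k), -np.inf)
        for j in range(k):
            mask = modules == j
            if mask.any():                            # skip empty modules
                module_means[:, j] = acts[:, mask].mean(axis=1)
        topics = module_means.argmax(axis=1)
        # Recompute topic centroids; keep the old centroid if a topic is empty.
        new_centroids = centroids.copy()
        for j in range(k):
            mask = topics == j
            if mask.any():
                new_centroids[j] = acts[mask].mean(axis=0)
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return modules, topics

acts, true_topic, true_module = make_toy_activations()
modules, topics = co_cluster(acts, k=3)
```

On this toy data the planted blocks are well separated, so the alternation settles after one or two passes. Real LLM activations would need a proper objective and a far more scalable procedure, which is what the paper's framework is described as providing.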

Architecture Design (Transformers, SSMs, MoE) · Interpretability & Mechanistic Interp

Citation Metrics

Citations: 0

Influential citations: 0

References: 42

Year: 2026

Venue: N/A
