Tsinghua AI | Feb 22, 2026 | arXiv:2602.19111

Astra: Activation-Space Tail-Eigenvector Low-Rank Adaptation of Large Language Models

Kainan Liu, Ning Cheng, Yun Zhu, Yanmeng Wang, Shaojun Wang, Jing Xiao

AI Summary

The paper introduces Astra, a parameter-efficient fine-tuning (PEFT) method that constructs task-adaptive low-rank adapters by leveraging the tail eigenvectors of model output activations, estimated from a small calibration set. This approach addresses the under-exploitation of activation subspaces corresponding to tail eigenvectors in existing LoRA-based methods, leading to improved fine-tuning performance. Experiments on NLU and NLG tasks demonstrate that Astra outperforms existing PEFT baselines and even surpasses full fine-tuning in some cases, while using a reduced parameter budget.

Key Contribution

By constraining updates to the subspace spanned by activation-space tail eigenvectors, Astra improves fine-tuning performance with fewer trainable parameters and surpasses full fine-tuning in some cases.

Abstract

Parameter-Efficient Fine-Tuning (PEFT) methods, especially LoRA, are widely used for adapting pre-trained models to downstream tasks due to their computational and storage efficiency. However, in the context of LoRA and its variants, the potential of activation subspaces corresponding to tail eigenvectors remains substantially under-exploited, which may lead to suboptimal fine-tuning performance. In this work, we propose Astra (Activation-Space Tail-Eigenvector Low-Rank Adaptation), a novel PEFT method that leverages the tail eigenvectors of the model output activations (estimated from a small task-specific calibration set) to construct task-adaptive low-rank adapters. By constraining updates to the subspace spanned by these tail eigenvectors, Astra achieves faster convergence and improved downstream performance with a significantly reduced parameter budget. Extensive experiments on natural language understanding (NLU) and natural language generation (NLG) tasks demonstrate that Astra consistently outperforms existing PEFT baselines across 16 benchmarks and even surpasses full fine-tuning (FFT) in certain scenarios.
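
The abstract does not spell out the exact construction, so the following is only a minimal NumPy sketch of the general idea, under stated assumptions: output activations of a single linear layer are collected on a calibration set, the eigenvectors of their uncentered second-moment matrix with the smallest eigenvalues are taken as the "tail" basis, and the weight update is restricted to the span of that basis. The names `tail_eigenvectors`, `U_tail`, `A`, and `adapted_forward`, the random stand-in data, and the zero initialization are all hypothetical and not taken from the paper.

```python
import numpy as np

def tail_eigenvectors(activations, rank):
    """Return the `rank` eigenvectors with the smallest eigenvalues.

    activations: array of shape (num_samples, d_out).
    Uses the uncentered second-moment matrix (an assumption of this sketch).
    """
    second_moment = activations.T @ activations / activations.shape[0]
    # eigh returns eigenvalues in ascending order for symmetric matrices,
    # so the first `rank` columns are the tail eigenvectors.
    _, eigvecs = np.linalg.eigh(second_moment)
    return eigvecs[:, :rank]  # shape (d_out, rank), orthonormal columns

# Toy stand-ins for a pretrained layer and a calibration set (hypothetical).
rng = np.random.default_rng(0)
d_in, d_out, rank, n_calib = 16, 12, 4, 256
W = rng.standard_normal((d_out, d_in))          # frozen pretrained weight
X_calib = rng.standard_normal((n_calib, d_in))  # calibration inputs
Y_calib = X_calib @ W.T                         # output activations

U_tail = tail_eigenvectors(Y_calib, rank)       # fixed basis for the update
A = np.zeros((rank, d_in))                      # trainable factor, zero init

def adapted_forward(x, A):
    """Forward pass with the update U_tail @ A added to the frozen weight."""
    return x @ (W + U_tail @ A).T
```

In this sketch only `A` would be trained, which amounts to `rank * d_in` parameters per layer, and any update `U_tail @ A` lies in the column space of `U_tail` by construction. With `A` initialized to zero, the adapted layer starts out identical to the pretrained one.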

Architecture Design (Transformers, SSMs, MoE) | Natural Language Processing | Training Efficiency & Optimization

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
