Search papers, labs, and topics across Lattice.
The paper introduces Astra, a parameter-efficient fine-tuning (PEFT) method that constructs task-adaptive low-rank adapters by leveraging the tail eigenvectors of model output activations, estimated from a small calibration set. This approach addresses the under-exploitation of activation subspaces corresponding to tail eigenvectors in existing LoRA-based methods, leading to improved fine-tuning performance. Experiments on NLU and NLG tasks demonstrate that Astra outperforms existing PEFT baselines and even surpasses full fine-tuning in some cases, while using a reduced parameter budget.
By focusing on activation-space tail eigenvectors, Astra unlocks better fine-tuning with fewer parameters, even beating full fine-tuning in some cases.
Parameter-Efficient Fine-Tuning (PEFT) methods, especially LoRA, are widely used for adapting pre-trained models to downstream tasks due to their computational and storage efficiency. However, in the context of LoRA and its variants, the potential of activation subspaces corresponding to tail eigenvectors remains substantially under-exploited, which may lead to suboptimal fine-tuning performance. In this work, we propose Astra (Activation-Space Tail-Eigenvector Low-Rank Adaptation), a novel PEFT method that leverages the tail eigenvectors of the model output activations-estimated from a small task-specific calibration set-to construct task-adaptive low-rank adapters. By constraining updates to the subspace spanned by these tail eigenvectors, Astra achieves faster convergence and improved downstream performance with a significantly reduced parameter budget. Extensive experiments across natural language understanding (NLU) and natural language generation (NLG) tasks demonstrate that Astra consistently outperforms existing PEFT baselines across 16 benchmarks and even surpasses full fine-tuning (FFT) in certain scenarios.