Search papers, labs, and topics across Lattice.
SOLAR is introduced as a post-training compression framework to reduce the communication cost of parameter-efficient fine-tuning (PEFT) adapters. It expresses PEFT updates as a linear combination of basis vectors derived from the foundation model's singular vectors with controlled perturbations, exploiting subspace similarity. Experiments on language and vision tasks demonstrate that SOLAR preserves task performance while significantly reducing model representation sizes, making it suitable for resource-constrained environments.
SOLAR shrinks PEFT adapter sizes without sacrificing performance by cleverly aligning adapter updates with the foundation model's intrinsic subspace.
Parameter-efficient fine-tuning (PEFT) methods, such as LoRA, enable scalable adaptation of foundation models by injecting low-rank adapters. However, their communication and storage costs remain a major bottleneck in resource-constrained settings. We propose SOLAR (Subspace-Oriented Latent Adapter Reparameterization), a post-training compression framework that substantially reduces the communication cost (i.e., the number of parameters to transmit or store) of PEFT adapters. SOLAR expresses each PEFT update as a linear combination of basis vectors formed from the foundation model's singular vectors with controlled random perturbations. By exploiting the subspace similarity (the alignment of principal directions) between the foundation model and task-specific fine-tuned updates, SOLAR decouples the adapter size from PEFT structure and ensures compact yet expressive representations. It is model-agnostic and compatible with existing PEFT methods, including LoRA, AdaLoRA, and other adapter modules. We theoretically establish a bound on the reconstruction error. Experiments on language and vision tasks using LLaMA, GPT, and ViT models demonstrate that SOLAR preserves task performance while significantly reducing model representation sizes, offering an effective and communication-efficient solution for deployment in distributed systems and edge devices.