ETHUppsalaMay 21, 2026arXiv:2605.22743

SeqLoRA: Bilevel Orthogonal Adaptation for Continual Multi-Concept Generation

Javad Parsa, Enis Simsar, Amir Joudaki, Thomas Hofmann, André M. H. Teixeira

AI Summary

The paper introduces SeqLoRA, a bilevel optimization framework for parameter-efficient fine-tuning of text-to-image diffusion models to compose multiple custom concepts. SeqLoRA jointly optimizes LoRA factors with orthogonal regularization to mitigate representation interference and catastrophic forgetting during continual learning of new concepts. Experiments show SeqLoRA improves identity preservation and scalability across a large number of concepts compared to existing methods, while also providing theoretical convergence and forgetting bounds.

Key Contribution

Forget about messy concept soups – SeqLoRA lets you teach your diffusion model 100+ new tricks without them blurring together.

Abstract

Parameter-efficient fine-tuning enables fast personalization of text-to-image diffusion models, but composing multiple custom concepts remains challenging due to representation interference. Existing modular methods either rely on expensive post-hoc fusion or freeze adaptation subspaces, which limit expressiveness and concept fidelity. To address this trade-off, we propose Sequential regularized LoRA (SeqLoRA), a constrained continual learning framework that jointly optimizes both LoRA factors via bilevel optimization. Theoretically, we establish strong convergence guarantees for our algorithm and model the residual layer activations as a matrix sub-Gaussian process to derive high-probability bounds on catastrophic forgetting. We further prove that learning the LoRA basis from data minimizes residual interference energy more effectively than frozen-basis methods. Experiments on multi-concept image generation demonstrate that SeqLoRA improves identity preservation and scalability across up to 101 concepts, while avoiding costly fusion and reducing attribute interference in composed generations.

Computer Vision Multimodal Models Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

SeqLoRA: Bilevel Orthogonal Adaptation for Continual Multi-Concept Generation

Related Papers