Artificial Intelligence Laboratory | SJTU | The Aberdeen NLP Research Group | Tongji
Jun 1, 2026
arXiv:2606.02437

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Mind Lab, Song Cao, Vic Cao, Kaijie Chen, Bunny Fan, Hera Feng, Huan Feng, Arthur Fu, Hongquan Gu, Aaron Guan, Mutian Hong, Hailee Hou, Peixuan Hua, Charles Huang, Miles Jiang, Nora Jiang, Yuyi Jiang, Autumn Jin, Fancy Kong, Kyrie Lei, Alexy Li, Dawn Li, Ray Li, Theo Li, Wenhao Li, Jiayi Lin, Domini Liu, Heshan Liu, Kairus Liu, Logan Liu, Maeve Luo, Runism Lv, Pony Ma, Verity Niu, Anson Qiu, Vincent Wang, Maxwell Yao, Regis Ye, Wenlin Ye, Yanying Ye, Josh Ying, Danney Zeng, Salmon Zhan, Anya Zhang, Ruijia Zhang, Shiyang Zhang, Sueky Zhang, Ya Zhang, Wei Zhao, Ada Zhou, Sizer Zhou, Xinyue Zhu, Murphy Zhuang

AI Summary

This paper examines parameter-efficient fine-tuning (PEFT) as a framework for persistent personal models: small trainable adapters that sit on top of strong shared foundation models. The authors organize the problem around three scaling axes. Scale Up asks how stronger shared priors make small local updates more useful, Scale Down asks how small an adapter can be while remaining reliable, and Scale Out asks how many persistent adapted instances can coexist. The results suggest that PEFT can serve as a compact substrate for millions of personal models built on trillion-parameter foundation models, rather than only as a cost-effective alternative to full fine-tuning.

Key Contribution

PEFT can enable the creation of millions of personalized models, each with unique adaptations, leveraging the power of trillion-parameter foundation models.

Abstract

Parameter-efficient fine-tuning (PEFT) is usually treated as a cheaper alternative to full fine-tuning. We study a broader role: small trainable adapters as persistent local state on top of strong shared foundation models. In this framing, the base model provides shared competence while adapters carry instance-specific behavior such as preferences, skills, tool habits, and memory-like updates. We organize the problem around three scaling axes: Scale Up, where stronger shared priors make small local updates more useful; Scale Down, where we study how small adapters can be while remaining reliable; and Scale Out, where many persistent adapted instances coexist. MinT provides one infrastructure example for managing adapter identity, revision, provenance, evaluation, and serving residency. Together, the results suggest that PEFT can be a compact substrate for persistent personal models rather than only a budget substitute for full fine-tuning.
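The "shared base plus persistent local state" framing can be made concrete with a small bookkeeping sketch. It is illustrative only and does not come from the paper: it assumes LoRA-style low-rank adapters (the abstract does not name an adapter type), and `AdapterSpec`, `AdapterRegistry`, and every number used are hypothetical. It is not MinT's interface. It only shows how adapter identity and revision might be tracked, and how the adapter parameter count compares with the shared base.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple


@dataclass(frozen=True)
class AdapterSpec:
    """Hypothetical description of one low-rank adapter (an assumption, not the paper's API)."""
    owner: str       # identity of the personal instance that owns the adapter
    revision: int    # version number of this owner's adapter
    rank: int        # low-rank dimension r
    d_in: int        # input width of each adapted weight matrix
    d_out: int       # output width of each adapted weight matrix
    n_matrices: int  # number of weight matrices that receive an adapter

    def param_count(self) -> int:
        # A rank-r update B @ A (B: d_out x r, A: r x d_in) has r * (d_in + d_out) parameters.
        return self.n_matrices * self.rank * (self.d_in + self.d_out)


@dataclass
class AdapterRegistry:
    """Hypothetical registry: one shared base model, many adapters keyed by (owner, revision)."""
    base_params: int
    adapters: Dict[Tuple[str, int], AdapterSpec] = field(default_factory=dict)

    def register(self, spec: AdapterSpec) -> None:
        key = (spec.owner, spec.revision)
        if key in self.adapters:
            raise ValueError(f"revision already registered: {key}")
        self.adapters[key] = spec

    def latest(self, owner: str) -> AdapterSpec:
        # Return the highest revision registered for this owner.
        owned = [s for (o, _), s in self.adapters.items() if o == owner]
        if not owned:
            raise KeyError(owner)
        return max(owned, key=lambda s: s.revision)

    def adapter_params(self) -> int:
        # Total parameters stored across all adapters, excluding the shared base.
        return sum(s.param_count() for s in self.adapters.values())


# Illustrative numbers only: a 1-trillion-parameter base and rank-8 adapters
# on 64 square 4096 x 4096 matrices.
registry = AdapterRegistry(base_params=10**12)
registry.register(AdapterSpec("user-0", 1, rank=8, d_in=4096, d_out=4096, n_matrices=64))
registry.register(AdapterSpec("user-0", 2, rank=8, d_in=4096, d_out=4096, n_matrices=64))
registry.register(AdapterSpec("user-1", 1, rank=8, d_in=4096, d_out=4096, n_matrices=64))

per_adapter = registry.latest("user-0").param_count()
print(per_adapter, per_adapter / registry.base_params)
```

Under these assumed sizes each adapter holds about 4.2 million parameters, a few millionths of the base, which is why the base can be shared while only the adapters are stored per instance.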

Topics: Scaling Laws & Emergent Abilities; Training Efficiency & Optimization

Year: 2026

Venue: N/A
