Tiny Aya, a 3.35B parameter multilingual language model, was trained on 70 languages and refined with region-aware post-training. It achieves state-of-the-art translation quality, strong multilingual understanding, and high-quality target-language generation. The release includes a pretrained model, an instruction-tuned variant, and three region-specialized models, showcasing a scaling path focused on efficiency and balanced performance.
Forget brute-force scaling: Tiny Aya proves a 3B parameter model can achieve state-of-the-art multilingual performance with clever training and region-aware specialization.
Tiny Aya redefines what a small multilingual language model can achieve. Trained on 70 languages and refined through region-aware post-training, it delivers state-of-the-art translation quality, strong multilingual understanding, and high-quality target-language generation, all with just 3.35B parameters. The release includes a pretrained foundation model, a globally balanced instruction-tuned variant, and three region-specialized models targeting languages from Africa, South Asia, Europe, Asia-Pacific, and West Asia. This report details the training strategy, data composition, and comprehensive evaluation framework behind Tiny Aya, and presents an alternative scaling path for multilingual AI: one centered on efficiency, balanced performance across languages, and practical deployment.
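To make the release concrete, here is a minimal sketch of how an instruction-tuned checkpoint of this kind might be loaded and prompted with Hugging Face transformers. The model identifier and the use of a chat template are assumptions for illustration, not details confirmed by the report.

```python
# Minimal usage sketch for an instruction-tuned Tiny Aya checkpoint.
# NOTE: the model ID below is hypothetical; substitute the actual Hub path.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "CohereLabs/tiny-aya-instruct"  # hypothetical identifier

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# A translation-style request, formatted with the tokenizer's chat template
# (assuming the instruction-tuned variant ships one).
messages = [
    {"role": "user", "content": "Translate to Swahili: The weather is nice today."}
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)

# Generate and decode only the newly produced tokens.
output_ids = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```

The same pattern would apply to the region-specialized variants, swapping in the corresponding checkpoint name.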