Search papers, labs, and topics across Lattice.
Mila – Quebec AI Institute
1
0
3
4
Language specialization in multilingual MoEs happens mostly in the final layers, suggesting a surprisingly simple recipe for parameter-efficient adaptation.