Search papers, labs, and topics across Lattice.
University of Edinburgh
1
0
3
14
Language specialization in multilingual MoEs happens mostly in the final layers, suggesting a surprisingly simple recipe for parameter-efficient adaptation.