City University of Hong Kong, Hong Kong, China
2
0
6
Explicitly aligning MoE routing behavior during fine-tuning can significantly boost performance on multilingual tasks, especially when the model understands the task in English but struggles in the target language.
Turns out, MLLMs struggle with manufacturing tasks not because they can't "see," but because they lack the domain-specific knowledge to understand what they're looking at.