Search papers, labs, and topics across Lattice.
Mila – Quebec AI Institute, B-Instruct-2507. Here, AIME25 is highly sensitive to the overall calibration mix while both GSM
Mila1
0
2
4
Merging experts in MoE LLMs can actually *improve* performance compared to pruning, offering a new path to compression that preserves capabilities.