M. Hashemzadeh

Mila – Quebec AI Institute

…B-Instruct-2507. Here, AIME25 is highly sensitive to the overall calibration mix while both GSM…

Papers (1)

Apr 6, 2026

Mila · also AI Center, McGill

Merging experts in MoE LLMs can actually *improve* performance compared to pruning, offering a new path to compression that preserves capabilities.