Search papers, labs, and topics across Lattice.
Huawei TSC
2
0
3
SPRI achieves a remarkable 3.39 BLEU point improvement over the best existing MoE upcycling method, demonstrating that pretrained weight structures can be effectively leveraged for better expert diversity.
Achieve >95% forget quality in LLMs with minimal side effects by isolating and unlearning tokens within target subdomains using asymmetric LoRA.