Search papers, labs, and topics across Lattice.
2
0
5
0
Kickstart MoE training by initializing experts with semantically meaningful subspaces, leading to faster specialization and better performance than standard upcycling techniques.
LG's EXAONE 4.5 shows that strategically curating training data, particularly document-centric corpora, unlocks substantial gains in specialized tasks like document understanding and Korean contextual reasoning, even while maintaining competitive general performance.