Search papers, labs, and topics across Lattice.
2
0
6
A lightweight 6B model, when harnessed within the GEMS agent framework, leapfrogs state-of-the-art models in multimodal generation, suggesting architectural innovations in agents can compensate for raw parameter count.
Forget expensive MoE training from scratch: ExpertWeaver unlocks inherent MoE structure within dense LLMs using GLU activation patterns, offering a training-free conversion.