Search papers, labs, and topics across Lattice.
2
0
6
1
Squeeze 34% more decode speed out of your MoE model without sacrificing accuracy by intelligently budgeting expert activations.
Autonomous exploration by an LLM agent dramatically outperforms both rigid retrieval workflows and supervised fine-tuning for temporal knowledge graph question answering, achieving state-of-the-art results in a zero-shot setting.