Search papers, labs, and topics across Lattice.
2
0
5
2
Achieve up to 5.8x LLM inference speedup by decoupling causal dependency modeling from autoregressive draft execution in speculative decoding, sidestepping the usual trade-off between draft quality and drafting cost.
AgentDoG 1.5 proves you can achieve GPT-5.4-level agent safety with open-source models trained on just 1k samples, slashing deployment overhead by two orders of magnitude.