Search papers, labs, and topics across Lattice.
2
0
5
3
LLMs can generate recommendations up to 3.1x faster by explicitly modeling token position within items and speculation depth during speculative decoding.
LLMs can master auto-bidding in dynamic ad environments, but only if you give them a hierarchical architecture and offline RL fine-tuning to avoid hallucinating suboptimal decisions.