Stop wasting compute: a learned policy can allocate LLM inference budgets, boosting accuracy by up to 12.8% over uniform allocation.