Search papers, labs, and topics across Lattice.
2
0
4
By strategically delaying tree expansion and dynamically selecting verification methods, OT-based speculative decoding can finally surpass Traversal Verification in throughput.
Verifiable LLM inference becomes practical: privacy-preserving techniques unlock verification at near-zero cost, outperforming ZKPs.