Search papers, labs, and topics across Lattice.
2
0
6
Cut LLM cold starts from minutes to seconds by pre-materializing CUDA graph execution contexts, sidestepping brittle kernel patching and heavyweight checkpointing.
VLMs can be transformed into pixel-precise structural document parsing experts, achieving state-of-the-art OCR performance by enforcing syntactic validity and structural integrity through reinforcement learning.