Search papers, labs, and topics across Lattice.
George Mason University
1
0
3
C$^3$ache reveals that reusing residuals across inference chunks can dramatically speed up World Action Models, achieving a 2.5x reduction in inference time with minimal impact on performance.