Search papers, labs, and topics across Lattice.
Chinese Academy of Sciences,Institute of Information Engineering,Beijing,China
1
0
3
1
Autoregressive generation bottlenecks be gone: a dual-decoder architecture achieves up to 1.6x faster inference without sacrificing quality.