Search papers, labs, and topics across Lattice.
Chinese Academy of Sciences,Institute of Information Engineering,Beijing,China
1
0
3
2
Autoregressive generation bottlenecks be gone: a dual-decoder architecture achieves up to 1.6x faster inference without sacrificing quality.