Search papers, labs, and topics across Lattice.
1
0
3
Integer-only attention is now a viable alternative to floating-point, delivering up to 8.69x speedups and 18.8% energy reduction on Vision Transformers.