Search papers, labs, and topics across Lattice.
1
4
3
1
Quantizing Vision Transformers to 4-bit precision no longer requires a painful trade-off between accuracy, speed, and memory, thanks to a new activation-first training method that's 100x faster.