University of Glasgow, Glasgow, U.K.
Achieve near-optimal DLRM inference speedups across diverse hardware (NVIDIA, AMD, TPU) with a single optimization pass, eliminating the need for vendor-specific tuning.