Brandeis University, Waltham, MA, USA
Achieve near-optimal DLRM inference speedups across diverse hardware (NVIDIA, AMD, TPU) with a single optimization pass, eliminating the need for vendor-specific tuning.