Search papers, labs, and topics across Lattice.
1
0
2
SignSGD can outperform SGD in linear regression when noise dominates, thanks to a unique "noise-reshaping" effect that steepens its compute-optimal scaling law.