The University of Hong Kong
Achieve better compression in low-bit quantization by considering not just numerical sensitivity, but also the structural role of each layer.
Autoregressive video models can now generate 4-minute videos without retraining, thanks to a clever inference-time hack that fixes positional embedding bias and injects dynamic priors.