Search papers, labs, and topics across Lattice.
ByteDance {zhs, ylliu, xbai}@hust.edu.cn, jingquntang@bytedance.com https://github.com/CIawevy/TextPecker
2
0
5
2
SVD-Attention slashes the quadratic cost of attention to linear for recommendation tasks by exploiting the inherent low-rank structure of user behavior sequences, without sacrificing softmax.
FlashEvaluator slashes the computational cost of evaluating multiple sequences in Generator-Evaluator frameworks while boosting accuracy by enabling direct cross-sequence comparisons.