Search papers, labs, and topics across Lattice.
This paper introduces an open-source framework for training long-sequence recommendation models using sliding window approaches, addressing the common perception that such training is impractical. They perform runtime-aware ablations to quantify the accuracy-compute trade-offs of different windowing strategies and introduce a k-shift embedding layer to handle million-scale vocabularies on commodity GPUs. The framework achieves competitive retrieval quality on benchmark datasets with reasonable training overheads, making long-sequence training more accessible to the research community.
Long-sequence recommendation models, previously considered impractical to train, can now be effectively trained on commodity hardware with minimal accuracy loss, thanks to a new open-source framework and a clever embedding layer.
Long interaction histories are central to modern recommender systems, yet training with long sequences is often dismissed as impractical under realistic memory and latency budgets. This work demonstrates that it is not only practical but also effective-at academic scale. We release a complete, end-to-end framework that implements industrial-style long-sequence training with sliding windows, including all data processing, training, and evaluation scripts. Beyond reproducing prior gains, we contribute two capabilities missing from earlier reports: (i) a runtime-aware ablation study that quantifies the accuracy-compute frontier across windowing regimes and strides, and (ii) a novel k-shift embedding layer that enables million-scale vocabularies on commodity GPUs with negligible accuracy loss. Our implementation trains reliably on modest university clusters while delivering competitive retrieval quality (e.g., up to +6.04% MRR and +6.34% Recall@10 on Retailrocket) with $\sim 4 \times $ training-time overheads. By packaging a robust pipeline, reporting training time costs, and introducing an embedding mechanism tailored for low-resource settings, we transform long-sequence training from a closed, industrial technique into a practical, open, and extensible methodology for the community.