The paper introduces Semantics-Aware Implicit Denoising (SAID), a framework that reweights noisy implicit feedback in recommender systems according to the semantic consistency between user interest profiles and item content, as encoded by pre-trained language models (PLMs). SAID uses PLM-based text encoders to compute semantic similarity scores between user interest profiles (derived from historical behaviors) and target item descriptions, then uses these scores to modulate the training loss, downweighting semantically inconsistent clicks. Experiments on two real-world datasets show that SAID improves recommendation performance by up to 2.2% relative AUC and remains robust under high noise conditions.
Combat noisy click data in recommender systems by reweighting training samples according to the semantic similarity between user interest profiles and item descriptions, a loss-only change that yields up to 2.2% relative AUC improvement and holds up under high noise.
Implicit feedback, such as user clicks, serves as the primary data source for modern recommender systems. However, click interactions inherently contain substantial noise, including accidental clicks, clickbait-induced interactions, and exploratory browsing behaviors that do not reflect genuine user preferences. Training recommendation models with such noisy positive samples leads to degraded prediction accuracy and unreliable recommendations. In this paper, we propose SAID (Semantics-Aware Implicit Denoising), a simple yet effective framework that leverages semantic consistency between user interests and item content to identify and downweight potentially noisy interactions. Our approach constructs textual user interest profiles from historical behaviors and computes semantic similarity with target item descriptions using pre-trained language model (PLM) based text encoders. The similarity scores are then transformed into sample weights that modulate the training loss, effectively reducing the impact of semantically inconsistent clicks. Unlike existing denoising methods that require complex auxiliary networks or multi-stage training procedures, SAID only modifies the loss function while keeping the backbone recommendation model unchanged. Extensive experiments on two real-world datasets demonstrate that SAID consistently improves recommendation performance, achieving up to 2.2% relative improvement in AUC over strong baselines, with particularly notable robustness under high noise conditions.
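The reweighting idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify how similarity scores are transformed into weights, so the linear mapping, the `w_min` floor, the choice to reweight only clicked (positive) samples, the binary cross-entropy loss, and all function names below are assumptions made for illustration. The user and item embeddings are assumed to be precomputed by some PLM-based text encoder.

```python
import numpy as np

def cosine_similarity(a, b, eps=1e-8):
    """Row-wise cosine similarity between two (n, d) embedding matrices."""
    a_norm = a / (np.linalg.norm(a, axis=1, keepdims=True) + eps)
    b_norm = b / (np.linalg.norm(b, axis=1, keepdims=True) + eps)
    return np.sum(a_norm * b_norm, axis=1)

def similarity_to_weight(sim, w_min=0.2):
    """Map similarity in [-1, 1] linearly to a weight in [w_min, 1].

    Assumption: the source does not give the transformation; a linear
    map with a floor is one simple choice.
    """
    return w_min + (1.0 - w_min) * (sim + 1.0) / 2.0

def semantic_weighted_bce(labels, probs, user_emb, item_emb, w_min=0.2, eps=1e-7):
    """Binary cross-entropy with similarity-based weights on clicked samples.

    labels:   (n,) array of 0/1 click labels
    probs:    (n,) predicted click probabilities from any backbone model
    user_emb: (n, d) text embeddings of user interest profiles
    item_emb: (n, d) text embeddings of target item descriptions
    """
    probs = np.clip(probs, eps, 1.0 - eps)
    sim = cosine_similarity(user_emb, item_emb)
    # Assumption: only positives (clicks) are reweighted; negatives keep weight 1.
    weights = np.where(labels == 1, similarity_to_weight(sim, w_min), 1.0)
    per_sample = -(labels * np.log(probs) + (1 - labels) * np.log(1.0 - probs))
    return float(np.mean(weights * per_sample))
```

Because the weights enter only through the loss, the backbone model that produces `probs` is left unchanged, which matches the loss-only modification the abstract describes. A click whose item description is semantically far from the user's profile contributes less to the gradient than one that is close.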