Apr 9, 2026arXiv:2604.07788

PeReGrINE: Evaluating Personalized Review Fidelity with User Item Graph Context

AI Summary

The paper introduces PeReGrINE, a new benchmark for evaluating personalized review generation by structuring Amazon Reviews 2023 into a temporal user-item graph and conditioning review generation on bounded evidence from user history, item context, and neighborhood interactions. A key component is the "User Style Parameter," which summarizes a user's linguistic and affective tendencies from past reviews to represent persistent preferences. Experiments using graph-derived retrieval settings (product-only, user-only, neighbor-only, combined) and a novel "Dissonance Analysis" framework demonstrate how evidence composition impacts review fidelity, personalization, and grounding in retrieval-conditioned language models.

Key Contribution

Forget simply generating reviews; PeReGrINE shows how to generate *personalized* reviews by encoding user style from historical data and grounding generation in a user-item graph, revealing the nuanced impact of different evidence types on review fidelity.

Abstract

We introduce PeReGrINE, a benchmark and evaluation framework for personalized review generation grounded in graph-structured user--item evidence. PeReGrINE restructures Amazon Reviews 2023 into a temporally consistent bipartite graph, where each target review is conditioned on bounded evidence from user history, item context, and neighborhood interactions under explicit temporal cutoffs. To represent persistent user preferences without conditioning directly on sparse raw histories, we compute a User Style Parameter that summarizes each user's linguistic and affective tendencies over prior reviews. This setup supports controlled comparison of four graph-derived retrieval settings: product-only, user-only, neighbor-only, and combined evidence. Beyond standard generation metrics, we introduce Dissonance Analysis, a macro-level evaluation framework that measures deviation from expected user style and product-level consensus. We also study visual evidence as an auxiliary context source and find that it can improve textual quality in some settings, while graph-derived evidence remains the main driver of personalization and consistency. Across product categories, PeReGrINE offers a reproducible way to study how evidence composition affects review fidelity, personalization, and grounding in retrieval-conditioned language models.

Eval Frameworks & Benchmarks Natural Language Processing Recommendation & Information Retrieval

Citation Metrics

Citations0

Influential citations0

References19

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

PeReGrINE: Evaluating Personalized Review Fidelity with User Item Graph Context

Related Papers