Search papers, labs, and topics across Lattice.
Waseda University
2
0
4
Reward-guided optimization can be selectively applied to enhance generative recommendation performance, avoiding the pitfalls of uniform reinforcement learning.
GenRec proves that generative recommendation can beat existing pipelines in a large-scale industrial setting, achieving nearly 10% gains in key metrics by focusing on preference alignment and efficient sequence encoding.