Roxana Petcu

IRLab, University of Amsterdam

Topics

Research focus

Reasoning & Chain-of-Thought (1), Recommendation & Information Retrieval (1), Tool Use & Agents (1)

Frequent co-authors

Evangelos Kanoulas (1), Maarten de Rijke (1)

Papers (1)

Apr 8, 2026

IRLab · Apr 8, 2026 · also UvA

SubSearch: Intermediate Rewards for Unsupervised Guided Reasoning in Complex Retrieval

SubSearch makes LLM reasoning more robust without expensive human annotations: it assigns intrinsic rewards directly to intermediate reasoning steps and outperforms outcome-only supervision.

Roxana Petcu, Evangelos Kanoulas, Maarten de Rijke

Reasoning & Chain-of-Thought, Recommendation & Information Retrieval, Tool Use & Agents