Gabriele Carrino

Papers on Lattice

Total citations

Topics

h-index

Research focus

Reasoning & Chain-of-Thought (1) · RLHF & Preference Learning (1) · Training Efficiency & Optimization (1)

Frequent co-authors

Andrea Sassella (1) · Nicolò Brunello (1) · Nicolo Brunello (1) · Federico Toschi (1)

Papers (1)

Mar 19, 2026

Gabriele Carrino +6 · Mar 19, 2026 · also PoliMi

Are complicated loss functions necessary for teaching LLMs to reason?

Stripping away the complexity of GRPO reveals that simple REINFORCE with group relative advantage can actually *improve* LLM reasoning, challenging the assumption that sophisticated loss functions are always better.

Gabriele Carrino, Andrea Sassella, Nicolò Brunello +4

Reasoning & Chain-of-Thought · RLHF & Preference Learning · Training Efficiency & Optimization
