Rodrigue de Schaetzen

Papers on Lattice

Total citations

Topics

h-index

Frequent co-authors

Roger Girgis (1)Luke Rowe (1)Azal'ee Robitaille (1)Christopher Pal (1)Liam Paull (1)

Papers (1)

Feb 5, 2026

MilaFeb 5, 2026

Constrained Group Relative Policy Optimization

Mismatched standard deviations in multi-objective RL advantage estimation can completely break constrained learning, but a simple scalarization fixes it.

Roger Girgis, Rodrigue de Schaetzen, Luke Rowe +3

Search

Rodrigue de Schaetzen

Frequent co-authors

Papers (1)