Akira Kawabata

The Graduate University for Advanced Studies (SOKENDAI), National Institute of Informatics, The Asahi Shimbun Company

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Constitutional AI & AI Ethics (1)RLHF & Preference Learning (1)Scalable Oversight & Alignment Theory (1)

Frequent co-authors

Saku Sugawara (1)

Papers (1)

Apr 15, 2026

The Graduate University for Advanced2w ago·also NII, The Asahi Shimbun Company

C2: Scalable Rubric-Augmented Reward Modeling from Binary Preferences

Reward models can achieve state-of-the-art performance by critically collaborating with a rubric generator trained solely from binary preferences, eliminating the need for costly rubric annotations.

Akira Kawabata, Saku Sugawara

Constitutional AI & AI Ethics RLHF & Preference Learning Scalable Oversight & Alignment Theory

Search

Akira Kawabata

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (1)