Sander Land

Allen Institute for AI (AI2)

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (1)RLHF & Preference Learning (1)

Frequent co-authors

Saumya Malik (1)Valentina Pyatkin (1)Jacob Daniel Morrison (1)Noah A. Smith (1)

Papers (1)

Jun 2, 2025

AI2Jun 2, 2025·also UW

RewardBench 2: Advancing Reward Model Evaluation

RewardBench 2 exposes a stark reality check for reward models: they struggle significantly on new, human-generated prompts, yet this difficulty is surprisingly predictive of their actual usefulness in downstream tasks.

Saumya Malik, Valentina Pyatkin, Sander Land +453

Eval Frameworks & Benchmarks RLHF & Preference Learning

Search

Sander Land

Research focus

Frequent co-authors

Papers (1)