Xiaoying Ling

Zhipu AI

Tsinghua AI

Research focus

Eval Frameworks & Benchmarks (1) · RLHF & Preference Learning (1)

Frequent co-authors

Bosi Wen (1) · Yilin Niu (1)

Papers (1)

Mar 5, 2026

Tsinghua AI · Mar 5, 2026 · also Westlake, Zhipu

IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation

Current judge models for instruction-following are surprisingly unreliable, but a new benchmark exposes their flaws and offers a path to better alignment.

Bosi Wen, Yilin Niu +9

Eval Frameworks & Benchmarks · RLHF & Preference Learning