Bosi Wen

The Conversational Artificial Intelligence (CoAI) Group, Tsinghua University

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (1)RLHF & Preference Learning (1)Code Generation & Program Synthesis (1)Tool Use & Agents (1)

Frequent co-authors

Xiaoying Ling (2)Hongning Wang (2)Bosi Wen (1)Yilin Niu (1)

Papers (2)

Mar 5, 2026

Tsinghua AIMar 5, 2026·also Westlake, Zhipu

IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation

Current judge models for instruction-following are surprisingly unreliable, but a new benchmark exposes their flaws and offers a path to better alignment.

Bosi Wen, Bosi Wen, Yilin Niu +9

Eval Frameworks & Benchmarks RLHF & Preference Learning

Feb 17, 2026

Feb 17, 2026·also Tsinghua AI, Ant Group, Chengdu Minto Tech, HKUST +7

GLM-5: from Vibe Coding to Agentic Engineering

GLM-5 doesn't just code; it engineers, showcasing unprecedented capability in tackling end-to-end software engineering challenges.

GLM-5 Team, Qinkai Zheng, Da Yin +128

Code Generation & Program Synthesis Tool Use & Agents Training Efficiency & Optimization

Search

Bosi Wen

Research focus

Frequent co-authors

Papers (2)