Search papers, labs, and topics across Lattice.
The Conversational Artificial Intelligence (CoAI) Group, Tsinghua University
Tsinghua AI1
0
2
Current judge models for instruction-following are surprisingly unreliable, but a new benchmark exposes their flaws and offers a path to better alignment.