Search papers, labs, and topics across Lattice.
The Conversational Artificial Intelligence (CoAI) Group, Tsinghua University
Tsinghua AI3
0
7
2
Current judge models for instruction-following are surprisingly unreliable, but a new benchmark exposes their flaws and offers a path to better alignment.
LLMs can now actively perceive and react to anomalies during scientific simulations, leading to more reliable and accurate results in complex engineering and modeling tasks.
GLM-5 doesn't just code; it engineers, showcasing unprecedented capability in tackling end-to-end software engineering challenges.