Search papers, labs, and topics across Lattice.
GLM-5 is a next-generation foundation model that shifts from "vibe coding" to "agentic engineering" by enhancing agentic, reasoning, and coding (ARC) capabilities. The model uses DSA (likely a novel architecture or training technique) to reduce training and inference costs while preserving long-context fidelity. It also introduces an asynchronous reinforcement learning infrastructure and novel asynchronous agent RL algorithms to improve post-training alignment and autonomy, resulting in state-of-the-art performance on benchmarks and real-world coding tasks.
GLM-5 doesn't just code; it engineers, showcasing unprecedented capability in tackling end-to-end software engineering challenges.
We present GLM-5, a next-generation foundation model designed to transition the paradigm of vibe coding to agentic engineering. Building upon the agentic, reasoning, and coding (ARC) capabilities of its predecessor, GLM-5 adopts DSA to significantly reduce training and inference costs while maintaining long-context fidelity. To advance model alignment and autonomy, we implement a new asynchronous reinforcement learning infrastructure that drastically improves post-training efficiency by decoupling generation from training. Furthermore, we propose novel asynchronous agent RL algorithms that further improve RL quality, enabling the model to learn from complex, long-horizon interactions more effectively. Through these innovations, GLM-5 achieves state-of-the-art performance on major open benchmarks. Most critically, GLM-5 demonstrates unprecedented capability in real-world coding tasks, surpassing previous baselines in handling end-to-end software engineering challenges. Code, models, and more information are available at https://github.com/zai-org/GLM-5.