Yukuo Cen

A compact 0.9B multimodal model, GLM-OCR, achieves state-of-the-art document understanding by predicting multiple tokens at once, boosting decoding throughput without blowing up memory.

Shuaiqi Duan, Ya-Qi Xue, Weihan Wang +15

Architecture Design (Transformers, SSMs, MoE)Multimodal Models Training Efficiency & Optimization

Feb 17, 2026

Feb 17, 2026·also Tsinghua AI, Ant Group, Chengdu Minto Tech, HKUST +7

GLM-5: from Vibe Coding to Agentic Engineering

GLM-5 doesn't just code; it engineers, showcasing unprecedented capability in tackling end-to-end software engineering challenges.

GLM-5 Team, Qinkai Zheng, Da Yin +128

Code Generation & Program Synthesis Tool Use & Agents Training Efficiency & Optimization

Search

Yukuo Cen

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)