Jie Tang

Manuscript received ; revised . (Corresponding author: Wenlong Niu.)This work was supported by the Civil Aerospace Pre-research Project under Grant D040103. All authors are with the Key Laboratory of Electronics and Information Technology for Space Systems, National Space Science Center, Chinese Academy of Sciences, Beijing 100190, China. Weihua Gao is also with the School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049, China. (e-mail: gaoweihua22@mails.ucas.ac.cn; niuwenlong@nssc.ac.cn)

Tsinghua AI

Papers on Lattice

Total citations

Topics

h-index

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Multimodal Models (1)Training Efficiency & Optimization (1)

Frequent co-authors

Shuaiqi Duan (1)Ya-Qi Xue (1)Weihan Wang (1)Zhèngyuān Sū (1)

Papers (1)

Mar 11, 2026

Tsinghua AIMar 11, 2026·also CAS, RAI, ZJU

GLM-OCR Technical Report

A compact 0.9B multimodal model, GLM-OCR, achieves state-of-the-art document understanding by predicting multiple tokens at once, boosting decoding throughput without blowing up memory.

Shuaiqi Duan, Ya-Qi Xue, Weihan Wang +15

Architecture Design (Transformers, SSMs, MoE)Multimodal Models Training Efficiency & Optimization

Search

Jie Tang

Research focus

Frequent co-authors

Papers (1)