Ditch the critic: This new reinforcement learning approach trains feature extractors for human activity recognition without needing a value function, leading to more stable and generalizable performance across diverse users.
A compact 0.9B multimodal model, GLM-OCR, achieves state-of-the-art document understanding by predicting multiple tokens at once, boosting decoding throughput without blowing up memory.
GLM-5 doesn't just code; it engineers, showcasing unprecedented capability in tackling end-to-end software engineering challenges.