Lattice AI Research

Research focus

Tool Use & Agents (2) · Computer Vision (1) · Multimodal Models (1) · Architecture Design (Transformers, SSMs, MoE) (1) · Distributed Systems & Hardware (1)

Frequent co-authors

V Team (1) · GLM-V Team Wenyi Hong (1) · Xiaotao Gu (1) · Wenyi Hong (1)

Papers (4)

Apr 29, 2026

Tsinghua AI · Apr 29, 2026 · also Fudan

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Multimodal perception is no longer just an add-on: GLM-5V-Turbo bakes it directly into the core of reasoning, planning, and action.

V Team, GLM-V Team Wenyi Hong, Xiaotao Gu +88

Computer Vision · Multimodal Models · Tool Use & Agents

Apr 8, 2026

Apr 8, 2026 · also NUS, ByteDance, HKUST, SMU

InfiniLoRA: Disaggregated Multi-LoRA Serving for Large Language Models

Serving LoRA adapters at scale doesn't have to crush your latency SLOs: InfiniLoRA disaggregates LoRA execution to achieve 3x higher throughput and dramatically improved tail latency.

Hongyu Chen, Letian Ruan, Zilin Xu +1

Architecture Design (Transformers, SSMs, MoE) · Distributed Systems & Hardware · Inference & Quantization

Apr 7, 2026

Apr 7, 2026 · also Baidu, CAS, Institute of Software

UniCreative: Unifying Long-form Logic and Short-form Sparkle via Reference-Free Reinforcement Learning

Models can learn to distinguish creative-writing tasks that require rigorous planning from those suited to direct generation, unlocking a new level of meta-cognitive ability.

Xiaolong Wei, Zerun Zhu, Simin Niu +8

Natural Language Processing · RLHF & Preference Learning

Mar 18, 2026

CMU ML · Mar 18, 2026 · also INSA Rennes

CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents

Forget specialized tools: a standard Unix terminal and clever RL are all you need to beat much larger LLMs at code search.

Lintang Sutawika, Aditya Bharat Soni, Bharath Sriraam R R +11

Code Generation & Program Synthesis · Recommendation & Information Retrieval · Tool Use & Agents

Yuchen Li