T. Zhu

Research focus

Multimodal Models (2) · Architecture Design (Transformers, SSMs, MoE) (1) · Computer Vision (1) · Code Generation & Program Synthesis (1) · Tool Use & Agents (1)

Frequent co-authors

Siyuan Huang (2) · Xiaoye Qu (2) · Zefeng He (2) · Yafu Li (1)

Papers (2)

May 1, 2026

Siyuan Huang +8 · May 1, 2026 · also WHU

Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs

LVLMs can maintain sharper visual focus during long-form generation by adding a lightweight, learnable memory module that bypasses attention dilution.

Siyuan Huang, Xiaoye Qu, Yafu Li +6

Architecture Design (Transformers, SSMs, MoE) · Computer Vision · Multimodal Models

Mar 30, 2026

Zefeng He +4 · Mar 30, 2026

GEMS: Agent-Native Multimodal Generation with Memory and Skills

A lightweight 6B model, when harnessed within the GEMS agent framework, leapfrogs state-of-the-art models in multimodal generation, suggesting architectural innovations in agents can compensate for raw parameter count.

Zefeng He, Siyuan Huang, Xiaoye Qu +2

Code Generation & Program Synthesis · Multimodal Models · Tool Use & Agents
