Retrofit your VLMs with Multi-Head Latent Attention (MLA) for faster inference and a smaller memory footprint, without costly pretraining, using this parameter-efficient conversion framework.
Finally, a fully open-source, reproducible system for long-form song generation: licensed training data, code, and a Qwen-based model that rivals closed-source systems.