Lattice AI Research

Research focus

Speech & Audio (3); Architecture Design (Transformers, SSMs, MoE) (2); Natural Language Processing (2); Multimodal Models (1); Eval Frameworks & Benchmarks (1)

Frequent co-authors

Yu Zhang (3); Ruiqi Li (2); Changhao Pan (2); Wenxiang Guo (2)

Papers (3)

May 29, 2026

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

SwanVoice is a single zero-shot TTS model that generates expressive long-form speech for both monologue and multi-speaker dialogue, aiming to close the gap between monologue quality and conversational coherence.

Ruiqi Li, Yu Zhang, Changhao Pan +2

Architecture Design (Transformers, SSMs, MoE); Natural Language Processing; Speech & Audio

May 29, 2026

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

SwanSphere achieves real-time, high-fidelity spatial audio generation from panoramic video and text, overcoming the latency and spatial accuracy limitations of existing methods.

Ke Lei, Yu Zhang, Changhao Pan +4

Architecture Design (Transformers, SSMs, MoE); Multimodal Models; Speech & Audio

May 27, 2026

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

Current speech generation models still fall short in maintaining consistency and capturing nuanced expressiveness when generating long-form speech, despite advances in high-fidelity synthesis.

Changhao Pan, Rui Yang, Hankun Wang +13

Eval Frameworks & Benchmarks; Natural Language Processing; Speech & Audio


Ke Lei
