Potsawee Manakul

Papers on Lattice

Total citations

Topics

h-index

Research focus

Architecture Design (Transformers, SSMs, MoE) (1)Multimodal Models (1)Speech & Audio (1)

Frequent co-authors

Potsawee Manakul (1)Woody Haosheng Gan (1)Woody Haosheng Gan (1)Martijn Bartelds (1)

Papers (1)

Feb 18, 2026

Feb 18, 2026·also Stanford HAI, Together

Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens

Forget text-first: SODA models show that scaling native audio foundation models with interleaved semantic, acoustic, and text tokens unlocks powerful audio generation and cross-modal capabilities.

Potsawee Manakul, Potsawee Manakul, Woody Haosheng Gan +6

Architecture Design (Transformers, SSMs, MoE)Multimodal Models Speech & Audio

Search

Potsawee Manakul

Research focus

Frequent co-authors

Papers (1)