Yi-Jing Chen

Papers on Lattice

Total citations

Topics

h-index

Publication activity (papers/week, last 8 weeks)

Research focus

Multimodal Models (2), Natural Language Processing (1), Speech & Audio (1), Computer Vision (1)

Frequent co-authors

Yuyue Wang (1), Xihua Wang (1), Xin Cheng (1), Ruihua Song (1)

Papers (2)

May 27, 2026

Yuyue Wang +4 · 3w ago

Unified Synthesis of Compositional Speech and Sound from Free-Form Text Prompts

Forget disjointed pipelines and structured inputs: PlanAudio uses an LLM and semantic latent chain-of-thought to directly synthesize unified audio from free-form text prompts.

Yuyue Wang, Xihua Wang, Xin Cheng +2

Multimodal Models, Natural Language Processing, Speech & Audio

Feb 26, 2026

Feb 26, 2026 · also Cohere

MSJoE: Jointly Evolving MLLM and Sampler for Efficient Long-Form Video Understanding

By jointly training a keyframe sampler with an MLLM, MSJoE achieves state-of-the-art accuracy in long-form video understanding while significantly reducing computational cost.

Wenhui Tan, Xiaoyi Yu, Xiaoyi Yu +6

Computer Vision, Multimodal Models, Training Efficiency & Optimization
