Jonah Casebeer

Papers on Lattice

Total citations

Topics

h-index

Research focus

Computer Vision (2)Speech & Audio (2)Multimodal Models (1)Architecture Design (Transformers, SSMs, MoE) (1)Training Efficiency & Optimization (1)

Frequent co-authors

Nicholas J. Bryan (3)Yan-Bo Lin (1)Long Mai (1)Aniruddha Mahapatra (1)

Papers (3)

Mar 11, 2026

Mar 11, 2026·also Adobe Research

V2M-Zero: Zero-Pair Time-Aligned Video-to-Music Generation

Forget paired video-music training data: V2M-Zero aligns video and music by matching the *timing* of changes within each modality, not the content itself.

Yan-Bo Lin, Jonah Casebeer, Long Mai +3

Computer Vision Multimodal Models Speech & Audio

Feb 17, 2026

Jonah Casebeer +3Feb 17, 2026

A Generative-First Neural Audio Autoencoder

Compressing 60-second audio into just 788 tokens, this new autoencoder makes generative audio modeling far more tractable by slashing encoding time and latent rates.

Jonah Casebeer, Ge Zhu, Zhepei Wang +1

Architecture Design (Transformers, SSMs, MoE)Speech & Audio Training Efficiency & Optimization

Apr 21, 2025

Yatong Bai +3Apr 21, 2025

DRAGON: Distributional Rewards Optimize Diffusion Generative Models

Forget RLHF and DPO – DRAGON lets you fine-tune generative models with rewards that compare entire *distributions* of outputs, unlocking better control and quality without human preference data.

Yatong Bai, Jonah Casebeer, S. Sojoudi +1

Computer Vision RLHF & Preference Learning

Search

Jonah Casebeer

Research focus

Frequent co-authors

Papers (3)