Josh Susskind

Papers on Lattice

Total citations

Topics

Research focus

Computer Vision (1)Robotics & Embodied AI (1)World Models & Planning (1)Multimodal Models (1)Scaling Laws & Emergent Abilities (1)

Frequent co-authors

Joshua M. Susskind (2)Nick Stracke (1)Kolja Bauer (1)Stefan Andreas Baumann (1)

Papers (2)

Apr 13, 2026

Apple MLApr 13, 2026·also LMU

Learning Long-term Motion Embeddings for Efficient Kinematics Generation

Forget generating entire videos – this method distills motion into a highly compressed latent space, letting you steer scene dynamics with text prompts at unprecedented speeds.

Nick Stracke, Kolja Bauer, Stefan Andreas Baumann +5

Computer Vision Robotics & Embodied AI World Models & Planning

Feb 25, 2026

DeepMindFeb 25, 2026·also Apple ML, Berkeley University, Institut National de la Recherche, UPF

The Design Space of Tri-Modal Masked Diffusion Models

Tri-modal masked diffusion models can now be trained from scratch, achieving strong results in text generation, text-to-image, and text-to-speech, thanks to a systematic exploration of the design space and a novel SDE-based batch size reparameterization.

Louis Bethune, L. Béthune, Victor Turrisi +42

Multimodal Models Scaling Laws & Emergent Abilities Speech & Audio

Search

Josh Susskind

Research focus

Frequent co-authors

Papers (2)