Joshua M. Susskind

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Computer Vision (3)Multimodal Models (2)Architecture Design (Transformers, SSMs, MoE) (1)Robotics & Embodied AI (1)

Frequent co-authors

Nick Stracke (2)Kolja Bauer (2)Stefan Andreas Baumann (2)Miguel Angel Bautista (2)

Papers (4)

Jul 2, 2026

1w ago

Show Me Examples: Inferring Visual Concepts from Image Sets

State-of-the-art vision-language models fail to leverage visual context, leading to biased outputs, but a new training framework shows they can learn to infer concepts from image sets effectively.

Nick Stracke, Kolja Bauer, Stefan Andreas Baumann +3

Computer Vision Multimodal Models

Apr 21, 2026

Tianrong Chen +3Apr 21, 2026

Normalizing Flows with Iterative Denoising

Normalizing Flows can now compete with diffusion models on image generation tasks, thanks to an iterative denoising scheme that boosts performance without sacrificing likelihood-based training.

Tianrong Chen, David Berthelot, Joshua M. Susskind +1

Architecture Design (Transformers, SSMs, MoE)Computer Vision

Apr 13, 2026

Apple MLApr 13, 2026·also LMU

Learning Long-term Motion Embeddings for Efficient Kinematics Generation

Forget generating entire videos – this method distills motion into a highly compressed latent space, letting you steer scene dynamics with text prompts at unprecedented speeds.

Nick Stracke, Kolja Bauer, Stefan Andreas Baumann +5

Computer Vision Robotics & Embodied AI World Models & Planning

Feb 25, 2026

DeepMindFeb 25, 2026·also Apple ML, Berkeley University, Institut National de la Recherche, UPF

The Design Space of Tri-Modal Masked Diffusion Models

Tri-modal masked diffusion models can now be trained from scratch, achieving strong results in text generation, text-to-image, and text-to-speech, thanks to a systematic exploration of the design space and a novel SDE-based batch size reparameterization.

L. Béthune, Louis Bethune, V. Turrisi +42

Multimodal Models Scaling Laws & Emergent Abilities Speech & Audio

Search

Joshua M. Susskind

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (4)