Michael S. Ryoo

Salesforce AI Research

Research focus

Multimodal Models (1), Robotics & Embodied AI (1)

Frequent co-authors

Kanchana Ranasinghe (1), Honglu Zhou (1), Yu Fang (1), Luyu Yang (1)

Papers (1)

Jan 15, 2026 · also Stanford HAI, UNC

Future Optical Flow Prediction Improves Robot Control & Video Generation

A unified Vision-Language Model and Diffusion architecture unlocks surprisingly effective optical flow forecasting from noisy web data, enabling language-conditioned robot control and video generation.

Kanchana Ranasinghe, Honglu Zhou, Yu Fang +7

Multimodal Models · Robotics & Embodied AI
