Search papers, labs, and topics across Lattice.
Salesforce AI Research, V baseline from CogVideoX [98], and our framework, FOFPred
1
0
2
4
A unified Vision-Language Model and Diffusion architecture unlocks surprisingly effective optical flow forecasting from noisy web data, enabling language-conditioned robot control and video generation.