Search papers, labs, and topics across Lattice.
University of Southern California
1
0
3
9
Finally, a model that can generate realistic videos of human-object interactions, like pouring liquid, by conditioning on actions, text, and images.