Search papers, labs, and topics across Lattice.
1
0
3
Finally, a multimodal dialogue model that doesn't just talk about instructional videos, but actually understands and reasons about the visual steps involved, blowing away previous text-only approaches.