Search papers, labs, and topics across Lattice.
Saarland Informatics Campus
1
0
3
Multimodal language models are fluent liars: they produce convincing procedural video captions that are often factually incomplete, with systematic omissions and role-level inconsistencies exposed by video-grounded verification.