Search papers, labs, and topics across Lattice.
Tohoku University
2
0
4
5
Current text-to-long-video evaluation metrics can't reliably assess video quality, failing to match human judgment in 9 out of 10 tested degradation aspects.
LVLMs encode nodes early but edges late, suggesting a fundamental bottleneck in how these models process relational information in diagrams.