Search papers, labs, and topics across Lattice.
The University of Hong Kong
2
0
4
Stop blind drawing: giving MLLMs eyes to see their work-in-progress boosts SVG generation performance.
By integrating hierarchical memory and agentic RL, EventMemAgent enables MLLMs to actively perceive and reason about unbounded video streams, outperforming passive processing methods.