CUHK MMLab
2
0
5
InternVL-U, a 4B-parameter model, punches above its weight: it outperforms 14B-parameter models in multimodal generation and editing, thanks to a novel data synthesis pipeline and architecture.
Current GUI agents are reactive, but PIRA-Bench offers a challenging new environment for training agents to *proactively* anticipate user intentions from continuous visual inputs, a crucial step towards truly intelligent AI assistants.