Search papers, labs, and topics across Lattice.
2
0
5
OmniAgent's ability to improve performance with more reasoning turns challenges the traditional "watch-it-all" approach in video understanding.
Fine-tuning on the new ProCUA-SFT dataset boosts UI-TARS 7B's performance from a dismal 8-10% to an impressive 45.0% on OSWorld tasks, highlighting the critical role of high-quality training data.