The Hong Kong University of Science and Technology, The Hong Kong University of Science and Technology (Guangzhou)
Microsoft Research
OpenWebRL-4B sets a new benchmark for open-source visual web agents, achieving impressive success rates with minimal initial data while outperforming larger-scale competitors.
SeClaw shows that existing benchmarks fail to capture the complexity of agent behavior, and it enables a more nuanced evaluation of security risks in autonomous systems.
Recurrent memory can be added to transformers at scale with minimal parameter overhead and no performance penalty by reusing existing hidden states and training with interleaved parallel updates.
CRAFT combines dense counterfactual supervision with grounded closed-loop feedback to significantly improve autonomous driving policies.