Search papers, labs, and topics across Lattice.
Zhejiang University
3
0
6
SlimSearcher cuts tool-call rounds by up to 58% without sacrificing accuracy, redefining efficiency in web agent training.
Freezing most of your critic network and only training a tiny LoRA adapter can dramatically improve off-policy RL performance and stability.
Jointly modeling need prediction and service recommendation in an LLM framework dramatically improves local life service recommendations, outperforming traditional isolated approaches.