Search papers, labs, and topics across Lattice.
2
0
6
0
Freezing most of your critic network and only training a tiny LoRA adapter can dramatically improve off-policy RL performance and stability.
Jointly modeling need prediction and service recommendation in an LLM framework dramatically improves local life service recommendations, outperforming traditional isolated approaches.