LLMs can perform well in high-frequency decision-making tasks such as UAV pursuit, using a reward normalization and consistency loss approach that aligns the global policy with the sub-semantic policies.