Search papers, labs, and topics across Lattice.
Beijing Institute of Technology
3
0
6
PathRouter reduces reliance on shortcuts in reinforcement learning, leading to more reliable and contextually rich decision-making in language-model agents.
JailbreakOPT boosts attack success rates against LLMs while slashing the number of attempts needed to breach safety measures.
Soft robots can now learn to grasp objects more effectively by translating rigid-gripper grasps into successful soft-gripper grasps using a conditional generative model.