Search papers, labs, and topics across Lattice.
3
0
6
Current phone-use agents are often *too* helpful, routinely violating user privacy by filling in unnecessary personal information even when a task doesn't require it.
Achieve robust locomotion for multi-legged robots on rough terrain with a surprisingly simple, decentralized control architecture that blends event-driven and CPG-based approaches.
PPO's fixed clipping hurts exploration by squashing high-reward, low-probability actions, but BandPO fixes this with probability-aware bounds that boost performance.