Search papers, labs, and topics across Lattice.
Hunyuan Team Tencent
4
0
7
Coding agents struggle to create complete and engaging games, with top performers barely reaching 41.46% success in end-to-end game generation.
Current phone-use agents are often *too* helpful, routinely violating user privacy by filling in unnecessary personal information even when a task doesn't require it.
Achieve robust locomotion for multi-legged robots on rough terrain with a surprisingly simple, decentralized control architecture that blends event-driven and CPG-based approaches.
PPO's fixed clipping hurts exploration by squashing high-reward, low-probability actions, but BandPO fixes this with probability-aware bounds that boost performance.