Search papers, labs, and topics across Lattice.
Hunyuan Team Tencent
3
0
4
Coding agents struggle to create complete and engaging games, with top performers barely reaching 41.46% success in end-to-end game generation.
Reliable phone automation hinges on mixed-action capabilities, with agents achieving a 75% success rate in real-world workflows.
Current phone-use agents are often *too* helpful, routinely violating user privacy by filling in unnecessary personal information even when a task doesn't require it.