Search papers, labs, and topics across Lattice.
3
0
7
Achieving new state-of-the-art scores in deep research benchmarks, DuMate-DeepResearch redefines the capabilities of multi-agent systems in tackling complex research tasks.
GRPO's Achilles' heel in deep search is its coarse advantage assignment, but CalibAdv offers a way to surgically correct it, boosting both performance and training stability.
Current phone-use agents are often *too* helpful, routinely violating user privacy by filling in unnecessary personal information even when a task doesn't require it.