This paper introduces PASK, a proactive agent system designed for real-world scenarios, addressing the limitations of prior work that remains largely confined to laboratory settings. PASK follows the DD-MM-PAS (Demand Detection, Memory Modeling, Proactive Agent System) paradigm, featuring a streaming IntentFlow model for demand detection and a hybrid memory system for long-term context. Evaluated on a new real-world benchmark, LatentNeeds-Bench, PASK's IntentFlow model achieves performance comparable to Gemini3-Flash under latency constraints while identifying deeper user intent.
Real-world proactive agents can now infer latent user needs and act on them in real time, rivaling state-of-the-art models in intent detection while maintaining low latency.
Proactivity is a core expectation for AGI. Prior work remains largely confined to laboratory settings, leaving a clear gap for real-world proactive agents in depth, complexity, ambiguity, precision, and real-time constraints. We study this setting, where useful intervention requires inferring latent needs from ongoing context and grounding actions in evolving user memory under latency and long-horizon constraints. We first propose DD-MM-PAS (Demand Detection, Memory Modeling, Proactive Agent System) as a general paradigm for streaming proactive AI agents. We instantiate this paradigm in PASK, with a streaming IntentFlow model for DD, a hybrid memory (workspace, user, global) for long-term MM, and a PAS infrastructure framework, and we describe how these components form a closed loop. We also introduce LatentNeeds-Bench, a real-world benchmark built from user-consented data and refined through thousands of rounds of human editing. Experiments show that IntentFlow matches leading Gemini3-Flash models under latency constraints while identifying deeper user intent.
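The closed loop described in the abstract can be pictured with a minimal sketch. Everything below is an illustrative assumption rather than PASK's actual design: the `HybridMemory` class, the `detect_demand`, `act`, and `run_stream` functions, and the keyword rule standing in for IntentFlow are all hypothetical. Only the three memory tiers (workspace, user, global) and the DD, MM, PAS roles come from the abstract.

```python
from dataclasses import dataclass, field


@dataclass
class HybridMemory:
    """Three memory tiers named in the abstract; their contents here are assumed."""
    workspace: list = field(default_factory=list)  # context of the ongoing session
    user: dict = field(default_factory=dict)       # long-term per-user facts
    global_: dict = field(default_factory=dict)    # knowledge shared across users


def detect_demand(utterance, memory):
    """Toy stand-in for demand detection (DD): a keyword rule, not IntentFlow."""
    if "deadline" in utterance.lower():
        return "offer_reminder"
    return None  # stay silent when no latent need is detected


def act(demand, memory):
    """Toy stand-in for the agent system (PAS): turn a demand into an action."""
    name = memory.user.get("name", "there")
    if demand == "offer_reminder":
        return f"Hi {name}, want me to set a reminder?"
    return ""


def run_stream(utterances, memory):
    """Closed loop: each chunk updates memory, then DD decides whether PAS acts."""
    actions = []
    for utterance in utterances:
        memory.workspace.append(utterance)         # MM: record ongoing context
        demand = detect_demand(utterance, memory)  # DD: infer a latent need
        if demand is not None:
            actions.append(act(demand, memory))    # PAS: intervene proactively
            memory.user["last_action"] = demand    # feed the outcome back to memory
    return actions
```

Calling `run_stream(["Nice weather today.", "The deadline is Friday."], HybridMemory(user={"name": "Ada"}))` produces a single action, triggered by the second utterance. The point of the sketch is the ordering: memory is updated before detection, and the action taken is written back to memory so that later detections can depend on it.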