Apple ML | UCSB | Apr 1, 2026 | arXiv:2604.00842

Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants

Deepak Nathani, Cheng Zhang, Chang Huan, Jiaming Shan, Yinfei Yang, Alkesh Patel, Zhe Gan, William Yang Wang, Michael Stephen Saxon, Michael Saxon, Xin Eric Wang

AI Summary

This paper introduces Proactive Agent Research Environment (Pare), a framework for simulating active users interacting with digital environments modeled as finite state machines. Pare enables more realistic user simulation by capturing stateful navigation and state-dependent action spaces, addressing limitations of existing approaches that model apps as flat tool-calling APIs. The authors also present Pare-Bench, a benchmark of 143 tasks spanning communication, productivity, scheduling, and lifestyle apps, which evaluates proactive agents on context observation, goal inference, intervention timing, and multi-app orchestration.

Key Contribution

More realistic user simulation: Pare moves beyond flat tool-calling APIs to model stateful user interactions, enabling better evaluation of proactive agents.

Abstract

Proactive agents that anticipate user needs and autonomously execute tasks hold great promise as digital assistants, yet the lack of realistic user simulation frameworks hinders their development. Existing approaches model apps as flat tool-calling APIs, failing to capture the stateful and sequential nature of user interaction in digital environments and making realistic user simulation infeasible. We introduce Proactive Agent Research Environment (Pare), a framework for building and evaluating proactive agents in digital environments. Pare models applications as finite state machines with stateful navigation and state-dependent action space for the user simulator, enabling active user simulation. Building on this foundation, we present Pare-Bench, a benchmark of 143 diverse tasks spanning communication, productivity, scheduling, and lifestyle apps, designed to test context observation, goal inference, intervention timing, and multi-app orchestration.
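To make the contrast with flat tool-calling APIs concrete, the following is a minimal sketch of an app modeled as a finite state machine with a state-dependent action space. It is not Pare's implementation or API: the `AppFSM` class, the `TRANSITIONS` table, and the screen and action names are all hypothetical, chosen only to illustrate the idea that a simulated user can take only the actions valid on the current screen.

```python
from dataclasses import dataclass, field

# Hypothetical transition table for a toy messaging app:
# state -> {action: next_state}. None of these names come from Pare.
TRANSITIONS = {
    "inbox": {"open_thread": "thread", "compose": "draft"},
    "thread": {"reply": "draft", "back": "inbox"},
    "draft": {"send": "inbox", "discard": "inbox"},
}


@dataclass
class AppFSM:
    """Toy app whose available actions depend on the current screen."""

    state: str = "inbox"
    history: list = field(default_factory=list)

    def available_actions(self):
        # State-dependent action space: only actions valid on this screen.
        return sorted(TRANSITIONS[self.state])

    def step(self, action):
        # Reject actions that the current screen does not offer.
        if action not in TRANSITIONS[self.state]:
            raise ValueError(
                f"{action!r} is not available in state {self.state!r}"
            )
        self.history.append((self.state, action))
        self.state = TRANSITIONS[self.state][action]
        return self.state


# A simulated user navigates inbox -> thread -> draft.
app = AppFSM()
app.step("open_thread")
app.step("reply")
print(app.state, app.available_actions())
```

In a flat tool-calling model, `send` would be callable at any time; here it becomes available only after the user has navigated to the draft screen, which is the kind of sequential structure the abstract describes.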

Eval Frameworks & Benchmarks | Tool Use & Agents | World Models & Planning

Citation Metrics

Citations: 0

Influential citations: 0

References: 25

Year: 2026

Venue: N/A
