UWDukeFeb 19, 2026arXiv:2602.17588

Modeling Distinct Human Interaction in Web Agents

Faria Huq, Faria Huq, Zora Zhiruo Wang, Z. Z. Wang, Zhanqiu Guo, Zhanqiu Guo, Venu Arvind Arangarajan, Venu Arvind Arangarajan, Tianyue Ou, Tianyue Ou, Frank Xu, Frank F. Xu, Shuyan Zhou, Graham Neubig, Jeffrey P. Bigham, Jeffrey P. Bigham

AI Summary

The paper introduces the task of modeling human intervention in web agents to improve collaborative task execution by collecting CowCorpus, a dataset of 400 real-user web navigation trajectories with interleaved human and agent actions, and identifying four distinct patterns of user interaction. They train language models to predict when users will intervene based on interaction styles, achieving a 61.4-63.4% improvement in intervention prediction accuracy. Deploying these models in live web navigation agents resulted in a 26.5% increase in user-rated agent usefulness, demonstrating the value of structured modeling of human intervention.

Key Contribution

Stop guessing when humans want to take over: modeling user intervention styles in web agents boosts their usefulness by 26.5%.

Abstract

Despite rapid progress in autonomous web agents, human involvement remains essential for shaping preferences and correcting agent behavior as tasks unfold. However, current agentic systems lack a principled understanding of when and why humans intervene, often proceeding autonomously past critical decision points or requesting unnecessary confirmation. In this work, we introduce the task of modeling human intervention to support collaborative web task execution. We collect CowCorpus, a dataset of 400 real-user web navigation trajectories containing over 4,200 interleaved human and agent actions. We identify four distinct patterns of user interaction with agents -- hands-off supervision, hands-on oversight, collaborative task-solving, and full user takeover. Leveraging these insights, we train language models (LMs) to anticipate when users are likely to intervene based on their interaction styles, yielding a 61.4-63.4% improvement in intervention prediction accuracy over base LMs. Finally, we deploy these intervention-aware models in live web navigation agents and evaluate them in a user study, finding a 26.5% increase in user-rated agent usefulness. Together, our results show structured modeling of human intervention leads to more adaptive, collaborative agents.

Data Curation & Synthetic Data RLHF & Preference Learning Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References38

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Modeling Distinct Human Interaction in Web Agents

Related Papers