Search papers, labs, and topics across Lattice.
The paper introduces PageGuide, a browser extension that enhances web navigation by grounding LLM answers and instructions directly within the HTML DOM using visual overlays. This addresses the limitations of existing AI assistants that lack in-situ evidence presentation, forcing users to manually verify results. A user study (N=94) demonstrates that PageGuide significantly improves task completion time, accuracy, and reduces manual search effort compared to unaided browsing.
Stop blindly trusting LLMs: PageGuide visually grounds AI answers directly in the webpage, slashing task times by up to 70% and boosting accuracy by 26%.
Users browsing the web daily struggle to quickly locate relevant information in cluttered pages, complete unfamiliar multi-step tasks, and stay focused amid distracting content. State-of-the-art AI assistants (e.g., ChatGPT, Gemini, Claude) and browser agents (e.g., OpenAI Operator, Browser Use) can answer questions and automate actions, yet they return answers without showing where the information comes from on the page, forcing users to manually verify results and blindly trust every automated steps. We present PageGuide, a browser extension that grounds LLM answers directly in the HTML DOM via visual overlays, addressing three core user needs: (a) Find-locating and highlighting relevant evidence in-situ so users can instantly verify answers on the page; (b) Guide-showing step-by-step instructions (e.g. how to change password) one at a time so users can follow and perform actions by themselves; and (c) Hide-hiding distracting content-giving users a chance to decide to hide an element or not. In a user study (N=94), PageGuide outperform unaided browsing across all modes: Hide accuracy improve by 26 percentage points (86.7% relative gain) and task completion time drops by 70%; Guide completion rate increases by 30 percentage points; and Find reduces manual search effort, with Ctrl+F usage falling by 80% and task time decreasing by 19%. Code and demo is at: pageguide.github.io.