BeihangNTUPKUTencent AIWHUMay 28, 2026arXiv:2605.29372

On the Road to Personalized Code Intelligence: Portraiting and Assisting Developers Based on Their In-IDE Behaviors

Yuhong Liu, Yuhong Liu, Yunhe Su, Yunhe Su, Zhipeng Peng, Zhiwen Luo, Zhiwen Luo, Ling Shi, Zhi Jin, Zhi Jin, Li Zhang

AI Summary

This paper introduces VirtualME, an IDE-embedded data infrastructure that models developers by capturing and interpreting their programming behaviors and preferences. VirtualME extracts log-level behaviors, recognizes task-level behaviors via a multi-agent pipeline, and distills a four-dimensional developer persona. Integrating this persona into a Q&A agent for repository-level knowledge retrieval improves answer quality by 33.80% compared to generic baselines, demonstrating the value of personalized code intelligence.

Key Contribution

Ditch the one-size-fits-all code intelligence: modeling individual developer behaviors inside the IDE boosts Q&A accuracy by 33.8%.

Abstract

With the advent of large language models, research in automated software engineering has increasingly focused on leveraging these models to achieve a deeper semantic understanding of code or to engineer sophisticated agent-based processes. However, this research trajectory has largely overlooked a critical factor: the developers themselves. Programming is a deeply individualized activity; developers exhibit significant variation in their tool-chain preferences, domain-specific expertise, and problem-solving strategies. Consequently, the current paradigm of one-size-fits-all code intelligence systems struggles to accommodate the needs of individual developers. To address this gap, we introduce VirtualME, a novel IDE-embedded data infrastructure designed to model the developer by continuously capturing and interpreting their dynamic programming behaviors and preferences. VirtualME contains three components. (1) Log-level Behavior Extraction: it captures and extracts developers'log-level behaviors from IDE. (2) Task-level Behavior Recognition: it aggregates log-level behaviors into task-level behaviors via a multi-agent pipeline. (3) Developer-personality Measurement: it builds a rule engine to distill a four-dimensional developer persona: technology stack, ability, behavioral habits, and learning style. On top of VirtualME, we propose a solution for personalized repository-level knowledge Q&A by integrating the developer persona into the Q&A agent. We evaluated VirtualME by building a multi-repository benchmark with real-world developer trajectories, balancing correctness and personalization. Experimental results show that VirtualME-enhanced answers outperform generic baselines on five dimensions, yielding an average 33.80% improvement. Our results demonstrate that abundant, continuous developer-behavior data can pave the new way for adaptive and personalized code intelligence.

Code Generation & Program Synthesis Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References62

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

On the Road to Personalized Code Intelligence: Portraiting and Assisting Developers Based on Their In-IDE Behaviors

Related Papers