NTU, PKU | Apr 6, 2026 | arXiv:2604.04901

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Shuai Liu, Shulin Tian, Kairui Hu, Yuhao Dong, Zhe Yang, Bo Li, Jingkang Yang, Chen Change Loy, Ziwei Liu

AI Summary

This paper introduces FileGram, a framework for personalizing AI agents operating within local file systems by leveraging file-system behavioral traces. FileGram includes a data engine (FileGramEngine) for simulating realistic workflows, a benchmark (FileGramBench) for evaluating memory systems, and a memory architecture (FileGramOS) that builds user profiles from atomic actions and content deltas. Experiments demonstrate that FileGramBench poses challenges for existing memory systems and that FileGramEngine and FileGramOS are effective components for personalized agents.

Key Contribution

Forget dialogue summaries – FileGram builds user profiles directly from atomic file-system actions, unlocking a richer, more privacy-preserving approach to agent personalization.

Abstract

Coworking AI agents operating within local file systems are rapidly emerging as a paradigm in human-AI interaction. However, effective personalization remains limited by severe data constraints: strict privacy barriers and the difficulty of jointly collecting multimodal real-world traces prevent scalable training and evaluation. Existing methods also remain interaction-centric, overlooking the dense behavioral traces in file-system operations. To address this gap, we propose FileGram, a comprehensive framework that grounds agent memory and personalization in file-system behavioral traces. It comprises three core components: (1) FileGramEngine, a scalable persona-driven data engine that simulates realistic workflows and generates fine-grained multimodal action sequences at scale; (2) FileGramBench, a diagnostic benchmark grounded in file-system behavioral traces for evaluating memory systems on profile reconstruction, trace disentanglement, persona drift detection, and multimodal grounding; and (3) FileGramOS, a bottom-up memory architecture that builds user profiles directly from atomic actions and content deltas rather than dialogue summaries, encoding these traces into procedural, semantic, and episodic channels with query-time abstraction. Extensive experiments show that FileGramBench remains challenging for state-of-the-art memory systems and that FileGramEngine and FileGramOS are effective. By open-sourcing the framework, we hope to support future research on personalized memory-centric file-system agents.
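To make the bottom-up idea concrete, here is a minimal Python sketch of how atomic actions and content deltas could be routed into procedural, semantic, and episodic channels and abstracted only at query time. It is an illustration under stated assumptions, not FileGramOS itself: the names `FileAction`, `TraceMemory`, `content_delta`, `record`, and `query`, the action schema, and the choice of counters as channel stores are all hypothetical, and the sketch handles text content only.

```python
import difflib
from collections import Counter
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class FileAction:
    """One atomic file-system action (hypothetical schema, text files only)."""
    timestamp: float
    op: str            # e.g. "create", "edit", "rename", "delete"
    path: str
    before: str = ""   # file content before the action
    after: str = ""    # file content after the action


def content_delta(before: str, after: str) -> List[str]:
    """Return the lines an action added, taken from a line-level diff."""
    diff = difflib.ndiff(before.splitlines(), after.splitlines())
    return [line[2:] for line in diff if line.startswith("+ ")]


@dataclass
class TraceMemory:
    """Toy three-channel store; the real architecture may differ entirely."""
    procedural: Counter = field(default_factory=Counter)  # op-to-op transition counts
    semantic: Counter = field(default_factory=Counter)    # word counts from added lines
    episodic: List[Tuple[float, str, str]] = field(default_factory=list)
    _last_op: Optional[str] = None

    def record(self, action: FileAction) -> None:
        """Store one raw action in all three channels without summarizing it."""
        if self._last_op is not None:
            self.procedural[(self._last_op, action.op)] += 1
        self._last_op = action.op
        for line in content_delta(action.before, action.after):
            self.semantic.update(line.lower().split())
        self.episodic.append((action.timestamp, action.op, action.path))

    def query(self, top_k: int = 3) -> dict:
        """Abstract the stored traces only when a profile is requested."""
        return {
            "common_transitions": self.procedural.most_common(top_k),
            "frequent_terms": self.semantic.most_common(top_k),
            "recent_events": self.episodic[-top_k:],
        }


memory = TraceMemory()
memory.record(FileAction(1.0, "create", "notes/todo.md", "", "draft budget\n"))
memory.record(FileAction(2.0, "edit", "notes/todo.md",
                         "draft budget\n", "draft budget\nreview budget\n"))
profile = memory.query()
```

The point of the design is that `record` keeps raw traces and defers any summarization to `query`, which mirrors the contrast the abstract draws with profiles built from dialogue summaries.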

Data Curation & Synthetic Data, Tool Use & Agents

Citation Metrics

Citations: 0

Influential citations: 0

References: 48

Year: 2026

Venue: N/A
