CU BoulderApr 6, 2026arXiv:2604.04892

Data Attribution in Adaptive Learning

AI Summary

This paper formalizes occurrence-level data attribution for adaptive learning settings, where training data distribution shifts as the model learns. It demonstrates that standard attribution methods fail in these settings and that replay-side information is insufficient for recovering the true attribution. The authors then identify a structural class of adaptive learning problems where the desired attribution target *is* identifiable from logged data.

Key Contribution

Standard data attribution methods break down in adaptive learning scenarios, but this work identifies conditions under which you *can* still recover meaningful attributions from logged data.

Abstract

Machine learning models increasingly generate their own training data -- online bandits, reinforcement learning, and post-training pipelines for language models are leading examples. In these adaptive settings, a single training observation both updates the learner and shifts the distribution of future data the learner will collect. Standard attribution methods, designed for static datasets, ignore this feedback. We formalize occurrence-level attribution for finite-horizon adaptive learning via a conditional interventional target, prove that replay-side information cannot recover it in general, and identify a structural class in which the target is identified from logged data.

Data Curation & Synthetic Data Interpretability & Mechanistic Interp

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Data Attribution in Adaptive Learning

Related Papers