Search papers, labs, and topics across Lattice.
This paper formalizes occurrence-level data attribution for adaptive learning settings, where training data distribution shifts as the model learns. It demonstrates that standard attribution methods fail in these settings and that replay-side information is insufficient for recovering the true attribution. The authors then identify a structural class of adaptive learning problems where the desired attribution target *is* identifiable from logged data.
Standard data attribution methods break down in adaptive learning scenarios, but this work identifies conditions under which you *can* still recover meaningful attributions from logged data.
Machine learning models increasingly generate their own training data -- online bandits, reinforcement learning, and post-training pipelines for language models are leading examples. In these adaptive settings, a single training observation both updates the learner and shifts the distribution of future data the learner will collect. Standard attribution methods, designed for static datasets, ignore this feedback. We formalize occurrence-level attribution for finite-horizon adaptive learning via a conditional interventional target, prove that replay-side information cannot recover it in general, and identify a structural class in which the target is identified from logged data.