NVIDIA | Apr 6, 2026 | arXiv:2604.04681

Batch Loss Score for Dynamic Data Pruning

Qing Zhou, Bingxuan Zhao, Hongyuan Zhang

AI Summary

This paper introduces Batch Loss Score (BLS), a computationally efficient method for dynamic data pruning that approximates per-sample loss by applying an Exponential Moving Average (EMA) to batch losses. By framing batch loss as a noisy measurement of individual sample loss, the authors theoretically ground BLS as a low-pass filter that attenuates noise from stochastic batch composition. Experiments across 14 datasets, 11 tasks, and 18 models demonstrate that BLS can enhance existing pruning methods to losslessly prune 20%-50% of samples with minimal code integration effort.
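The low-pass reading can be made concrete with the standard EMA recursion. The notation below is assumed for illustration and may differ from the paper's: $s_i^{(t)}$ is the score of sample $i$ after its $t$-th visit, $\bar{\ell}^{(t)}$ is the loss of the batch containing it at that visit, and $\beta \in [0, 1)$ is the decay factor.

```latex
% EMA update and its unrolled form (exponentially decaying weights on past batch losses)
s_i^{(t)} = \beta\, s_i^{(t-1)} + (1-\beta)\, \bar{\ell}^{(t)}
\quad\Longrightarrow\quad
s_i^{(t)} = (1-\beta) \sum_{k=0}^{t-1} \beta^{k}\, \bar{\ell}^{(t-k)} + \beta^{t}\, s_i^{(0)}

% Transfer function of this recursion: a first-order low-pass filter
H(z) = \frac{1-\beta}{1-\beta z^{-1}},
\qquad
\bigl|H(e^{j\omega})\bigr| = \frac{1-\beta}{\sqrt{1 - 2\beta\cos\omega + \beta^{2}}}
```

The gain is 1 at $\omega = 0$ and falls to $(1-\beta)/(1+\beta)$ at $\omega = \pi$, so the slowly varying part of the batch loss passes through while rapid fluctuations, such as those caused by changing batch composition, are attenuated.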

Key Contribution

Instead of computing per-sample losses, a three-line code injection smooths readily available batch losses with an EMA, allowing 20%-50% of training samples to be pruned without loss of performance.

Abstract

Dynamic data pruning accelerates deep learning by selectively omitting less informative samples during training. While per-sample loss is a common importance metric, obtaining it can be challenging or infeasible for complex models or loss functions, often requiring significant implementation effort. This work proposes the Batch Loss Score (BLS), a computationally efficient alternative using an Exponential Moving Average (EMA) of readily available batch losses to assign scores to individual samples. We frame the batch loss, from the perspective of a single sample, as a noisy measurement of its scaled individual loss, with noise originating from stochastic batch composition. It is formally shown that the EMA mechanism functions as a first-order low-pass filter, attenuating high-frequency batch composition noise. This yields a score approximating the smoothed and persistent contribution of the individual sample to the loss, providing a theoretical grounding for BLS as a proxy for sample importance. BLS demonstrates remarkable code integration simplicity (three-line injection) and readily adapts existing per-sample loss-based methods (one-line proxy). Its effectiveness is demonstrated by enhancing two such methods to losslessly prune 20%-50% of samples across 14 datasets, 11 tasks and 18 models, highlighting its utility and broad applicability, especially for complex scenarios where per-sample loss is difficult to access. Code is available at https://github.com/mrazhou/BLS.
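The scoring idea described above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the class name `BatchLossScore`, the decay `beta=0.9`, the first-visit initialisation, and the "drop the lowest-scoring fraction" rule are all hypothetical choices made for the example. The toy data gives every sample a fixed true loss, while the scorer only ever sees the scalar batch mean.

```python
import numpy as np


class BatchLossScore:
    """Hypothetical helper that keeps one EMA score per training sample."""

    def __init__(self, num_samples, beta=0.9):
        self.beta = beta  # EMA decay; 0.9 is an assumed value
        self.scores = np.zeros(num_samples)
        self.seen = np.zeros(num_samples, dtype=bool)

    def update(self, indices, batch_loss):
        """Fold one scalar batch loss into the score of every sample in the batch."""
        idx = np.asarray(indices)
        ema = self.beta * self.scores[idx] + (1.0 - self.beta) * batch_loss
        # Assumption: a sample's first score is simply the first batch loss it sees.
        self.scores[idx] = np.where(self.seen[idx], ema, batch_loss)
        self.seen[idx] = True

    def keep_indices(self, prune_ratio):
        """Illustrative rule only: drop the lowest-scoring fraction of samples."""
        n_drop = int(prune_ratio * len(self.scores))
        return np.argsort(self.scores)[n_drop:]


# Toy check: 32 "easy" samples (loss 0.1) followed by 32 "hard" samples (loss 2.0).
rng = np.random.default_rng(0)
true_loss = np.concatenate([np.full(32, 0.1), np.full(32, 2.0)])
bls = BatchLossScore(num_samples=64)

for epoch in range(50):
    order = rng.permutation(64)  # reshuffle so batch composition changes each epoch
    for start in range(0, 64, 8):
        batch = order[start:start + 8]
        batch_loss = true_loss[batch].mean()  # the only loss the scorer sees
        bls.update(batch, batch_loss)

easy_mean = bls.scores[:32].mean()
hard_mean = bls.scores[32:].mean()
kept = bls.keep_indices(prune_ratio=0.25)
print(easy_mean, hard_mean, len(kept))
```

In this toy setting the hard samples end up with a higher average score than the easy ones, even though no per-sample loss is ever computed. In a real training loop, the `update` call would sit right after the loss computation and would need the dataset indices of the samples in the current batch.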

Data Curation & Synthetic Data | Inference & Quantization | Training Efficiency & Optimization

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
