Search papers, labs, and topics across Lattice.
This paper investigates the real-world performance of machine learning-based anti-money laundering (AML) systems in cryptocurrency, specifically Bitcoin. It demonstrates that static classification metrics are poor indicators of actual regulatory effectiveness due to temporal non-stationarity in transaction data. The study reveals that fixed enforcement policies lead to significant and persistent excess regulatory losses compared to dynamically optimized benchmarks, primarily due to miscalibration of decision rules.
ML-based crypto AML systems are more fragile than their static metrics suggest, losing significant effectiveness due to temporal shifts in transaction patterns and miscalibrated decision rules.
We study the deployment performance of machine learning based enforcement systems used in cryptocurrency anti money laundering (AML). Using forward looking and rolling evaluations on Bitcoin transaction data, we show that strong static classification metrics substantially overstate real world regulatory effectiveness. Temporal nonstationarity induces pronounced instability in cost sensitive enforcement thresholds, generating large and persistent excess regulatory losses relative to dynamically optimal benchmarks. The core failure arises from miscalibration of decision rules rather than from declining predictive accuracy per se. These findings underscore the fragility of fixed AML enforcement policies in evolving digital asset markets and motivate loss-based evaluation frameworks for regulatory oversight.