Search papers, labs, and topics across Lattice.
This paper introduces QRisk, a framework to identify and mitigate execution-dependent error patterns on NISQ devices that are missed by standard calibration-based noise models. QRisk uses delta debugging to isolate problematic circuit fragments exhibiting excess error and validates their persistence. By applying commuting gate swaps to disrupt these patterns during compilation, QRisk achieves a 24-45% reduction in excess hardware noise on IBM backends.
Quantum compilers are blind to recurring error patterns that can be fixed with simple gate swaps, leading to a 45% performance boost on real hardware.
Quantum compilers rely on calibration-derived noise models to guide circuit mapping and optimization. These models characterize gate and qubit errors independently and miss context-dependent effects such as crosstalk and correlated scheduling errors. As a result, two compiled circuits that score equally under the noise model can behave very differently on real hardware, and the compiler has no mechanism to learn from such recurring mismatches. We present QRisk, a framework that discovers backend-specific abnormal patterns from real hardware executions. QRisk uses delta debugging to isolate compact circuit fragments that consistently produce excess error not predicted by the noise model, then validates their persistence across repeated runs and calibration windows. The verified patterns are stored in a backend-specific pattern database. At compilation time, QRisk scans a compiled circuit for occurrences of known patterns and applies targeted commuting gate swaps to disrupt them, producing a semantically equivalent circuit with fewer abnormal patterns. We evaluate QRisk on two IBM backends (ibm_fez and ibm_marrakesh) using Grover search circuits. On both backends, discovered patterns persist across multiple calibration windows over months. Disrupting these patterns via commuting gate swaps reduces excess hardware noise by 24% on ibm_fez (Spearman $蟻$ = 0.515, p = 0.0007) and 45% on ibm_marrakesh ($蟻$ = 0.711, p < 0.0001), while the noise model predicts identical error for all equivalent circuits. Testing on a third backend confirms that these patterns are backend-specific.