Search papers, labs, and topics across Lattice.
The RARE25 challenge introduces a large-scale, prevalence-aware benchmark for computer-aided detection (CADe) of early neoplasia in Barrett's esophagus, a low-prevalence surveillance problem. Eleven teams developed CADe systems using diverse architectures and training strategies, evaluated on a hidden test set reflecting real-world incidence. Results showed strong discriminative performance but low positive predictive values, highlighting the challenges of low-prevalence detection and the need for prevalence-agnostic approaches.
Current CADe systems excel at identifying early neoplasia in Barrett's esophagus on balanced datasets, but their clinical utility plummets in real-world, low-prevalence settings, exposing a critical gap in prevalence-aware design.
Computer-aided detection (CADe) of early neoplasia in Barrett's esophagus is a low-prevalence surveillance problem in which clinically relevant findings are rare. Although many CADe systems report strong performance on balanced or enriched datasets, their behavior under realistic prevalence remains insufficiently characterized. The RARE25 challenge addresses this gap by introducing a large-scale, prevalence-aware benchmark for neoplasia detection. It includes a public training set and a hidden test set reflecting real-world incidence. Methods were evaluated using operating-point-specific metrics emphasizing high sensitivity and accounting for prevalence. Eleven teams from seven countries submitted approaches using diverse architectures, pretraining, ensembling, and calibration strategies. While several methods achieved strong discriminative performance, positive predictive values remained low, highlighting the difficulty of low-prevalence detection and the risk of overestimating clinical utility when prevalence is ignored. All methods relied on fully supervised classification despite the dominance of normal findings, indicating a lack of prevalence-agnostic approaches such as anomaly detection or one-class learning. By releasing a public dataset and a reproducible evaluation framework, RARE25 aims to support the development of CADe systems robust to prevalence shift and suitable for clinical surveillance workflows.