The CXR-LT 2026 challenge was introduced to address limitations of existing chest X-ray benchmarks by providing a multi-center dataset with long-tailed pathology distributions and out-of-distribution rare disease classes. The challenge defined two tasks: robust multi-label classification on 30 known classes and open-world generalization to 6 unseen classes, using a dataset of over 145,000 images from the PadChest and NIH Chest X-ray datasets. The winning solutions achieved an mAP of 0.5854 on Task 1 and 0.4315 on Task 2, showing that large-scale vision-language pre-training significantly mitigates the performance drop typically associated with zero-shot diagnosis.
Vision-language pre-training narrows the gap in zero-shot chest X-ray diagnosis, nearly doubling mAP relative to previous benchmarks on rare diseases.
Chest X-ray (CXR) interpretation is hindered by the long-tailed distribution of pathologies and the open-world nature of clinical environments. Existing benchmarks often rely on closed-set classes from single institutions, failing to capture the prevalence of rare diseases or the appearance of novel findings. To address this, we present the CXR-LT 2026 challenge. This third iteration of the benchmark introduces a multi-center dataset comprising over 145,000 images from the PadChest and NIH Chest X-ray datasets. The challenge defines two core tasks: (1) Robust Multi-Label Classification on 30 known classes and (2) Open-World Generalization to 6 unseen (out-of-distribution) rare disease classes. We report the results of the top-performing teams, evaluating them via mean Average Precision (mAP), AUROC, and F1-score. The winning solutions achieved an mAP of 0.5854 on Task 1 and 0.4315 on Task 2, demonstrating that large-scale vision-language pre-training significantly mitigates the performance drop typically associated with zero-shot diagnosis.
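As an illustration of the headline metric, the sketch below computes a macro-averaged mAP for multi-label predictions: the average precision (AP) of each class is computed separately, then averaged with equal weight so that rare classes count as much as common ones. It is a minimal sketch and not the challenge's official scoring code: the function names and the toy data are hypothetical, and the tie-breaking rule and the error raised for classes without positives are assumptions that may differ from the official evaluation.

```python
from typing import List, Sequence


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AP for one class: mean of precision@k over the ranks k of the positives."""
    # Rank samples by descending score. Python's sort is stable, so ties keep
    # their original order (assumption: the official scorer may treat ties differently).
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    hits = 0
    precision_sum = 0.0
    for rank, idx in enumerate(order, start=1):
        if labels[idx] == 1:
            hits += 1
            precision_sum += hits / rank  # precision at this positive's rank
    if hits == 0:
        raise ValueError("AP is undefined for a class with no positive samples")
    return precision_sum / hits


def mean_average_precision(
    y_true: List[List[int]], y_score: List[List[float]]
) -> float:
    """Macro mAP: unweighted mean of per-class AP across the label columns."""
    n_classes = len(y_true[0])
    per_class = []
    for c in range(n_classes):
        labels = [row[c] for row in y_true]
        scores = [row[c] for row in y_score]
        per_class.append(average_precision(labels, scores))
    return sum(per_class) / n_classes


if __name__ == "__main__":
    # Toy data (hypothetical): 4 images, 2 classes.
    y_true = [[1, 0], [0, 1], [1, 0], [0, 0]]
    y_score = [[0.9, 0.2], [0.8, 0.7], [0.3, 0.1], [0.1, 0.6]]
    print(f"mAP = {mean_average_precision(y_true, y_score):.4f}")
```

In the toy data, class 0 has positives at ranks 1 and 3, giving AP = (1/1 + 2/3) / 2, and class 1 has its single positive at rank 1, giving AP = 1. Because the average is unweighted, a class with a handful of positives moves the score as much as a common finding does, which suits a long-tailed label distribution.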