Search papers, labs, and topics across Lattice.
This paper investigates the use of deep learning, specifically ResNet-34, for classifying the overlap between the mandibular third molar and the mandibular canal in panoramic radiographs, comparing local, federated, and centralized learning approaches. The study aims to automate this classification to aid clinical triage and reduce unnecessary CBCT referrals. Results show that centralized learning achieved the best performance (AUC 0.831), federated learning offered intermediate performance (AUC 0.757), and local learning generalized poorly.
Centralized deep learning beats federated learning for classifying molar-canal overlap in dental radiographs, but federated learning still offers a privacy-preserving boost over local training.
Impaction of the mandibular third molar in proximity to the mandibular canal increases the risk of inferior alveolar nerve injury. Panoramic radiography is routinely used to assess this relationship. Automated classification of molar-canal overlap could support clinical triage and reduce unnecessary CBCT referrals, while federated learning (FL) enables multi-center collaboration without sharing patient data. We compared Local Learning (LL), FL, and Centralized Learning (CL) for binary overlap/no-overlap classification on cropped panoramic radiographs partitioned across eight independent labelers. A pretrained ResNet-34 was trained under each paradigm and evaluated using per-client metrics with locally optimized thresholds and pooled test performance with a global threshold. Performance was assessed using area under the receiver operating characteristic curve (AUC) and threshold-based metrics, alongside training dynamics, Grad-CAM visualizations, and server-side aggregate monitoring signals. On the test set, CL achieved the highest performance (AUC 0.831; accuracy = 0.782), FL showed intermediate performance (AUC 0.757; accuracy = 0.703), and LL generalized poorly across clients (AUC range = 0.619-0.734; mean = 0.672). Training curves suggested overfitting, particularly in LL models, and Grad-CAM indicated more anatomically focused attention in CL and FL. Overall, centralized training provided the strongest performance, while FL offers a privacy-preserving alternative that outperforms LL.