This paper introduces Hypothesis-Driven Test-Time Adaptation (HD-TTA) for brain tumor segmentation. It targets a safety problem in standard TTA methods, which often let the tumor mask spill into healthy tissue or degrade predictions that were already correct. HD-TTA generates two competing geometric hypotheses (compaction vs. inflation), uses a representation-guided selector to choose the safer outcome based on texture consistency, and adds a gatekeeper that prevents negative transfer. In cross-domain experiments (a model trained on adult gliomas, applied to pediatric tumors and meningiomas), HD-TTA lowers the 95th-percentile Hausdorff Distance (HD95) and raises Precision, indicating safer segmentations, while keeping Dice scores comparable.
Stop blindly optimizing at test-time: HD-TTA uses competing geometric hypotheses and representation-guided selection to make brain tumor segmentation safer.
Standard Test-Time Adaptation (TTA) methods typically treat inference as a blind optimization task, applying generic objectives to all or filtered test samples. In safety-critical medical segmentation, this lack of selectivity often causes the tumor mask to spill into healthy brain tissue or degrades predictions that were already correct. We propose Hypothesis-Driven TTA (HD-TTA), a novel framework that reformulates adaptation as a dynamic decision process. Rather than forcing a single optimization trajectory, our method generates two intuitive, competing geometric hypotheses: compaction (is the prediction noisy? trim artifacts) versus inflation (is the valid tumor under-segmented? safely inflate to recover). It then employs a representation-guided selector to autonomously identify the safest outcome based on intrinsic texture consistency. Additionally, a pre-screening Gatekeeper prevents negative transfer by skipping adaptation on confident cases. We validate this proof-of-concept on a cross-domain binary brain tumor segmentation task, applying a source model trained on adult BraTS gliomas to unseen pediatric and more challenging meningioma target domains. In the challenging safety regime, HD-TTA improves safety-oriented outcomes, namely Hausdorff Distance (HD95) and Precision, over several representative state-of-the-art baselines, reducing HD95 by approximately 6.4 mm and improving Precision by over 4%, while maintaining comparable Dice scores. These results demonstrate that resolving the safety-adaptation trade-off via explicit hypothesis selection is a viable, robust path for safe clinical model deployment. Code will be made publicly available upon acceptance.
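The decision flow described in the abstract (gatekeeper, two competing hypotheses, selector) can be illustrated with a small toy sketch. This is not the authors' implementation, which has not been released: the abstract describes test-time optimization of a model, whereas the sketch below swaps in simple morphological operations on a 2D binary mask so that it stays self-contained. Every name in it (`hd_tta_sketch`, `consistency_score`, `conf_threshold`), the mean-confidence gatekeeper rule, and the contrast-minus-spread scoring rule are assumptions chosen for illustration, and raw image intensity stands in for the learned representation.

```python
import numpy as np

def _neighbourhood(mask):
    """Stack the mask with its four axis-aligned neighbours (zero-padded)."""
    p = np.pad(mask, 1, constant_values=False)
    return np.stack([
        p[1:-1, 1:-1],  # centre
        p[:-2, 1:-1],   # pixel above
        p[2:, 1:-1],    # pixel below
        p[1:-1, :-2],   # pixel to the left
        p[1:-1, 2:],    # pixel to the right
    ])

def dilate(mask):
    """Grow the mask by one pixel (4-connectivity)."""
    return _neighbourhood(mask).any(axis=0)

def erode(mask):
    """Shrink the mask by one pixel (4-connectivity)."""
    return _neighbourhood(mask).all(axis=0)

def consistency_score(mask, features):
    """Hypothetical texture-consistency score (an assumption, not the paper's).

    Rewards masks whose interior is homogeneous (low spread) and differs
    from the one-pixel ring just outside the boundary (high contrast).
    """
    ring = dilate(mask) & ~mask
    if not mask.any() or not ring.any():
        return float("-inf")
    inside = features[mask]
    contrast = abs(inside.mean() - features[ring].mean())
    return contrast - inside.std()

def hd_tta_sketch(probs, features, conf_threshold=0.95):
    """Return (mask, decision) for one 2D slice.

    probs    : foreground probabilities from the source model, shape (H, W)
    features : per-pixel feature map; raw intensity is used as a stand-in
    """
    mask = probs > 0.5

    # Gatekeeper (assumed rule): skip adaptation when the model is confident.
    confidence = np.maximum(probs, 1.0 - probs).mean()
    if confidence >= conf_threshold:
        return mask, "skip"

    # Competing geometric hypotheses.
    hypotheses = {
        "compaction": dilate(erode(mask)),  # morphological opening trims specks
        "inflation": dilate(mask),          # grows an under-segmented region
    }

    # Selector: keep the hypothesis with the most consistent texture.
    scores = {name: consistency_score(m, features) for name, m in hypotheses.items()}
    best = max(scores, key=scores.get)
    if scores[best] == float("-inf"):
        return mask, "skip"  # no usable hypothesis; leave the prediction alone
    return hypotheses[best], best
```

On a synthetic slice with a bright square "tumor", an under-segmented prediction is routed to inflation, a prediction with an isolated false-positive pixel is routed to compaction, and a confident prediction is left untouched. A real system would replace the intensity map with encoder features and the morphological steps with the actual adaptation objectives.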