This paper addresses the challenge of diagnosing deep intracranial tumors from MRI scans, where data scarcity and small lesion volumes hinder accurate pathology prediction. The authors introduce the ICT-MRI dataset, a biopsy-verified benchmark of 249 cases, and a Virtual Biopsy framework consisting of an MRI-Processor, a Tumor-Localizer that uses vision-language models, and an Adaptive-Diagnoser with Masked Channel Attention. The proposed framework achieves over 90% accuracy in tumor diagnosis, outperforming existing baselines by more than 20%.
A novel "Virtual Biopsy" framework achieves over 90% accuracy in non-invasive intracranial tumor diagnosis from MRI, potentially reducing the need for stereotactic biopsies, which carry surgical risk and are prone to sampling bias.
Deep intracranial tumors situated in eloquent brain regions controlling vital functions present critical diagnostic challenges. Clinical practice has shifted toward stereotactic biopsy for pathological confirmation before treatment. Yet biopsy carries inherent risks of hemorrhage and neurological deficits, and it suffers from sampling bias caused by tumor spatial heterogeneity: pathological changes are typically region-selective rather than tumor-wide. Therefore, advancing non-invasive MRI-based pathology prediction is essential for holistic tumor assessment and modern clinical decision-making. The primary challenge lies in data scarcity: low tumor incidence requires long collection cycles, and annotation demands biopsy-verified pathology from neurosurgical experts. Additionally, tiny lesion volumes lacking segmentation masks cause critical features to be overwhelmed by background noise. To address these challenges, we construct the ICT-MRI dataset, the first public biopsy-verified benchmark with 249 cases across four categories. We propose a Virtual Biopsy framework comprising: MRI-Processor for standardization; Tumor-Localizer employing vision-language models for coarse-to-fine localization via weak supervision; and Adaptive-Diagnoser with a Masked Channel Attention mechanism fusing local discriminative features with global contexts. Experiments demonstrate over 90% accuracy, outperforming baselines by more than 20%.
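
The abstract does not spell out how Masked Channel Attention combines the two feature streams, so the following is only an illustrative sketch of the general idea of gating local (tumor-region) features against global (whole-scan) context on a per-channel basis. The function name `masked_channel_fusion`, the per-channel `weights`, the binary `mask`, and the sigmoid gate on the local-global difference are all assumptions made for this example and should not be read as the paper's formulation.

```python
import math
from typing import List, Sequence


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def masked_channel_fusion(
    local_feat: Sequence[float],
    global_feat: Sequence[float],
    weights: Sequence[float],
    mask: Sequence[int],
) -> List[float]:
    """Fuse per-channel local and global descriptors (hypothetical scheme).

    For each channel, a gate in [0, 1] decides how much of the local
    feature to keep. Channels with mask == 0 have their gate forced to 0,
    so they fall back entirely to the global feature.
    """
    if not (len(local_feat) == len(global_feat) == len(weights) == len(mask)):
        raise ValueError("all inputs must have one entry per channel")

    fused = []
    for l, g, w, m in zip(local_feat, global_feat, weights, mask):
        # Assumed gate: sigmoid of a weighted local-global difference.
        gate = sigmoid(w * (l - g)) if m else 0.0
        fused.append(gate * l + (1.0 - gate) * g)
    return fused


# Channel 0 is unmasked with zero weight (gate 0.5, so it averages the two);
# channel 1 is masked (the output equals the global feature).
example = masked_channel_fusion([2.0, 4.0], [0.0, 1.0], [0.0, 5.0], [1, 0])
```

In a trained model the weights would be learned and the descriptors would come from pooled feature maps; here they are plain lists so the gating arithmetic stays visible.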