Search papers, labs, and topics across Lattice.
This paper introduces a multimodal deep learning framework for predicting major pathological response (pR) in non-small cell lung cancer (NSCLC) following neoadjuvant therapy. The framework integrates CT image features extracted using foundation models with a missing-aware architecture for handling incomplete clinical variables, avoiding imputation. Experiments demonstrate that the multimodal model, using a weighted fusion mechanism, outperforms unimodal baselines in predicting pR from limited datasets.
A new multimodal deep learning model accurately predicts cancer treatment response from incomplete patient data, even when key clinical information is missing.
Major pathological response (pR) following neoadjuvant therapy is a clinically meaningful endpoint in non-small cell lung cancer, strongly associated with improved survival. However, accurate preoperative prediction of pR remains challenging, particularly in real-world clinical settings characterized by limited data availability and incomplete clinical profiles. In this study, we propose a multimodal deep learning framework designed to address these constraints by integrating foundation model-based CT feature extraction with a missing-aware architecture for clinical variables. This approach enables robust learning from small cohorts while explicitly modeling missing clinical information, without relying on conventional imputation strategies. A weighted fusion mechanism is employed to leverage the complementary contributions of imaging and clinical modalities, yielding a multimodal model that consistently outperforms both unimodal imaging and clinical baselines. These findings underscore the added value of integrating heterogeneous data sources and highlight the potential of multimodal, missing-aware systems to support pR prediction under realistic clinical conditions.