NUS · Mar 2, 2026 · arXiv:2603.01690

QIME: Constructing Interpretable Medical Text Embeddings via Ontology-Grounded Questions

Yixuan Tang, Zheng-Lin Lin, Yandong Sun, W. Hsu, Mong Li Lee, A. K. Tung

AI Summary

The paper introduces QIME, a framework for generating interpretable medical text embeddings in which each dimension represents a clinically meaningful yes/no question derived from medical ontologies. QIME conditions on cluster-specific medical concept signatures to generate semantically atomic questions, enabling fine-grained distinctions in biomedical text. The framework supports a training-free embedding construction strategy, outperforms prior interpretable methods, and substantially narrows the gap to black-box encoders on biomedical NLP tasks.

Key Contribution

QIME produces interpretable medical text embeddings that substantially narrow the performance gap to black-box biomedical encoders, using ontology-grounded question generation and a training-free embedding construction strategy.

Abstract

While dense biomedical embeddings achieve strong performance, their black-box nature limits their utility in clinical decision-making. Recent question-based interpretable embeddings represent text as binary answers to natural-language questions, but these approaches often rely on heuristic or surface-level contrastive signals and overlook specialized domain knowledge. We propose QIME, an ontology-grounded framework for constructing interpretable medical text embeddings in which each dimension corresponds to a clinically meaningful yes/no question. By conditioning on cluster-specific medical concept signatures, QIME generates semantically atomic questions that capture fine-grained distinctions in biomedical text. Furthermore, QIME supports a training-free embedding construction strategy that eliminates per-question classifier training while further improving performance. Experiments across biomedical semantic similarity, clustering, and retrieval benchmarks show that QIME consistently outperforms prior interpretable embedding methods and substantially narrows the gap to strong black-box biomedical encoders, while providing concise and clinically informative explanations.
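The core idea of a question-based embedding, where each dimension holds the answer to one yes/no question, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the three questions, the keyword lists, and the helper names `embed`, `cosine`, and `explain` are hypothetical, and the keyword matcher is only a toy stand-in for whatever component answers the questions. It is not QIME's ontology-grounded question generation or its training-free construction strategy.

```python
from math import sqrt

# Hypothetical yes/no questions; each one becomes a single embedding dimension.
# The keyword lists are a toy stand-in for a real question-answering component.
QUESTIONS = [
    ("Does the text mention a cardiovascular condition?",
     ["myocardial", "hypertension", "cardiac"]),
    ("Does the text mention an infection?",
     ["sepsis", "pneumonia", "infection"]),
    ("Does the text describe a drug treatment?",
     ["aspirin", "antibiotic", "statin"]),
]


def embed(text):
    """Return a binary vector: 1 if the toy matcher answers 'yes', else 0."""
    lowered = text.lower()
    return [
        1 if any(keyword in lowered for keyword in keywords) else 0
        for _, keywords in QUESTIONS
    ]


def cosine(a, b):
    """Cosine similarity between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def explain(a, b):
    """List the questions that both texts answer 'yes' to."""
    return [
        question
        for (question, _), x, y in zip(QUESTIONS, a, b)
        if x == 1 and y == 1
    ]


doc_a = embed("Patient with hypertension started on a statin.")
doc_b = embed("Pneumonia treated with an antibiotic.")
similarity = cosine(doc_a, doc_b)
shared = explain(doc_a, doc_b)
```

Because every dimension is tied to a readable question, the similarity between two texts can be traced back to the questions they share, which is what `explain` returns in this sketch.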

Interpretability & Mechanistic Interp · Natural Language Processing · Scientific Discovery & Drug Design

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
