Apr 23, 2026arXiv:2604.21690

Evaluating Post-hoc Explanations of the Transformer-based Genome Language Model DNABERT-2

Isabel Kurth, Paulo Yanez Sarmiento, Bernhard Y. Renard

AI Summary

This paper adapts AttnLRP, an attention-based extension of layer-wise relevance propagation, to explain the predictions of DNABERT-2, a state-of-the-art genome language model. They propose strategies to transfer explanations from token and nucleotide level. Through extensive comparisons with CNN explanations on genomic datasets, the authors demonstrate that AttnLRP applied to DNABERT-2 yields reliable explanations that align with known biological patterns.

Key Contribution

Despite their architectural differences, Transformer-based genome language models can provide equally reliable biological insights as CNNs, as revealed by attention-based explainability methods.

Abstract

Explaining deep neural network predictions on genome sequences enables biological insight and hypothesis generation-often of greater interest than predictive performance alone. While explanations of convolutional neural networks (CNNs) have been shown to capture relevant patterns in genome sequences, it is unclear whether this transfers to more expressive Transformer-based genome language models (gLMs). To answer this question, we adapt AttnLRP, an extension of layer-wise relevance propagation to the attention mechanism, and apply it to the state-of-the-art gLM DNABERT-2. Thereby, we propose strategies to transfer explanations from token and nucleotide level. We evaluate the adaption of AttnLRP on genomic datasets using multiple metrics. Further, we provide an extensive comparison between the explanations of DNABERT-2 and a baseline CNN. Our results demonstrate that AttnLRP yields reliable explanations corresponding to known biological patterns. Hence, like CNNs, gLMs can also help derive biological insights. This work contributes to the explainability of gLMs and addresses the comparability of relevance attributions across different architectures.

Architecture Design (Transformers, SSMs, MoE)Interpretability & Mechanistic Interp Scientific Discovery & Drug Design

Citation Metrics

Citations0

Influential citations0

References32

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Evaluating Post-hoc Explanations of the Transformer-based Genome Language Model DNABERT-2

Related Papers