Artificial Intelligence LabInterdisciplinary Research Center for SmartFeb 16, 2026arXiv:2602.15139

CGRA-DeBERTa Concept Guided Residual Augmentation Transformer for Theologically Islamic Understanding

AI Summary

The paper introduces CGRA-DeBERTa, a concept-guided residual domain augmentation transformer, to improve question answering over Hadith corpora by incorporating theological priors. It customizes DeBERTa with LoRA adaptations and a residual concept-aware gating mechanism that selectively amplifies semantically critical tokens based on an Islamic Concept Dictionary. Experiments on a dataset of 42591 QA pairs from Sahih al-Bukhari and Sahih Muslim demonstrate that CGRA-DeBERTa achieves a 97.85 EM score, surpassing BERT and DeBERTa by a significant margin while maintaining computational efficiency.

Key Contribution

Injecting domain-specific theological concepts into a DeBERTa model boosts question answering accuracy on Islamic texts by 8%, proving that targeted knowledge infusion beats brute-force scaling.

Abstract

Accurate QA over classical Islamic texts remains challenging due to domain specific semantics, long context dependencies, and concept sensitive reasoning. Therefore, a new CGRA DeBERTa, a concept guided residual domain augmentation transformer framework, is proposed that enhances theological QA over Hadith corpora. The CGRA DeBERTa builds on a customized DeBERTa transformer backbone with lightweight LoRA based adaptations and a residual concept aware gating mechanism. The customized DeBERTa embedding block learns global and positional context, while Concept Guided Residual Blocks incorporate theological priors from a curated Islamic Concept Dictionary of 12 core terms. Moreover, the Concept Gating Mechanism selectively amplifies semantically critical tokens via importance weighted attention, applying differential scaling from 1.04 to 3.00. This design preserves contextual integrity, strengthens domain-specific semantic representations, and enables accurate, efficient span extraction while maintaining computational efficiency. This paper reports the results of training CGRA using a specially constructed dataset of 42591 QA pairs from the text of Sahih alBukhari and Sahih Muslim. While BERT achieved an EM score of 75.87 and DeBERTa one of 89.77, our model scored 97.85 and thus surpassed them by 8.08 on an absolute scale, all while adding approximately 8 inference overhead due to parameter efficient gating. The qualitative evaluation noted better extraction and discrimination and theological precision. This study presents Hadith QA systems that are efficient, interpretable, and accurate and that scale provide educational materials with necessary theological nuance.

Architecture Design (Transformers, SSMs, MoE)Data Curation & Synthetic Data Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

CGRA-DeBERTa Concept Guided Residual Augmentation Transformer for Theologically Islamic Understanding

Related Papers