Hankuk University of Foreign StudiesApr 6, 2026arXiv:2604.04403

MolDA: Molecular Understanding and Generation via Large Language Diffusion Model

Seohyeon Shin, Hanjun Choi, Jun-Hyung Park, Hongkook Kim, Mansu Kim

AI Summary

MolDA replaces the autoregressive backbone in molecular LLMs with a discrete Large Language Diffusion Model to improve chemical validity and structural coherence. It uses a hybrid graph encoder to capture local and global molecular topologies, aligning them with language tokens via a Q-Former. By reformulating Molecular Structure Preference Optimization for masked diffusion, MolDA achieves state-of-the-art results in molecule generation, captioning, and property prediction.

Key Contribution

Ditching the standard autoregressive approach lets molecular LLMs generate more chemically valid and structurally coherent molecules.

Abstract

Large Language Models (LLMs) have significantly advanced molecular discovery, but existing multimodal molecular architectures fundamentally rely on autoregressive (AR) backbones. This strict left-to-right inductive bias is sub-optimal for generating chemically valid molecules, as it struggles to account for non-local global constraints (e.g., ring closures) and often accumulates structural errors during sequential generation. To address these limitations, we propose MolDA (Molecular language model with masked Diffusion with mAsking), a novel multimodal framework that replaces the conventional AR backbone with a discrete Large Language Diffusion Model. MolDA extracts comprehensive structural representations using a hybrid graph encoder, which captures both local and global topologies, and aligns them into the language token space via a Q-Former. Furthermore, we mathematically reformulate Molecular Structure Preference Optimization specifically for the masked diffusion. Through bidirectional iterative denoising, MolDA ensures global structural coherence, chemical validity, and robust reasoning across molecule generation, captioning, and property prediction.

Architecture Design (Transformers, SSMs, MoE)Multimodal Models Scientific Discovery & Drug Design

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

MolDA: Molecular Understanding and Generation via Large Language Diffusion Model

Related Papers