Feb 26, 2026 · arXiv:2602.22822

FlexMS is a flexible framework for benchmarking deep learning-based mass spectrum prediction tools in metabolomics

Yunhua Zhong, Yixuan Tang, Yifan Li, Jie Yang, Pan Liu, Panlong Liu, Jun Xia

AI Summary

The paper introduces FlexMS, a flexible benchmarking framework for deep learning models predicting mass spectra from molecular structures, addressing the lack of standardized evaluation in metabolomics. FlexMS allows dynamic construction and evaluation of diverse model architectures on public datasets using various metrics, providing insights into factors influencing performance such as dataset diversity, hyperparameters, and pretraining. The framework also includes retrieval benchmarks to simulate practical molecular identification scenarios.

Key Contribution

Stop struggling with inconsistent benchmarks for mass spec prediction: FlexMS offers a flexible framework to build and evaluate diverse deep learning architectures, revealing key factors that drive performance.

Abstract

The identification and property prediction of chemical molecules are central to drug discovery and materials science, where tandem mass spectrometry provides valuable fragmentation cues in the form of mass-to-charge ratio peaks. However, the scarcity of experimental spectra hinders molecular identification and motivates computational approaches to spectrum prediction. Deep learning models appear promising for predicting mass spectra from molecular structures, but overall assessment remains challenging because of heterogeneous methods and the lack of well-defined benchmarks. To address this, we introduce FlexMS, a benchmark framework for constructing and evaluating diverse model architectures in mass spectrum prediction. FlexMS supports the dynamic construction of numerous distinct combinations of model architectures and assesses their performance on preprocessed public datasets using different metrics. In this paper, we provide insights into factors influencing performance, including the structural diversity of datasets, hyperparameters such as learning rate and data sparsity, pretraining effects, metadata ablation settings, and cross-domain transfer learning. These insights provide practical guidance for choosing suitable models. Moreover, retrieval benchmarks simulate practical identification scenarios and score potential matches based on predicted spectra.
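To make the retrieval scenario in the abstract concrete, here is a minimal sketch of how candidates might be ranked against an observed spectrum. It is not FlexMS code: the function names, the 1 Da binning, the choice of cosine similarity as the score, and the toy peak lists are all assumptions made for illustration, since the abstract only says that "different metrics" are used.

```python
import math


def bin_spectrum(peaks, bin_width=1.0, max_mz=1000.0):
    """Convert (m/z, intensity) peaks into a fixed-length intensity vector.

    bin_width and max_mz are illustrative defaults, not values from the paper.
    """
    n_bins = int(math.ceil(max_mz / bin_width))
    vec = [0.0] * n_bins
    for mz, intensity in peaks:
        idx = int(mz / bin_width)
        if 0 <= idx < n_bins:
            vec[idx] += intensity
    return vec


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_candidates(query_peaks, predicted_spectra):
    """Rank candidate molecules by similarity of their predicted spectrum to the query.

    predicted_spectra maps a candidate name to its predicted peak list.
    Returns (name, score) pairs sorted from best to worst match.
    """
    query_vec = bin_spectrum(query_peaks)
    scored = [
        (name, cosine_similarity(query_vec, bin_spectrum(peaks)))
        for name, peaks in predicted_spectra.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy data invented for this example (not real spectra).
query = [(91.05, 100.0), (119.05, 40.0), (147.04, 15.0)]
predicted = {
    "candidate_A": [(91.06, 90.0), (119.04, 50.0), (147.05, 10.0)],
    "candidate_B": [(77.04, 100.0), (105.03, 60.0)],
}
ranking = rank_candidates(query, predicted)
```

In this toy case `candidate_A` shares all three bins with the query and ranks first, while `candidate_B` shares none and scores zero. A real benchmark would draw candidates from a molecular database and report rank-based accuracy over many queries.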

Eval Frameworks & Benchmarks · Scientific Discovery & Drug Design

Citation Metrics

Citations: 0

Influential citations: 0

References: 58

Year: 2026

Venue: N/A
