CASEötvös Loránd UniversityJHUOhio StateFeb 16, 2026arXiv:2602.15021

Generalization from Low- to Moderate-Resolution Spectra with Neural Networks for Stellar Parameter Estimation: A Case Study with DESI

Xiaosheng Zhao, Yuan-Sen Ting, Rosemary F. G. Wyse, Alexander S. Szalay, Yang Huang, László Dobos, Tamás Budavári, Viska Wei

AI Summary

The paper investigates cross-survey generalization in stellar spectral analysis, specifically transferring from low-resolution LAMOST spectra to medium-resolution DESI spectra using pre-trained multilayer perceptrons (MLPs). They compared MLPs trained directly on spectra with those trained on embeddings from transformer-based models and evaluated different fine-tuning strategies like residual-head adapters, LoRA, and full fine-tuning. The key result is that MLPs pre-trained on LAMOST LRS achieve strong performance, even without fine-tuning, and modest fine-tuning with DESI spectra further improves results, although transformer embeddings only show advantages in the metal-rich regime for iron abundance.

Key Contribution

Simple MLPs pre-trained on low-resolution spectra can generalize surprisingly well to higher-resolution data for stellar parameter estimation, sometimes outperforming transformer-based embeddings.

Abstract

Cross-survey generalization is a critical challenge in stellar spectral analysis, particularly in cases such as transferring from low- to moderate-resolution surveys. We investigate this problem using pre-trained models, focusing on simple neural networks such as multilayer perceptrons (MLPs), with a case study transferring from LAMOST low-resolution spectra (LRS) to DESI medium-resolution spectra (MRS). Specifically, we pre-train MLPs on either LRS or their embeddings and fine-tune them for application to DESI stellar spectra. We compare MLPs trained directly on spectra with those trained on embeddings derived from transformer-based models (self-supervised foundation models pre-trained for multiple downstream tasks). We also evaluate different fine-tuning strategies, including residual-head adapters, LoRA, and full fine-tuning. We find that MLPs pre-trained on LAMOST LRS achieve strong performance, even without fine-tuning, and that modest fine-tuning with DESI spectra further improves the results. For iron abundance, embeddings from a transformer-based model yield advantages in the metal-rich ([Fe/H] > -1.0) regime, but underperform in the metal-poor regime compared to MLPs trained directly on LRS. We also show that the optimal fine-tuning strategy depends on the specific stellar parameter under consideration. These results highlight that simple pre-trained MLPs can provide competitive cross-survey generalization, while the role of spectral foundation models for cross-survey stellar parameter estimation requires further exploration.

Architecture Design (Transformers, SSMs, MoE)Scientific Discovery & Drug Design Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Generalization from Low- to Moderate-Resolution Spectra with Neural Networks for Stellar Parameter Estimation: A Case Study with DESI

Related Papers