Hainan Normal UniversityApr 20, 2026arXiv:2604.17823

A novel LSTM music generator based on the fractional time-frequency feature extraction

Li Ya, Chen Wei, Li Xiulai, Xiulai Li, Deng Xinyi, Chen Chaofan, Chaofan Chen

AI Summary

This paper introduces a music generation system leveraging fractional Fourier transform (FrFT) for feature extraction and LSTM networks for sequence prediction. FrFT extracts time-frequency domain features from music, which are then used to train an LSTM network on the GiantMIDI-Piano dataset to generate new musical sequences. The generated music is claimed to be of high quality, comparable to human-generated music.

Key Contribution

Time-frequency feature extraction via fractional Fourier transform unlocks surprisingly high-quality music generation from LSTMs.

Abstract

In this paper, we propose a novel approach for generating music based on an artificial intelligence (AI) system. We analyze the features of music and use them to fit and predict the music. The fractional Fourier transform (FrFT) and the long short-term memory (LSTM) network are the foundations of our method. The FrFT method is used to extract the spectral features of a music piece, where the music signal is expressed on the time and frequency domains. The LSTM network is used to generate new music based on the extracted features, where we predict the music according to the hidden layer features and real-time inputs using GiantMIDI-Piano dataset. The results of our experiments show that our proposed system is capable of generating high-quality music that is comparable to human-generated music.

Architecture Design (Transformers, SSMs, MoE)Speech & Audio

Citation Metrics

Citations0

Influential citations0

References33

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

A novel LSTM music generator based on the fractional time-frequency feature extraction

Related Papers