Search papers, labs, and topics across Lattice.
This paper introduces a music generation system leveraging fractional Fourier transform (FrFT) for feature extraction and LSTM networks for sequence prediction. FrFT extracts time-frequency domain features from music, which are then used to train an LSTM network on the GiantMIDI-Piano dataset to generate new musical sequences. The generated music is claimed to be of high quality, comparable to human-generated music.
Time-frequency feature extraction via fractional Fourier transform unlocks surprisingly high-quality music generation from LSTMs.
In this paper, we propose a novel approach for generating music based on an artificial intelligence (AI) system. We analyze the features of music and use them to fit and predict the music. The fractional Fourier transform (FrFT) and the long short-term memory (LSTM) network are the foundations of our method. The FrFT method is used to extract the spectral features of a music piece, where the music signal is expressed on the time and frequency domains. The LSTM network is used to generate new music based on the extracted features, where we predict the music according to the hidden layer features and real-time inputs using GiantMIDI-Piano dataset. The results of our experiments show that our proposed system is capable of generating high-quality music that is comparable to human-generated music.