Search papers, labs, and topics across Lattice.
This paper introduces LISTA-Transformer, a novel architecture that integrates Learnable Iterative Shrinkage Threshold Algorithm (LISTA) sparse coding with the visual Transformer to improve feature extraction in fault diagnosis. The model aims to overcome limitations of CNNs and Transformers in local feature modeling and global dependency capture by creating an adaptive local and global feature collaboration mechanism. Experiments on the CWRU dataset demonstrate a fault recognition rate of 98.5%, outperforming traditional and existing Transformer-based methods by 3.3%.
By fusing sparse coding with visual Transformers, LISTA-Transformer achieves state-of-the-art fault diagnosis accuracy, surpassing traditional methods by 3.3% on the CWRU dataset.
Driven by the continuous development of models such as Multi-Layer Perceptron, Convolutional Neural Network (CNN), and Transformer, deep learning has made breakthrough progress in fields such as computer vision and natural language processing, and has been successfully applied in practical scenarios such as image classification and industrial fault diagnosis. However, existing models still have certain limitations in local feature modeling and global dependency capture. Specifically, CNN is limited by local receptive fields, while Transformer has shortcomings in effectively modeling local structures, and both face challenges of high model complexity and insufficient interpretability. In response to the above issues, we proposes the following innovative work: A sparse Transformer based on Learnable Iterative Shrinkage Threshold Algorithm (LISTA-Transformer) was designed, which deeply integrates LISTA sparse encoding with visual Transformer to construct a model architecture with adaptive local and global feature collaboration mechanism. This method utilizes continuous wavelet transform to convert vibration signals into time-frequency maps and inputs them into LISTA-Transformer for more effective feature extraction. On the CWRU dataset, the fault recognition rate of our method reached 98.5%, which is 3.3% higher than traditional methods and exhibits certain superiority over existing Transformer-based approaches.