Abstract
It has always been an important issue to diagnose mechanical equipment faults with limited training data. Specifically for the problem of bearing fault diagnosis, a multi-subspace self-attention siamese network (MSSASN) is designed for fault diagnosis with limited training data. In MSSASN, multi-subspace self-attention block is developed to assign higher weights to the fault-related information during learning. Particularly, input features are divided into multiple sub-paths in the channel dimension, and the spatial attention features are calculated separately on sub-path and then merged. In this way, the cross-channel information can be effectively learned, while multi-scale feature learning is carried out. Finally, contrastive learning is carried out on the fault features of different samples using siamese networks to deal with the problem of limited training samples. The proposed method is verified by the vibration dataset collected from the three-phase asynchronous motor experiment platform in Zhejiang University of Technology. The results show that the proposed method can identify rolling bearing faults more accurately with limited training data.
Similar content being viewed by others
Data Availability
Some of the data and materials supporting the results of this study can be obtained from the corresponding authors upon reasonable request.
References
Lei, Y., Yang, B., Jiang, X., Jia, F., Li, N., Nandi, A.K.: Applications of machine learning to machine fault diagnosis: a review and roadmap. Mech. Syst. Signal Process. 138, 106587 (2020)
Lin, A., Cheng, J., Rutkowski, L., Wen, S., Luo, M., Cao, J.: Asynchronous fault detection for memristive neural networks with dwell-time-based communication protocol. IEEE Trans. Neural Netw. Learn. Syst. 34(11), 9004–9015 (2023)
Cheng, J., Lin, A., Cao, J., Qiu, J., Qi, W.: Protocol-based fault detection for discrete-time memristive neural networks with quantization effect. Inf. Sci. 615, 118–135 (2022)
Chen, Y., Zhang, D., Zhang, H., Wang, Q.-G.: Dual-path mixed-domain residual threshold networks for bearing fault diagnosis. IEEE Trans. Ind. Electron. 69(12), 13462–13472 (2022)
Marticorena, M., Peyrano, O.G.: Rolling bearing condition monitoring technique based on cage rotation analysis and acoustic emission. J. Dyn. Monit. Diagn. 1(2), 57–65 (2022)
Han, S., Feng, Z.: Deep residual joint transfer strategy for cross-condition fault diagnosis of rolling bearings. J. Dyn. Monit. Diagn. 2(1), 51–60 (2023)
Hashempour, Z., Agahi, H., Mahmoodzadeh, A.: A novel method for fault diagnosis in rolling bearings based on bispectrum signals and combined feature extraction algorithms. SIViP 16, 1043–1051 (2022)
Xu, Y., Tang, X., Feng, G., Wang, D., Ashworth, C., Gu, F., Ball, A.: Orthogonal on-rotor sensing vibrations for condition monitoring of rotating machines. J. Dyn. Monit. Diagn. 1(1), 29–36 (2022)
Zhang, X., Kong, J., Zhao, Y., Qian, W., Xu, X.: A deep-learning model with improved capsule networks and LSTM filters for bearing fault diagnosis. SIViP 17(4), 1325–1333 (2023)
Zhang, Z., Zhou, F., Karimi, H.R., Fujita, H., Hu, X., Wen, C., Wang, T.: Attention gate guided multiscale recursive fusion strategy for deep neural network-based fault diagnosis. Eng. Appl. Artif. Intell. 126, 107052 (2023)
Tian, J., Han, D., Karimi, H.R., Zhang, Y., Shi, P.: Deep learning-based open set multi-source domain adaptation with complementary transferability metric for mechanical fault diagnosis. Neural Netw. 162, 69–82 (2023)
Chen, Y., Zhang, D., Zhu, K., Yan, R.: An adaptive activation transfer learning approach for fault diagnosis. IEEE/ASME Trans. Mechatron. 28(5), 2645–2656 (2023)
Yang, D., Karimi, H.R., Sun, K.: Residual wide-kernel deep convolutional auto-encoder for intelligent rotating machinery fault diagnosis with limited samples. Neural Netw. 141, 133–144 (2021)
Chen, Y., Zhang, D., Yan, R.: Domain adaptation networks with parameter-free adaptively rectified linear units for fault diagnosis under variable operating conditions. IEEE Trans. Neural Netw. Learn. Syst. (2023). https://doi.org/10.1109/TNNLS.2023.3298648
Zhang, Y., Ji, J., Ren, Z., Ni, Q., Gu, F., Feng, K., Yu, K., Ge, J., Lei, Z., Liu, Z.: Digital twin-driven partial domain adaptation network for intelligent fault diagnosis of rolling bearing. Reliab. Eng. Syst. Saf. 234, 109186 (2023)
Wang, H., Liu, Z., Peng, D., Cheng, Z.: Attention-guided joint learning CNN with noise robustness for bearing fault diagnosis and vibration signal denoising. ISA Trans. 128, 470–484 (2022)
Li, X., Wan, S., Liu, S., Zhang, Y., Hong, J., Wang, D.: Bearing fault diagnosis method based on attention mechanism and multilayer fusion network. ISA Trans. 128, 550–564 (2022)
Zhou, F., Sun, T., Hu, X., Wang, T., Wen, C.: A sparse denoising deep neural network for improving fault diagnosis performance. SIViP 15(8), 1889–1898 (2021)
Li, H., Wang, D.: Multilevel feature fusion of multi-domain vibration signals for bearing fault diagnosis. Signal Image Video Process. 1–10 (2023)
Qiao, M., Yan, S., Tang, X., Xu, C.: Deep convolutional and LSTM recurrent neural networks for rolling bearing fault diagnosis under strong noises and variable loads. IEEE Access 8, 66257–66269 (2020)
Zhang, A., Li, S., Cui, Y., Yang, W., Dong, R., Hu, J.: Limited data rolling bearing fault diagnosis with few-shot learning. IEEE Access 7, 110895–110904 (2019)
Luo, J., Huang, J., Li, H.: A case study of conditional deep convolutional generative adversarial networks in machine fault diagnosis. J. Intell. Manuf. 32, 407–425 (2021)
Wang, R., Chen, Z., Zhang, S., Li, W.: Dual-attention generative adversarial networks for fault diagnosis under the class-imbalanced conditions. IEEE Sens. J. 22(2), 1474–1485 (2021)
Zhao, M., Zhong, S., Fu, X., Tang, B., Pecht, M.: Deep residual shrinkage networks for fault diagnosis. IEEE Trans. Ind. Inf. 16(7), 4681–4690 (2019)
Liu, H., Liu, F., Fan, X., Huang, D.: Polarized self-attention: Towards high-quality pixel-wise regression (2021). arXiv:2107.00782
Nair, V., Hinton, G.E.: Rectified linear units improve restricted Boltzmann machines. In: Proceedings of the 27th international conference on machine learning (ICML-10), pp. 807–814 (2010)
Lin, T.-Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
Ye, Z., Zhang, D., Deng, C., Yan, H., Feng, G.: Finite-time resilient sliding mode control of nonlinear UMV systems subject to dos attacks. Automatica 156, 111170 (2023)
Guo, X.-G., Liu, P.-M., Wu, Z.-G., Zhang, D., Ahn, C.K.: Hybrid event-triggered group consensus control for heterogeneous multi-agent systems with TVNUD faults and stochastic FDI attacks. IEEE Trans. Autom. Control (2023). https://doi.org/10.1109/TAC.2023.3254368
Chen, Y., Zhang, D., Karimi, H.R., Deng, C., Yin, W.: A new deep learning framework based on blood pressure range constraint for continuous cuffless BP estimation. Neural Netw. 152, 181–190 (2022)
Panwar, M., Gautam, A., Biswas, D., Acharyya, A.: PP-Net: a deep learning framework for PPG-based blood pressure and heart rate estimation. IEEE Sens. J. 20(17), 10000–10011 (2020)
Zhang, W., Peng, G., Li, C., Chen, Y., Zhang, Z.: A new deep learning model for fault diagnosis with good anti-noise and domain adaptation ability on raw vibration signals. Sensors 17(2), 425 (2017)
Song, X., Cong, Y., Song, Y., Chen, Y., Liang, P.: A bearing fault diagnosis model based on CNN with wide convolution kernels. J. Ambient Intell. Humaniz. Comput. 13(8), 4041–4056 (2022)
Zhang, S., Wang, R., Si, Y., Wang, L.: An improved convolutional neural network for three-phase inverter fault diagnosis. IEEE Trans. Instrum. Meas. 71, 3510915 (2022)
Xu, Y., Yan, X., Sun, B., Zhai, J., Liu, Z.: Multireceptive field denoising residual convolutional networks for fault diagnosis. IEEE Trans. Ind. Electron. 69(11), 11686–11696 (2022)
Zhang, L., Fan, Q., Lin, J., Zhang, Z., Yan, X., Li, C.: A nearly end-to-end deep learning approach to fault diagnosis of wind turbine gearboxes under nonstationary conditions. Eng. Appl. Artif. Intell. 119, 105735 (2023)
Funding
This work was partially supported by the National Natural Science Foundation of China under Grant Nos. 62322315 and 61873237, the Zhejiang Provincial Natural Science Foundation of China under Grant No. LR22F030003, the National Key R &D Funding under Grant No. 2018YFB1403702, the Key R &D Programs of Zhejiang Province under Grant No. 2023C01224 and Major Project of Science and Technology Innovation in Ningbo City under Grant No. 2019B1003.
Author information
Authors and Affiliations
Contributions
XZ and YC contributed to conceptualization, methodology, data processing, experimental validation and manuscript writing. HN, DZ and MA contributed to the modification of the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, X., Chen, Y., Ni, H. et al. Multi-subspace self-attention siamese networks for fault diagnosis with limited data. SIViP 18, 2465–2472 (2024). https://doi.org/10.1007/s11760-023-02922-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-023-02922-3