Skip to main content
Log in

A Novel Small Samples Fault Diagnosis Method Based on the Self-attention Wasserstein Generative Adversarial Network

  • Published:
Neural Processing Letters Aims and scope Submit manuscript

Abstract

In the current industrial production process, fault data of rotating machinery are often difficult to obtain, and a small amount of fault data can lead to insufficient training of the model and reduced diagnostic accuracy. In addition, the generative adversarial network as a data generation model has the disadvantage of unstable training leading to poor quality of generated data. To address the shortcomings mentioned above, this paper proposed a rotating machinery fault diagnosis method based on the self-attention gradient penalty Wasserstein generative adversarial network under small samples. First, the Wasserstein distance with gradient penalty and self-attention mechanism was introduced into the generative adversarial network model to construct the self-attention gradient penalty Wasserstein generative adversarial network (SA-WGAN-GP) to solve the problems of poor quality of simulation data and unstable training. Then, a convolutional variational autoencoder (Con-VAE)-based sample screening strategy was constructed to realize the screening of high-quality simulation data. Finally, the original data and the screened simulation data were fed into the classifier to achieve fault diagnosis. Experimental validation was performed on Case Western Reserve University and laboratory self-test rolling bearing data. The experimental results show that the method can generate higher-quality simulation data and obtain higher diagnostic accuracy compared with other small-sample methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24

Similar content being viewed by others

References

  1. Grover C, Turk N (2020) Rolling element bearing fault diagnosis using empirical mode decomposition and Hjorth parameters. Procedia Comput Sci 167:1484–1494. https://doi.org/10.1016/j.procs.2020.03.359

    Article  Google Scholar 

  2. Song Y, Zeng S, Ma J, Guo J (2018) A fault diagnosis method for roller bearing based on empirical wavelet transform decomposition with adaptive empirical mode segmentation. Measurement 117:266–276. https://doi.org/10.1016/j.measurement.2017.12.029

    Article  Google Scholar 

  3. Sapena-Bañó A, Pineda-Sanchez M, Puche-Panadero R, Martinez-Roman J, Matić D (2015) Fault diagnosis of rotating electrical machines in transient regime using a single stator current’s FFT. IEEE Trans Instrum Meas 64(11):3137–3146. https://doi.org/10.1109/TIM.2015.2444240

    Article  Google Scholar 

  4. Zhang Y, Mei W, Dong G, Gao J, Wang P et al (2018) A cable fault recognition method based on a deep belief network. Comput Electr Eng 71:452–464

    Article  Google Scholar 

  5. Dangdang Z, Han B, Liu G, Li Y, Yu H (2021) Cross-domain intelligent fault diagnosis method of rotating machinery using multi-scale transfer fuzzy entropy. IEEE Access 9:95481–95492. https://doi.org/10.1109/ACCESS.2021.3063743

    Article  Google Scholar 

  6. Yan X, Jin Y, Xu Y, Li R (2019) Wind turbine generator fault detection based on multi-layer neural network and random forest algorithm. In: 2019 IEEE Innovative Smart Grid Technologies-Asia (ISGT Asia). IEEE, p 4132–4136. http://doi.org/https://doi.org/10.1109/ISGT-Asia.2019.8881778

  7. Tian J, Morillo C, Azarian MH, Pecht M (2015) Motor bearing fault detection using spectral kurtosis-based feature extraction coupled with K-nearest neighbor distance analysis. IEEE Trans Ind Electron 63(3):1793–1803

    Article  Google Scholar 

  8. Lei Y, Jia F, Lin J, Xing S, Ding SX (2016) An intelligent fault diagnosis method using unsupervised feature learning towards mechanical big data. IEEE Trans Ind Electron 63(5):3137–3147. https://doi.org/10.1109/TIE.2015.2509913

    Article  Google Scholar 

  9. Shang Z, Li W, Gao M, Liu X, Yu Y (2021) An intelligent fault diagnosis method of multi-scale deep feature fusion based on information entropy. Chin J Mech Eng 34(1):1–16. https://doi.org/10.1186/s10033-021-00580-5

    Article  Google Scholar 

  10. Yang J, Yin S, Sun C, Gao T (2022) A novel convolutional neural network with interference suppression for the fault diagnosis of mechanical rotating components. Neural Comput Appl 2022:1–17. https://doi.org/10.1007/s00521-022-07022-w

    Article  Google Scholar 

  11. Du W, Wang S, Gong X, Wang H, Yao X et al (2020) Translation invariance-based deep learning for rotating machinery diagnosis. Shock Vib 2020:1–16. https://doi.org/10.1155/2020/1635621

    Article  Google Scholar 

  12. Sun K, Huang Z, Mao H, Qin A, Li X et al (2021) Multi-scale cluster-graph convolution network with multi-channel residual network for intelligent fault diagnosis. IEEE Trans Instrum Meas 71:1–12. https://doi.org/10.1109/TIM.2021.3136264

    Article  Google Scholar 

  13. Goodfellow I, et al (2014) Generative adversarial nets. In: Advances in neural information processing systems 2672–2680.

  14. Liang P, Deng C, Wu J, Yang Z, Zhu J et al (2020) Single and simultaneous fault diagnosis of gearbox via a semi-supervised and high-accuracy adversarial learning framework. Knowl Based Syst 198:105895. https://doi.org/10.1016/j.knosys.2020.105895

    Article  Google Scholar 

  15. Chen Z, Tong L, Qian B et al (2021) Self-attention-based conditional variational auto-encoder generative adversarial networks for hyperspectral classification. Remote Sens 13(16):3316. https://doi.org/10.3390/rs13163316

    Article  Google Scholar 

  16. Liu S, Jiang H, Wu Z et al (2022) Machine fault diagnosis with small sample based on variational information constrained generative adversarial network. Adv Eng Inform 54:101762. https://doi.org/10.1016/j.aei.2022.101762

    Article  Google Scholar 

  17. M. Arjovsky, S. Chintala, and L. Bottou (2017) Wasserstein generative adversarial networks. In: 2017 the 34th International Conference on Machine Learning (ICML), p 214–223

  18. Hu W, Wang T, Chu F (2022) Fault feature recovery with Wasserstein generative adversarial imputation network with gradient penalty for rotating machine health monitoring under signal loss condition. IEEE Trans Instrum Meas 71:1–12. https://doi.org/10.1109/TIM.2022.3168898

    Article  Google Scholar 

  19. Yan K, Su J, Huang J, Mo Y (2020) Chiller fault diagnosis based on VAE-enabled generative adversarial networks. IEEE Trans Autom Sci Eng 99:1–9. https://doi.org/10.1109/TASE.2020.3035620

    Article  Google Scholar 

  20. Gao X, Deng F, Yue X (2020) Data augmentation in fault diagnosis based on the Wasserstein generative adversarial network with gradient penalty. Neurocomputing 396:487–494. https://doi.org/10.1016/j.neucom.2018.10.109

    Article  Google Scholar 

  21. Gulrajani I, Ahmed F, Arjovsky M, Dumoulin V, Courville AC (2017) Improved training of Wasserstein GANs. In: 2017 the 31st International Conference on Neural Information Processing Systems (NIPS), p 5769–5779

  22. Yan X, Xu Y, She D, Zhang W (2021) Reliable fault diagnosis of bearings using an optimized stacked variational denoising auto-encoder. Entropy 24(1):36. https://doi.org/10.3390/e24010036

    Article  MathSciNet  Google Scholar 

  23. Yan X, She D, Xu Y, Jia M (2021) Deep regularized variational autoencoder for intelligent fault diagnosis of rotor–bearing system within entire life-cycle process. Knowl Based Syst 226:107142. https://doi.org/10.1016/j.knosys.2021.107142

    Article  Google Scholar 

  24. Zhang S, Ye F, Wang B, Habetler TG (2020) Semi-supervised bearing fault diagnosis and classification using variational autoencoder-based deep generative models. IEEE Sens J 21(5):6476–6486. https://doi.org/10.1109/JSEN.2020.3040696

    Article  Google Scholar 

  25. Zhang H, Goodfellow I, Metaxas D, Odena A (2019) Self-attention generative adversarial networks. In: 2019 the 36th International conference on machine learning (ICML), p 7354–7363

  26. Ding Y, Jia M, Miao Q, Cao Y (2022) A novel time–frequency transformer based on self–attention mechanism and its application in fault diagnosis of rolling bearings. Mech Syst Signal Process 168:108616. https://doi.org/10.1016/j.ymssp.2021.108616

    Article  Google Scholar 

  27. Smith WA, Randall RB (2015) Rolling element bearing diagnostics using the case western reserve university data: a benchmark study. Mech Syst Signal Proces 64:100–131. https://doi.org/10.1016/j.ymssp.2015.04.021

    Article  Google Scholar 

  28. Case Western reserve university bearing data center website. Available: http://csegroups.case.edu/bearingdatacenter/home.

  29. Bertram S (1970) On the derivation of the fast Fourier transform. IEEE Trans Audio Electroacoust 18(1):55–58

    Article  Google Scholar 

  30. Granlund GH (1972) Fourier preprocessing for hand print character recognition. IEEE Trans Comput 100(2):195–201

    Article  MathSciNet  MATH  Google Scholar 

  31. Eleftheriadis C, Karakonstantis G (2022) Energy-efficient fast fourier transform for real-valued applications. IEEE Trans Circuits Syst II Express Briefs 69(5):2458–2462. https://doi.org/10.1109/TCSII.2022.3163280

    Article  Google Scholar 

  32. Heusel M, Ramsauer H, Unterthiner T, Nessler B, Hochreiter S (2017) Gans trained by a two time-scale update rule converge to a local nash equilibrium. Adv Neural Inf Proces Syst 30

  33. Viola J, Chen YQ, Wang J (2021) FaultFace: deep convolutional generative adversarial network (DCGAN) based ball-bearing failure detection method. Inf Sci 542:195–211. https://doi.org/10.1016/j.ins.2020.06.060

    Article  Google Scholar 

  34. Wang S, Xiang J, Zhong Y, Zhou Y (2018) Convolutional neural network-based hidden Markov models for rolling element bearing fault identification. Knowl Based Syst 144:65–76. https://doi.org/10.1016/j.knosys.2017.12.027

    Article  Google Scholar 

  35. Wang Y, Chen L, Jo J et al (2021) Joint t-SNE for comparable projections of multiple high-dimensional datasets. IEEE Trans Vis Comput Graph 28(1):623–632. https://doi.org/10.1109/TVCG.2021.3114765

    Article  Google Scholar 

  36. Jiang W, Xu Y, Chen Z, Zhang N, Zhou J (2022) Fault diagnosis for rolling bearing using a hybrid hierarchical method based on scale-variable dispersion entropy and parametric t-SNE algorithm. Measurement 191:110843. https://doi.org/10.1016/j.measurement.2022.110843

    Article  Google Scholar 

Download references

Acknowledgements

This work was financially supported by The National Natural Science Foundation of China and the Civil Aviation Administration of China joint funded projects (U1733108); the Key Program of Natural Science Foundation of Tianjin (21JCZDJC00770).

Author information

Authors and Affiliations

Authors

Contributions

JZ: Methodology, Software, Writing-original draft. ZS: Resources, Funding acquisition, Project administration, Supervision, Writing-review and editing. WL: Formal analysis, Investigation. SQ: Validation. JL: Data curation, Data acquisition. MG: Funding acquisition.

Corresponding author

Correspondence to Zhiwu Shang.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shang, Z., Zhang, J., Li, W. et al. A Novel Small Samples Fault Diagnosis Method Based on the Self-attention Wasserstein Generative Adversarial Network. Neural Process Lett 55, 6377–6407 (2023). https://doi.org/10.1007/s11063-022-11143-7

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11063-022-11143-7

Keywords

Navigation