Elsevier

Neurocomputing

Volume 442, 28 June 2021, Pages 348-358
Neurocomputing

Diagnosis of Inter-turn Short Circuit of Permanent Magnet Synchronous Motor Based on Deep learning and Small Fault Samples

https://doi.org/10.1016/j.neucom.2020.04.160Get rights and content

Abstract

An efficient and accurate method based on a conditional generative adversarial net (CGAN) and an optimized sparse auto encoder (OSAE) is proposed to detect the inter-turn short circuit (ITSC) problem for permanent magnet synchronous motors (PMSMs). In order to achieve an accurate detection of the ITSC, the CGAN is adopted to augment the few fault samples, and a noise injection strategy is applied to enhance the generalization ability of the network in the framework of the OSAE. Specifically, we made a combination of two types of signals to create a training set that is augmented by the CGAN, and the parameters of the OSAE are determined by the training process of networks. The experimental results indicate that the proposed method for the fault diagnosis of this fault achieves high accuracy 98.9%.

Introduction

The permanent magnet synchronous motor (PMSM) has some special characteristics in many rotating machinery such as high efficiency, power density and wide speed range [1]. But it is vulnerable to various types of faults. Especially the inter-turn short circuit (ITSC) is a root of all apparent faults of PMSMs which can result in other serious faults such as an inter-phase short circuit, a single-phase ground and so on. These coupling faults may lead to serious demagnetization of a permanent magnet, which causes the PMSM to stop running. Traditionally, the ITSC is detected by regular disassembly. Wu et al. [2] proposed a new detection coil which is installed inside the generator for online detection of this fault. However, the above methods are destructive and are not suitable for many application scenarios. So the nondestructive diagnosis is the hot spot of detecting the ITSC.

The nondestructive diagnosis relies on the physical changes of the PMSM to detect the ITSC faults. For example, the temperature is the most intuitive manifestation of this kind of faults, if the number of short circuit coils increases unregularly, the temperature inside the PMSM may exceed 100 degrees. The deviation of a three-phase current fluctuates greatly more than normal with the increase of eddy loss, and a magnetomotive force is opposite to the direction of a resultant magnetic, which causes a demagnetization of the PMSM easily. Besides, negative sequence characteristics, a harmonic current and vibration are also fit for detecting the ITSC. These parameters are easily interfered by several objective conditions such as humidity, dust and so on. So the combination of these fault samples are benefit for reducing the influence of environmental factors, which enhancing the diagnosis reliability of the short circuit fault effectively.

However, the probability of detecting such faults in rotating machinery is only 1%, which makes the diagnosis of such faults become a difficult few-shot problem due to the lack of fault samples. A series of mathematical analysis methods such as principal component analysis(PCA) [3] empirical mode decomposition (EMD) [4] BP network [5] support vector machines (SVM) [6] and other machine learning methods can be used to identify the fault based on these few samples. But there are several limitations in the application of fault diagnosis.

  • 1)

    The fault samples, despite few, are crucial to the classification quality of the above-mentioned methods. Hence, the issue of lack of fault samples significantly impairs the classification performance of those methods.

  • 2)

    The distribution characteristics of these samples are not compatible with the analysis capability provided by these traditional methods. For example, the PCA method is not the best method for identifing the few sample with the non-Gaussian distribution.

  • 3)

    The SVM and artificial neural networks are unable to learn complex non-linear relationships in the task of fault diagnosis due to their shallow architectures. So deeper architectures are conducive automatically extract valid fault samples from big data.

Fault diagnosis methods based on deep learning are very meaningful for us to discover early fault of motors timely, improve system reliability and reduce maintenance cost [7]. Deep learning methods which includes convolutional neural network (CNN) [8] , deep neural networks [9] , sparse auto encoder (SAE) [10], recursive neural network (RNN) [11], deep belief network (DBN) [12], and generative adversarial net (GAN) [13] can provide effective solution to overcome the above limitations. Deep architectures have more multiple hidden layers than traditional networks. They are useful for learning representative features from the raw data directly, and automatically selecting discriminative representations of fault samples.

A large number of short circuit fault samples and deep learning based methods are the core to improve the stability of fault diagnosis based on professional knowledge. However, the structure of the PMSM is becoming more and more complex, which causes the short circuit to be invisible and random. Meanwhile the correlated interference usually leads to the homogenization of these samples and covers up the fault performance of the PMSM. So it is necessary to modify the diversity of the training samples and augment these limited features. Li et al. [14] adopted the GAN to generate fault samples and enhanced the diagnosis accuracy obviously. But the GAN has some defects such as the problem of convergence between a generative model and a discriminative model. The conditional generative adversarial net (CGAN) transforms the unsupervised GAN into a supervised model which makes the output data strongly correlate with the input samples [15]. In addition to the problem of the small samples and networks’ parameters, many types of optimization methods are proposed to accelerate the training speed of models, such as the adaptive sub-gradient method [16] and adaptive moment estimation (Adam) [17].

The denoising auto encoder (DAE) is an effective improvement of the auto encoder[18]. In order to make the hidden features more robust, a certain probability of noise is added to the training samples, where the input data of each hidden layer is randomly set to zero. Because the training mechanism of the auto encoder requires that the output data is as much as possible close to the input data without noise pollution, it means that the DAE learn how to remove the noise from the input data. The DAE enhances the generalization ability of the auto encoder in the condition of practical application. So we draw on this idea of the DAE for finding the way of noise injection in the background of detecting the ITSC.

The main contributions of this paper are summarized as follows.

  • 1)

    We proposed a fault diagnosis method based on the small sample of the PMSM, and completed an optimized solution with two types of fault features. The CGAN is used to augment the number of fault samples to increase the diversity of data, and a certain probability of noise is added to the training set to enhance the generalization ability of the SAE. The entire architecture of the optimized SAE (OSAE) was finally adjusted by these task-specific samples, and several hierarchical representations about the ITSC were learned from the training samples.

  • 2)

    The performance of the proposed method based on the CGAN and the OSAE is compared with the performance of deep learning algorithms such as the CNN, the RNN and the BP methods. The former leads to faster training and better accuracy in terms of fault classification of the PMSM.

The rest of this paper is organized as follows. We presented a diagnosis method based on the CGAN and the OSAE for the ITSC fault of the PMSM. Firstly, an electromagnetic torque and a negative sequence current are acquired to describe the fault status of the PMSM. Secondly, the CGAN is used to augment the sample data-set based on the previously combined characteristics. Finally, a concise expression of fault characteristics is created and optimized by the OSAE model, and then a softmax classifier is trained to detect the ITSC of the PMSM. Sections 2 and 3 discuss the principles and fault characteristics of deep learning-based models, respectively. The experimental analysis and conclusions are introduced in Section 4 and Section 5, respectively

The research using machine learning techniques is very fruitful in the field of fault diagnosis. Wang et al.[18] adopted the DAE combined with softmax classifier to accomplish 6 types of fault diagnosis of asynchronous motor with an accuracy over 95%. Tang et al. [19] proposed an adaptive DBN with Nesterov momentum for the diagnosis of the rotating machinery. Janssens et al. [20] developed a learning model based on the CNN for bearing fault detection, and Chen et al. [21] used the CNN to identify faults in gearboxes. Piotr et al. [22] adopted the RNN to realize the behavior of a chaos engineering system for fault detection. Sun et al. [23] presented the SAE to achieve samples’ learning for the identification of induction motor faults. If the number of hidden layer nodes is larger than the number of input nodes, a sparse restriction is added to the auto encoder for activating some nodes. Chang et al. [24] proposed a mutual-channel loss for improving the classification ability of deep networks, which is also fit for the SAE algorithm. These above models can be easily deployed in industrial Intelligence IoT [25].

Section snippets

The CGAN

The CGAN is used to augment small samples of the training set, which strengthens the generalization ability of the OSAE by creating noisy samples. The CGAN includes the generative model and the discriminative model [15] shown in Figure 1. The two models play games to learn the special characteristic of the samples within the restriction of the conditional variable y. The augmented samples are obtianed from the generative model, and then input into the discriminative model. Generally, the

The characteristics Analysis of the ITSC

The negative sequence current and the electromagnetic torque are typical characteristics of the ITSC. The negative sequence current is close to zero in the condition of symmetrical three-phase voltage. However, due to several types of load and operating mode, the negative sequence current contains some non-fault components. It is difficult to distinguish the ITSC based on this single feature. As shown in Fig. 4, the electromagnetic torque fluctuated obviously under the state of the short

Experimental analysis

We have established an experimental platform to verify the effectiveness of the proposed method shown in Fig. 8, and its parameters are presented in Table 1.

In order to obtain a few samples without other interferences, we adopted an ANSOFT software to construct to construct the finite element model of the PMSM[28]. The structure of the PMSM about the winding is shown in Fig. 9. The characteristics of various short circuit are obtained by changing of the parameters of coil. The labels related to

Conclusion

This paper presented an efficient and accurate fault diagnosis method for the PMSM based on the CGAN and the OSAE. It uses the CGAN to augment these joint features of the ITSC and enrich the training set, thus overcoming the shortcomings of insufficient training of the deep learning based networks. Meanwhile, the noise injection strategy is introduced into the SAE to enlarge the variety of fault samples and enhance the sparse representation of networks. At last, the OSAE is optimized to achieve

Future work

The research direction is summarized as the following two points.

  • The hyperparameters in the OSAE mainly depend on experience. If a single parameter leads to an optimal effect, it doesn’t mean the network is absolutely optimal after the combination of these multiple parameters. So, the automatic learning of hyperparameters is a development trend in the field of fault diagnosis with the initialization of the network.

  • At present, supervised learning steps are available for most of auto encoder

CRediT authorship contribution statement

Yuanjiang Li: Methodology, Software, Writing - original draft. Yanbo Wang: Data curation, Software. Yi Zhang: Visualization, Investigation, Validation. Jinglin Zhang: Methodology, Supervision, Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgement

This project was supported partially by National Natural Science Foundation of China (Grants: 41775008, 61702275, 51977101).

Yuanjiang Li received the MS degree in Information and Communication Engineering from Jiangsu University of Science and Technology, Zhenjiang, China in 2006 and his PhD degree in electronics and telecommunications from Nanjing University of Technology in 2013. Since 2013, he has been on Jiangsu University of Science and Technology. His current research interest includes computer vision, deep learning and fault diagnosis.

References (29)

  • H. Ren et al.

    Research status and challenges of deep learning in fault diagnosis

    Control Decision

    (2017)
  • X. Guo et al.

    Hierarchical adaptive deep convolution neural network and its application to bearing fault diagnosis

    Measurement

    (2016)
  • F. Jia et al.

    Deep neural networks: a promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data

    Mech. Syst. Signal Process.

    (2016)
  • Jun Xu et al.

    Stacked Sparse Autoencoder (SSAE) for Nuclei Detection on Breast Cancer Histopathology Images

    IEEE Transactions on Medical Imaging

    (2016)
  • Cited by (40)

    View all citing articles on Scopus

    Yuanjiang Li received the MS degree in Information and Communication Engineering from Jiangsu University of Science and Technology, Zhenjiang, China in 2006 and his PhD degree in electronics and telecommunications from Nanjing University of Technology in 2013. Since 2013, he has been on Jiangsu University of Science and Technology. His current research interest includes computer vision, deep learning and fault diagnosis.

    Yanbo Wang is the graduated student at Jiangsu University of Science and Technology. His current research interest includes computer vision, deep learning and fault diagnosis.

    Yi Zhang received the MS degree in Electrical engineering from Jiangsu University, Zhenjiang, China in 2005 and his PhD degree in Electrical engineering from Jiangsu University in 2009. Since 2009, she has been on Jiangsu University of Science and Technology. His current research interest includes optimum control, deep learning and fault diagnosis.

    Jinglin Zhang received the MS degree in circuits and systems from Shanghai University, Shanghai, China in 2010 and his PhD degree inelectronics and telecommunications from the National Institute of Applied Sciences-Rennes (INSA de Rennes) in 2013. From 2014 to 2020, he was on the faculty of the School of computer and software, Nanjing University of Information Science Technology. Since 2021, he has been on the faculty of the School of Artificial Intelligence, Hebei University of Technology. His current research interest includes computer vision, high-performance computing, interdisciplinary research with pattern recognition and atmospheric science.

    View full text