Diagnosis of Inter-turn Short Circuit of Permanent Magnet Synchronous Motor Based on Deep learning and Small Fault Samples
Introduction
The permanent magnet synchronous motor (PMSM) has some special characteristics in many rotating machinery such as high efficiency, power density and wide speed range [1]. But it is vulnerable to various types of faults. Especially the inter-turn short circuit (ITSC) is a root of all apparent faults of PMSMs which can result in other serious faults such as an inter-phase short circuit, a single-phase ground and so on. These coupling faults may lead to serious demagnetization of a permanent magnet, which causes the PMSM to stop running. Traditionally, the ITSC is detected by regular disassembly. Wu et al. [2] proposed a new detection coil which is installed inside the generator for online detection of this fault. However, the above methods are destructive and are not suitable for many application scenarios. So the nondestructive diagnosis is the hot spot of detecting the ITSC.
The nondestructive diagnosis relies on the physical changes of the PMSM to detect the ITSC faults. For example, the temperature is the most intuitive manifestation of this kind of faults, if the number of short circuit coils increases unregularly, the temperature inside the PMSM may exceed 100 degrees. The deviation of a three-phase current fluctuates greatly more than normal with the increase of eddy loss, and a magnetomotive force is opposite to the direction of a resultant magnetic, which causes a demagnetization of the PMSM easily. Besides, negative sequence characteristics, a harmonic current and vibration are also fit for detecting the ITSC. These parameters are easily interfered by several objective conditions such as humidity, dust and so on. So the combination of these fault samples are benefit for reducing the influence of environmental factors, which enhancing the diagnosis reliability of the short circuit fault effectively.
However, the probability of detecting such faults in rotating machinery is only 1%, which makes the diagnosis of such faults become a difficult few-shot problem due to the lack of fault samples. A series of mathematical analysis methods such as principal component analysis(PCA) [3] empirical mode decomposition (EMD) [4] BP network [5] support vector machines (SVM) [6] and other machine learning methods can be used to identify the fault based on these few samples. But there are several limitations in the application of fault diagnosis.
- 1)
The fault samples, despite few, are crucial to the classification quality of the above-mentioned methods. Hence, the issue of lack of fault samples significantly impairs the classification performance of those methods.
- 2)
The distribution characteristics of these samples are not compatible with the analysis capability provided by these traditional methods. For example, the PCA method is not the best method for identifing the few sample with the non-Gaussian distribution.
- 3)
The SVM and artificial neural networks are unable to learn complex non-linear relationships in the task of fault diagnosis due to their shallow architectures. So deeper architectures are conducive automatically extract valid fault samples from big data.
Fault diagnosis methods based on deep learning are very meaningful for us to discover early fault of motors timely, improve system reliability and reduce maintenance cost [7]. Deep learning methods which includes convolutional neural network (CNN) [8] , deep neural networks [9] , sparse auto encoder (SAE) [10], recursive neural network (RNN) [11], deep belief network (DBN) [12], and generative adversarial net (GAN) [13] can provide effective solution to overcome the above limitations. Deep architectures have more multiple hidden layers than traditional networks. They are useful for learning representative features from the raw data directly, and automatically selecting discriminative representations of fault samples.
A large number of short circuit fault samples and deep learning based methods are the core to improve the stability of fault diagnosis based on professional knowledge. However, the structure of the PMSM is becoming more and more complex, which causes the short circuit to be invisible and random. Meanwhile the correlated interference usually leads to the homogenization of these samples and covers up the fault performance of the PMSM. So it is necessary to modify the diversity of the training samples and augment these limited features. Li et al. [14] adopted the GAN to generate fault samples and enhanced the diagnosis accuracy obviously. But the GAN has some defects such as the problem of convergence between a generative model and a discriminative model. The conditional generative adversarial net (CGAN) transforms the unsupervised GAN into a supervised model which makes the output data strongly correlate with the input samples [15]. In addition to the problem of the small samples and networks’ parameters, many types of optimization methods are proposed to accelerate the training speed of models, such as the adaptive sub-gradient method [16] and adaptive moment estimation (Adam) [17].
The denoising auto encoder (DAE) is an effective improvement of the auto encoder[18]. In order to make the hidden features more robust, a certain probability of noise is added to the training samples, where the input data of each hidden layer is randomly set to zero. Because the training mechanism of the auto encoder requires that the output data is as much as possible close to the input data without noise pollution, it means that the DAE learn how to remove the noise from the input data. The DAE enhances the generalization ability of the auto encoder in the condition of practical application. So we draw on this idea of the DAE for finding the way of noise injection in the background of detecting the ITSC.
The main contributions of this paper are summarized as follows.
- 1)
We proposed a fault diagnosis method based on the small sample of the PMSM, and completed an optimized solution with two types of fault features. The CGAN is used to augment the number of fault samples to increase the diversity of data, and a certain probability of noise is added to the training set to enhance the generalization ability of the SAE. The entire architecture of the optimized SAE (OSAE) was finally adjusted by these task-specific samples, and several hierarchical representations about the ITSC were learned from the training samples.
- 2)
The performance of the proposed method based on the CGAN and the OSAE is compared with the performance of deep learning algorithms such as the CNN, the RNN and the BP methods. The former leads to faster training and better accuracy in terms of fault classification of the PMSM.
The rest of this paper is organized as follows. We presented a diagnosis method based on the CGAN and the OSAE for the ITSC fault of the PMSM. Firstly, an electromagnetic torque and a negative sequence current are acquired to describe the fault status of the PMSM. Secondly, the CGAN is used to augment the sample data-set based on the previously combined characteristics. Finally, a concise expression of fault characteristics is created and optimized by the OSAE model, and then a softmax classifier is trained to detect the ITSC of the PMSM. Sections 2 and 3 discuss the principles and fault characteristics of deep learning-based models, respectively. The experimental analysis and conclusions are introduced in Section 4 and Section 5, respectively
The research using machine learning techniques is very fruitful in the field of fault diagnosis. Wang et al.[18] adopted the DAE combined with softmax classifier to accomplish 6 types of fault diagnosis of asynchronous motor with an accuracy over 95%. Tang et al. [19] proposed an adaptive DBN with Nesterov momentum for the diagnosis of the rotating machinery. Janssens et al. [20] developed a learning model based on the CNN for bearing fault detection, and Chen et al. [21] used the CNN to identify faults in gearboxes. Piotr et al. [22] adopted the RNN to realize the behavior of a chaos engineering system for fault detection. Sun et al. [23] presented the SAE to achieve samples’ learning for the identification of induction motor faults. If the number of hidden layer nodes is larger than the number of input nodes, a sparse restriction is added to the auto encoder for activating some nodes. Chang et al. [24] proposed a mutual-channel loss for improving the classification ability of deep networks, which is also fit for the SAE algorithm. These above models can be easily deployed in industrial Intelligence IoT [25].
Section snippets
The CGAN
The CGAN is used to augment small samples of the training set, which strengthens the generalization ability of the OSAE by creating noisy samples. The CGAN includes the generative model and the discriminative model [15] shown in Figure 1. The two models play games to learn the special characteristic of the samples within the restriction of the conditional variable y. The augmented samples are obtianed from the generative model, and then input into the discriminative model. Generally, the
The characteristics Analysis of the ITSC
The negative sequence current and the electromagnetic torque are typical characteristics of the ITSC. The negative sequence current is close to zero in the condition of symmetrical three-phase voltage. However, due to several types of load and operating mode, the negative sequence current contains some non-fault components. It is difficult to distinguish the ITSC based on this single feature. As shown in Fig. 4, the electromagnetic torque fluctuated obviously under the state of the short
Experimental analysis
We have established an experimental platform to verify the effectiveness of the proposed method shown in Fig. 8, and its parameters are presented in Table 1.
In order to obtain a few samples without other interferences, we adopted an ANSOFT software to construct to construct the finite element model of the PMSM[28]. The structure of the PMSM about the winding is shown in Fig. 9. The characteristics of various short circuit are obtained by changing of the parameters of coil. The labels related to
Conclusion
This paper presented an efficient and accurate fault diagnosis method for the PMSM based on the CGAN and the OSAE. It uses the CGAN to augment these joint features of the ITSC and enrich the training set, thus overcoming the shortcomings of insufficient training of the deep learning based networks. Meanwhile, the noise injection strategy is introduced into the SAE to enlarge the variety of fault samples and enhance the sparse representation of networks. At last, the OSAE is optimized to achieve
Future work
The research direction is summarized as the following two points.
The hyperparameters in the OSAE mainly depend on experience. If a single parameter leads to an optimal effect, it doesn’t mean the network is absolutely optimal after the combination of these multiple parameters. So, the automatic learning of hyperparameters is a development trend in the field of fault diagnosis with the initialization of the network.
At present, supervised learning steps are available for most of auto encoder
CRediT authorship contribution statement
Yuanjiang Li: Methodology, Software, Writing - original draft. Yanbo Wang: Data curation, Software. Yi Zhang: Visualization, Investigation, Validation. Jinglin Zhang: Methodology, Supervision, Writing - review & editing.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgement
This project was supported partially by National Natural Science Foundation of China (Grants: 41775008, 61702275, 51977101).
Yuanjiang Li received the MS degree in Information and Communication Engineering from Jiangsu University of Science and Technology, Zhenjiang, China in 2006 and his PhD degree in electronics and telecommunications from Nanjing University of Technology in 2013. Since 2013, he has been on Jiangsu University of Science and Technology. His current research interest includes computer vision, deep learning and fault diagnosis.
References (29)
- et al.
Bearing fault detection of induction motor using wavelet and support vector machines (SVMs)
Appl. Soft Computer.
(2011) - et al.
Adaptive deep feature learning network with Nesterov momentum and its application to rotating machinery fault diagnosis
Neurocomputing
(2018) - et al.
Convolutional neural network based fault detection for rotating machinery
J. Sound Vib.
(2016) - et al.
Methodology of neural modeling in fault detection with the use of chaos engineering
Eng. Appl. Artif. Intell.
(2015) - et al.
A sparse auto-encoder-based deep neural network approach for induction motor faults classification
Measurement
(2016) - WEI Shurong, REN Zixu, FU Yang, HU Hao, ZHANG Kaihua. Early Stage Inter-turn Faults Detection Technique for the Rotor...
- et al.
A new detection coil capable of performing online diagnosis of exciting winding short-circuits in steam-turbine generators
IEEE Trans. Energy Convers.
(2018) - Xie Y, Zhang T. A fault diagnosis approach using SVM with data dimension reduction by PCA and LDA method[C]// Chinese...
- et al.
An improved empirical mode decomposition based on adaptive weighted rational quartic spline for rolling bearing fault diagnosis
IEEE Access
(2020) - et al.
Algorithm learning based neural network integrating feature selection and classification
Expert Systems with Applications
(2013)
Research status and challenges of deep learning in fault diagnosis
Control Decision
Hierarchical adaptive deep convolution neural network and its application to bearing fault diagnosis
Measurement
Deep neural networks: a promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data
Mech. Syst. Signal Process.
Stacked Sparse Autoencoder (SSAE) for Nuclei Detection on Breast Cancer Histopathology Images
IEEE Transactions on Medical Imaging
Cited by (40)
Permanent magnet synchronous motor inter-turn short circuit diagnosis based on physical-data dual model under oil-drilling environment
2024, Engineering Applications of Artificial IntelligenceRecent progress and prospective evaluation of fault diagnosis strategies for electrified drive powertrains: A comprehensive review
2023, Measurement: Journal of the International Measurement ConfederationStator current operation compensation (SCOC): A novel preprocessing method for deep learning-based fault diagnosis of permanent magnet synchronous motors under variable operating conditions
2023, Measurement: Journal of the International Measurement Confederation
Yuanjiang Li received the MS degree in Information and Communication Engineering from Jiangsu University of Science and Technology, Zhenjiang, China in 2006 and his PhD degree in electronics and telecommunications from Nanjing University of Technology in 2013. Since 2013, he has been on Jiangsu University of Science and Technology. His current research interest includes computer vision, deep learning and fault diagnosis.
Yanbo Wang is the graduated student at Jiangsu University of Science and Technology. His current research interest includes computer vision, deep learning and fault diagnosis.
Yi Zhang received the MS degree in Electrical engineering from Jiangsu University, Zhenjiang, China in 2005 and his PhD degree in Electrical engineering from Jiangsu University in 2009. Since 2009, she has been on Jiangsu University of Science and Technology. His current research interest includes optimum control, deep learning and fault diagnosis.
Jinglin Zhang received the MS degree in circuits and systems from Shanghai University, Shanghai, China in 2010 and his PhD degree inelectronics and telecommunications from the National Institute of Applied Sciences-Rennes (INSA de Rennes) in 2013. From 2014 to 2020, he was on the faculty of the School of computer and software, Nanjing University of Information Science Technology. Since 2021, he has been on the faculty of the School of Artificial Intelligence, Hebei University of Technology. His current research interest includes computer vision, high-performance computing, interdisciplinary research with pattern recognition and atmospheric science.