A novel deep learning method based on attention mechanism for bearing remaining useful life prediction

doi:10.1016/j.asoc.2019.105919

Applied Soft Computing

Volume 86, January 2020, 105919

https://doi.org/10.1016/j.asoc.2019.105919 Get rights and content

Highlights

•
A novel neural network for bearing remaining useful life prediction is proposed.
•
This end-to-end method with attention mechanism required little prior knowledge.
•
Useful degradation information can be mined from long historic data.

Abstract

Rolling bearing is a key component in rotation machine, whose remaining useful life (RUL) prediction is an essential issue of constructing condition-based maintenance (CBM) system. However, recent data-driven approaches for bearing RUL prediction still require prior knowledge to extract features, construct health indicate (HI) and set up threshold, which is inefficient in the big data era. In this paper, a pure data-driven method for bearing RUL prediction with little prior knowledge is proposed. This method includes three steps, i.e., features extraction, HI prediction and RUL calculation. In the first step, five band-pass energy values of frequency spectrum are extracted as features. Then, a recurrent neural network based on encoder–decoder framework with attention mechanism is proposed to predict HI values, which are designed closely related with the RUL values in this paper. Finally, the final RUL value can be obtained via linear regression. Experiments carried out on the dataset from PRONOSTIA and comparison with other novel approaches demonstrate that the proposed method achieves a better performance.

Introduction

As one of the most critical components, rolling element bearing is widely used to evaluate the statement of rotating machine. Any unexpected bearing failures would deflect the mechanical system from normal state, such as precision loss, productivity reduction and even the increase of safety risks [1], [2], [3], [4], [5]. To employ condition-based maintenance (CBM) strategy on rotating machine to avoid such disaster, bearing remaining useful life (RUL) estimation is one of the major tasks.

Generally, RUL prediction methods could be grouped into two main categories, i.e. model-based approaches and data-driven approaches [6]. The core of model-based approaches is to build a mathematical model to accurately describe the degradation of machinery. However, the constructing process requires not only the parameters of the real engineering system after a series of measurements, but also extensive prior knowledge about the systems. Actually, few machineries can be accurately simulated by simple mathematical models for the complexity of real world. Nowadays, model-based approaches for predicting a general trend of the degradation of machinery are still helpful, including particle filter [7] , Eyring model [8] , Weibull distribution [9], etc. On the other hand, data-driven methods attempt to learn the machinery degradation patterns based on historical collected data. Thanks to the strong learning ability, these methods are capable of revealing the underlying correlations and causalities between the phenomenon (the collected data) and the reason (the statement of the corresponding system), especially the complex ones that hardly can be described by manual mathematical model. Because of this characteristic, more and more data-driven approaches in the field of machinery prognostics are proposed and achieved better prognostic results. A comprehensive review of statistical data-driven approaches is presented by Si et al. [10]. Lei et al. [11] built a data-driven model based on artificial neural network to predict bearing RUL. In [12], Huang et al. suggested that a novel health indicator (HI) called minimum quantization error obtained by a self-organizing map (SOM), is then helpful to train back propagation neural networks for degradation prediction model. Chen et al. [13] developed a prognostic method using adaptive neuro-fuzzy inference systems and high order particle filtering. Loutas et al. [14] proposed a data-driven approach for bearing RUL estimation based on support vector regression (SVR), which utilizes multiple statistical features from time-domain, frequency domain and time-scale domain. And more data-driven approaches are introduced briefly in a recent review paper [15].

There is no doubt that, when faced with enormous amount of collected data, data-driven approaches based on deep learning (DL) show more effective processing capacity and achieve more excellent performance, especially in the field of computer vision, natural language processing, speech processing, etc. [16], [17], [18]. And some deep learning techniques have already found their way into machine health monitoring systems. Zhu et al. proposed a stacked autoencoder (SAE) based DNN (Deep Neural Network) for hydraulic pump fault diagnosis that uses frequency features generated by Fourier transform [19]. Liu et al. uses normalized spectrum generated by short-time Fourier transform (STFT) of sound signal as inputs of a 2-layer SAE based DNN. Some researchers [20], [21] feed multi-domain statistical features including time domain features, frequency domain features and time-frequency domain features into SAE as a way of feature fusion. Zhang et al. [22], [23] proposed a novel convolution neural network (CNN) to make a successful bearing fault diagnosis directly on vibration signals. Furthermore, other difficult bearing fault diagnosis problems can be solved by more complex CNNs [24], [25], [26]. Although more and more DL-based approaches are employed to deal with fault diagnose problems in mechanical system, few successful cases can be found in addressing prognosis problem. Recurrent neural network (RNN), instead of CNN, is a smart choice for seeking underlying knowledge from historical data. Malhi et al. [27] proposed a competitive learning-based approach based on RNN for long-term prognostics of machine health statement, where vibration signals collected from a defected rolling bearing are pre-processed with the continuous wavelet transform and used as the model input. Besides, a long short-term memory (LSTM) based neural network scheme was proposed by Yuan et al. [28] for RUL estimation of aero-engines, in the cases of complicated operations, hybrid faults and strong noises. LSTM was also utilized by Zhao et al. [29] for a tool wear health monitoring task.

Though many works mentioned above have achieved good results, few of them are purely data-driven. In order to make an accurate prognosis, two main steps are still essential: health indicator calculation and bearing RUL prediction. For now, manual methods for calculating health indicator are still the most popular and common ones, because such selected indicator has an obvious trend to make it easier to predict the RUL of bearing. But this still requires much expert knowledge on mechanical system, degradation theory and statistics. Besides, data-driven methods have always been employed in the second step to regress the degradation curve of health indicator, but the failure point is still calculated according to an empirical special threshold in general.

Furthermore, accurate and effective prognosis in mechanic system is still hampered by a vital drawback of RNN. The collected data responding to the whole life of the machine is so long that RNN failed to process it, due to the so-called vanishing and exploding gradient problems [30]. To the authors’ best knowledge, most of proposed RNN-based models for prognosis make do with sliced data instead of the whole long data [31], which may cause some problems such as overfitting and decrease the prognosis accuracy.

In order to solve the aforementioned shortcoming, this paper proposes an RNN model based on encoder–decoder structure with attention mechanism. First, five band-pass energy values of frequency spectrum of vertical and horizontal vibration signals are used as input to train and test the proposed RNN model. Different from traditional RNN structure, attention machine is added to decide the attention distribution according to the first look by encoder, which help decoder make a better prognosis and overcome the vanishing and exploding gradient problems. As a result, a sequence of HI values ranges from 0 to 1 is supposed to be obtained. Finally, by linear regression with least square method, the accurate prediction of bearing RUL can be calculated. The main contributions of this paper are summarized as follows.

(1)
A novel method with little prior knowledge for rolling bearing RUL prediction is proposed, and achieved high prognosis accuracy.
(2)
An RNN model based on encoder–decoder structure with attention mechanism is proposed to mine useful degradation information from long historic data, which is validated by visualizing the attention distribution.

The rest of this paper is arranged as follows. Section 2 introduces the basic theories of RNN, LSTM, GRU (Gated Recurrent Unit), encoder–decoder structure and attention mechanism. The detailed process of proposed approach is then described in Section 3. In Section 4, our method is validated using the dataset from the accelerated degradation testing on rolling element bearings. Furthermore, the analysis of the proposed neural network is also presented to explain its effectiveness. Finally, conclusions are drawn in Section 5.

Section snippets

Theoretical background

To deal with prognosis problem in bearing RUL, it is essential to obtain degradation tendency according to historical information with uncertain length. And RNN is designed to process such sequence data. In this section, some theories about RNN, encoder–decoder frameworks and attention mechanism are introduced in brief.

Prognostic procedure

In this section, the procedure of the proposed approaches is described in detail. As shown in Fig. 3, this process mainly contains three steps, feature extraction, HI calculation by RNN and bearing RUL prediction via linear regression. In the first step, energy values of five sub-bands frequency spectra are extracted from raw signal as the features. Then, a sequence of HI values between 0 and 1 is obtained as the output of the proposed neural network with features as input. Finally, the RUL

Experiment verification

In order to validate the proposed method, an experimental dataset is introduced as training and testing data to evaluate its performance in this section. Further comparison with recent novel methods is also carried in the later part. And some analyzes are carried out in final.

Conclusion

Accurate RUL prediction highly depends on using the long-time-depended information from the long-time sequence data effectively. In this paper, a pure data-driven approach based on encoder–decoder framework is proposed. During the proposed procedure of RUL prediction, features extraction and threshold setting is automatic and without any prior expert knowledge. Finally, in the validation, the proposed method achieves the lowest average percent error and highest average score compared with other

Funding

This work was supported by the National Natural Science Foundation of China (Grant no. 51875138), and the National Science and Technology Major Project of China (863 Program, No. 2016ZX04002-004).

References (45)

LeiY. et al.
Condition monitoring and fault diagnosis of planetary gearboxes: A review
Measurement
(2014)
GhodsA. et al.
Probabilistic frequency-domain discrete wavelet transform for better detection of bearing faults in induction motors
Neurocomputing
(2016)
WangY. et al.
Rotating speed isolation and its application to rolling element bearing fault diagnosis under large speed variation conditions
J. Sound Vib.
(2015)
JiaF. et al.
Deep neural networks: A promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data
Mech. Syst. Signal Process.
(2016)
JouinM. et al.
Particle filter-based prognostics: Review, discussion and perspectives
Mech. Syst. Signal Process.
(2016)
JouinM. et al.
Degradations analysis and aging modeling for health assessment and prognostics of PEMFC
Reliab. Eng. Syst. Saf.
(2016)
SiX.S. et al.
Remaining useful life estimation–a review on the statistical data driven approaches
European J. Oper. Res.
(2011)
RenL. et al.
Multi-bearing remaining useful life collaborative prediction: A deep learning approach
J. Manuf. Syst.
(2017)
HuangR. et al.
Residual life predictions for ball bearings based on self-organizing map and back propagation neural network methods
Mech. Syst. Signal Process.
(2007)
LeiY. et al.
Machinery health prognostics: A systematic review from data acquisition to RUL prediction
Mech. Syst. Signal Process.
(2018)

ZhangW. et al.

A deep convolutional neural network with new training methods for bearing fault diagnosis under noisy environment and different working load

Mech. Syst. Signal Process.

(2018)

ChenY. et al.

ACDIN: Bridging the gap between artificial and real bearing damages for bearing fault diagnosis

Neurocomputing

(2018)

ZhuZ. et al.

A convolutional neural network based on a capsule network with strong generalization for bearing fault diagnosis

Neurocomputing

(2019)

GuoL. et al.

A recurrent neural network based health indicator for remaining useful life prediction of bearings

Neurocomputing

(2017)

HinchiA.Z. et al.

Rolling element bearing remaining useful life estimation based on a convolutional long-short-term memory network

Procedia Comput. Sci.

(2018)

HongS. et al.

Condition assessment for the performance degradation of bearing based on a combinatorial feature extraction method

Digit. Signal Process.

(2014)

LiuJ. et al.

An enhanced diagnostic scheme for bearing condition monitoring

IEEE Trans. Instrum. Meas.

(2010)

ZhuJ. et al.

Estimation of bearing remaining useful life based on multiscale convolutional neural network

IEEE Trans. Ind. Electron.

(2019)

AliJ.B. et al.

Accurate bearing remaining useful life prediction based on Weibull distribution and artificial neural network

Mech. Syst. Signal Process.

(2015)

ChenC. et al.

Machine condition prediction based on adaptive neuro–fuzzy and high-order particle filtering

IEEE Trans. Ind. Electron.

(2011)

LoutasT.H. et al.

Remaining useful life estimation in rolling bearings utilizing data-driven probabilistic e-support vectors regression

IEEE Trans. Reliab.

(2013)

KrizhevskyA. et al.

Imagenet classification with deep convolutional neural networks

Cited by (253)

Remaining useful life estimation of bearing via temporal convolutional networks enhanced by a gated convolutional unit
2024, Engineering Applications of Artificial Intelligence
In the field of prognostics and health management (PHM) for industrial equipment and systems, the estimation of remaining useful life (RUL) constitutes a fundamental task. A reliable and accurate method for estimating the RUL is therefore essential. This paper proposes a dynamic self-adaptive ensemble model, aimed at improving the rolling bearing RUL prediction. This model integrates an adaptive multi-scale feature extractor, a gated convolutional unit (GCU) and temporal convolutional networks (TCN). Through a redesign of the data flow, this model directly incorporates multi-scale comprehensive feature evaluation indicators into the neural network data flow, significantly enhancing the model's feature extraction capabilities. Subsequently, the study extends the traditional TCN by incorporating the GCU module and its gating mechanisms, further strengthening the model's capacity to capture long-term dependencies in sequence tasks. Experimental results demonstrate that, compared to existing state-of-the-art (SOTA) models, our method achieves at least a 10% increase in the prediction accuracy on two public run-to-failure bearing datasets. Beyond the tested datasets, the architecture that directly maps multi-scale evaluation indicators into the structure of neural network data flows also holds potential for broader application across diverse PHM tasks, promising significant improvements in the industrial safety and efficiency.
Method to enhance time series rolling fault prediction by deep fast Fourier convolution
2024, Measurement: Journal of the International Measurement Confederation
Deep learning realizes the adaptive extraction of features in remaining useful life (RUL), and most of the methods combined with traditional signal processing stay in the preprocessing stage. A fast Fourier convolutional gated recurrent unit (FFCGRU) method is proposed, which incorporates fast Fourier transform (FFT) into a convolutional neural network for adaptive feature extraction, enhancing fault prediction. First, a plug-and-play FFT convolution block is proposed. This method combines signal processing with the adaptive deep learning process and achieves a larger receptive field through a smaller convolution kernel. Secondly, an adaptive pooling layer is designed before the feature map is input into the gated recurrent unit. The feature maps of different channels are aggregated into one feature value to reduce the redundant information of the signal feature map. Comparative studies indicate that the FFCGRU method utilizes a small number of parameters even with large convolutional kernels and exhibits robustness.
Joint optimization of degradation assessment and remaining useful life prediction for bearings with temporal convolutional auto-encoder
2024, ISA Transactions
Remaining useful life (RUL) prediction and degradation assessment are pivotal components of prognostic and health management (PHM) and represent vital tasks in the implementation of predictive maintenance for bearings. In recent years, data-driven PHM techniques for bearings have made substantial progress through the integration of deep learning methods. However, modeling the temporal dependencies inherent in raw vibration signals for both degradation assessment and RUL prediction remains a significant challenge. Hence, we propose a joint optimization architecture that uses a temporal convolutional auto-encoder (TCAE) for the degradation assessment and RUL prediction of bearings. Specifically, the architecture includes a sequence-to-sequence model to extract degradation-sensitive features from the raw signal and utilizes temporal distribution characterization (TDC) and a nonlinear regressor to determine the degradation stages and predict RUL, respectively. Our framework integrates the tasks of degradation assessment and RUL prediction in a unified, end-to-end manner, using raw signals as input, resulting in high RUL prediction accuracy (RMSE = 0.0832) on publicly available and self-built datasets. Our approach outperforms state-of-the-art methods, indicating its potential to significantly advance the field of PHM for bearings.
RUL prediction of rolling bearings across working conditions based on multi-scale convolutional parallel memory domain adaptation network
2024, Reliability Engineering and System Safety
Rolling bearings are widely used in mechanical equipment, effectively determining the failure time of rolling bearings is particularly significant to ensure the safe performance of mechanical equipment. However, in industrial scenarios, the machine mainly works in the normal state for a long time, it is difficult to accumulate the same distribution of the whole life data, but the use of different distribution of data for forecasting will reduce the performance of deep learning-based prediction methods. Therefore, in order to tackle this problem, a multi-scale convolutional parallel memory domain adaptation network is investigated to forecast the residual useful life (RUL) of bearings across working conditions. Firstly, a new characteristic extractor—multi-scale convolutional parallel memory network is designed to extract spatial and temporal characteristics of bearing degradation data. At the same time, in order to minimize the distribution difference between source domain and target domain, a temporal-spatial feature alignment strategy is proposed to obtain domain invariable characteristics by combining maximum mean difference and domain adversarial learning. Finally, the availability of the proposed approach is verified using two rolling bearing data sets. The results reveal that it can efficiently forecast the RUL of rolling bearings across working conditions.
A dual attention LSTM lightweight model based on exponential smoothing for remaining useful life prediction
2024, Reliability Engineering and System Safety
Accurate remaining useful life (RUL) prediction of degrading systems is crucial to predict failures in advance and develop maintenance plans. As systems degrade gradually over time, sequential degradation feature (SDF) is very important. However, in attention mechanism (AM) based RUL prediction approaches, the sequential operation at each time step is abandoned. Further, these methods are modeled based on numerous parameters, making it difficult to enable timely RUL prediction. Therefore, this paper proposes a dual attention and long short-term memory (LSTM) lightweight model (DA-LSTM). LSTM compensates for the shortcomings of AM in modeling SDF, and exponential smoothing is adopted to train a lightweight model. Specifically, the SDF is divided into aggregated encoding feature (AEF) and aggregated original feature (AOF). AEF is obtained by the encoder which includes a novel soft attention mechanism and an LSTM network. To prevent losing useful information during the encoding process, the second attention layer aggregates the original sensor signal to obtain AOF. Finally, the decoder LSTM network combines AEF with AOF and calculates RUL based on a weighting average method. Extensive experiments are conducted on the C-MAPSS dataset to verify model effectiveness. The results show the superiority of DA-LSTM in prediction accuracy and computational quantity.
A new unsupervised health index estimation method for bearings early fault detection based on Gaussian mixture model
2024, Engineering Applications of Artificial Intelligence
Bearings are indispensable components of machinery, playing a critical role in effective health monitoring. This monitoring is vital in detecting equipment incipient failure and reducing maintenance costs. The bearing degradation is a complex nonlinear process that defies straightforward description using physical models or predefined degradation patterns. Bearings exhibit inherent fault frequencies, and the early faults of bearings can alter their monitoring data distribution. Gaussian Mixture Modeling (GMM) can effectively visualize these changes in data distribution. This research focuses on the development of a new unsupervised method for constructing the bearing Health Index (HI) using GMM to estimate vibration signal distributions. Firstly, we introduce a new unsupervised HI construction method, named GMM-HI, designed to provide insight into the bearing degradation process. Secondly, the Wasserstein distance is adopted as the bearing HI, measuring the distance between initial and current health data. Thirdly, isotonic regression is utilized to address spurious fluctuations in bearing monitoring signals. Through extensive experimentation on three bearing datasets, our results demonstrate that the newly introduced GMM-HI allows for accurate detection of bearing early failures. It effectively categorizes the health stages of bearings and provides a unified approach for establishing bearing failure thresholds in estimating their remaining useful life. In comparison to other well-known HI methods, GMM-HI effectively and accurately characterizes the health index of bearings.

View all citing articles on Scopus

View full text

A novel deep learning method based on attention mechanism for bearing remaining useful life prediction

Highlights

Abstract

Introduction

Section snippets

Theoretical background

Prognostic procedure

Experiment verification

Conclusion

Funding

Measurement

Neurocomputing

J. Sound Vib.

Mech. Syst. Signal Process.

Mech. Syst. Signal Process.

Reliab. Eng. Syst. Saf.

European J. Oper. Res.

J. Manuf. Syst.

Mech. Syst. Signal Process.

Mech. Syst. Signal Process.

Mech. Syst. Signal Process.

Neurocomputing

Neurocomputing

Neurocomputing

Procedia Comput. Sci.

Digit. Signal Process.

An enhanced diagnostic scheme for bearing condition monitoring

IEEE Trans. Instrum. Meas.

Estimation of bearing remaining useful life based on multiscale convolutional neural network

IEEE Trans. Ind. Electron.

Accurate bearing remaining useful life prediction based on Weibull distribution and artificial neural network

Mech. Syst. Signal Process.

Machine condition prediction based on adaptive neuro–fuzzy and high-order particle filtering

IEEE Trans. Ind. Electron.

Remaining useful life estimation in rolling bearings utilizing data-driven probabilistic e-support vectors regression

IEEE Trans. Reliab.

Imagenet classification with deep convolutional neural networks