An experimental methodology to evaluate machine learning methods for fault diagnosis based on vibration signals

doi:10.1016/j.eswa.2020.114022

Expert Systems with Applications

Volume 167, 1 April 2021, 114022

https://doi.org/10.1016/j.eswa.2020.114022 Get rights and content

Highlights

•
Systematic analysis of overoptimistic results in machine learning fault diagnosis.
•
Computational framework to test new feature models.
•
Computational framework to test new classifier architectures
•
Experimental analysis of more realistic fault diagnosis scenarios.

Abstract

This paper presents a systematic procedure to fairly compare experimental performance scores for machine learning methods for fault diagnosis based on vibration signals. In the vast majority of related scientific publications, the estimated accuracy and similar performance criteria are the sole quality parameter presented. However, the experimental design giving rise to these results is mostly biased, based on unacceptably simple validation methods and on recycling identical patterns in test data sets, previously used for training. Moreover, the methods in general overfit their hyperparameters, introducing additional overoptimistic results. In order to remedy this defect, we critically analyse the usual training-validation-test division and propose an algorithmic guideline in the form of a validation framework. This allows a well defined comparison of experimental results. In order to illustrate the ideas of the paper, the Case Western Reserve University Bearing Data benchmark is used as a case study. Four distinct classifiers are experimentally compared, under gradually more difficult generalization tasks using the proposed evaluation framework: K-Nearest-Neighbor, Support Vector Machine, Random Forest and One-Dimensional Convolutional Neural Network. An extensive literature review suggests that most vibration based research papers, particularly for the Case Western Reserve University Bearing Data, use similar patterns for training and testing, making their classification an easy task.

Introduction

Software based fault diagnosis is an essential tool to guarantee the safety and maintainability of dynamic processes (Gao et al., 2015, Chiang et al., 2001). A principal distinction of employable methods is model-based fault diagnosis, c.f. for instance (Varga, 2017, Gertler, 2017, Ding, 2012), and model-free diagnosis, e.g. (Ding, 2016, McMillan and Vegas, 2019). Vibration based fault diagnosis focuses on analysing vibration signals in order to identify possible equipment faults. Some research works try to find signal characteristics or measures for identifying faults (Smith and Randall, 2015, Diaz et al., 2015, Diaz et al., 2017). Other research works focus on machine learning techniques that use the signals for training and testing classifiers for identifying faults. However, this work focuses on the latter approach.

The access to real world, well documented benchmark data is limited. A limited set of repositories with real vibration signals, obtained from existing mechanical systems, is available (Lee et al., 2007, Nectoux et al., 2012, Bechhoefer, 2016, Paderborn, 2020, MaFaulDa, 2016). The Case Western Reserve University (CWRU) Bearing Data (CWRU, 2014) is probably referred to the most in scientific literature. Some research papers use non publicly available vibration data sets (Liao et al., 2019, Lei et al., 2016, Verstraete et al., 2017).

An extensive amount of research works apply model-free, machine learning methods to classify distinct operational states of the process. A robust fault diagnosis system must be able to generalize well. This means that a trained classifier should be able to recognize as many faults as possible, even when there are variations of the machine conditions. Presented results must be reproducible and must be statistically significant; otherwise overoptimistic results are obtained. For instance, when a limited amount of samples are available, the data must be separated into non-overlapping sets. Each set may not be visible to the other set during the parametrization of the diagnosis system. It must be avoided that the same data is used both for tuning the hyperparameters of a classifier model and for testing the resultant classifier. Otherwise the results will probably be biased towards optimism. A typical example are the kernel and its associated variables in a Support Vector Machine (SVM). Which kernel is best for a C-SVM, RBF or Polynomial? And then, when using, for instance, the RBF kernel, use a grid search to find the best combination of the C regularization value and of the spread $γ$ . Another example are deep learning structures of artificial neural networks where the layout is sometimes adjusted until it delivers the best results for the same data set. A more justified way of presenting performance scores is to isolate a test set completely. With the rest of the data, the hyperparameter tuning and the training can be done. When the final adjustments have been made, the test set is then used to estimate, for instance, an accuracy score. This procedure can be repeated several times, however the test set needs to always be kept apart. This hierarchy will be denoted as the inner and outer loop. Even then, when there is only a small amount of samples available for training, the performance may surpass theoretical Bayes limits.

Another common practice with vibration based machine learning research consists of defining a class, based on the chunks of a single chopped signal, even sometimes overlapping. Then, chunks of the same signal are used both on the training and testing data sets (Lei et al., 2016, Zhang et al., 2017, Verstraete et al., 2017, Liao et al., 2019). We call this the similarity bias problem. The patterns used for testing are almost indistinguishable from patterns used in the training data set. This fact may lead to an oversimplified model of real world fault diagnosis problems. A robust system must comprehend different machine conditions, and still be able to provide reasonable diagnostic information.

An important aspect when experimentally comparing fault diagnosis approaches consists of statistically verifying if the results are significantly different. However, this is seldom the case in the vibration based fault diagnosis research works.

The concern with the reproducibility of scientific research has steadily increased recently. Reproducibility is defined as obtaining consistent results using the same data and computational code as the original study. It has been found that many scientific studies are difficult or impossible to reproduce. The vibration based fault diagnosis works often have a lack of reproducibility.

This work proposes an experimental methodology for evaluating machine learning approaches for fault diagnosis based on vibration signals aiming to cope with all previously mentioned problems, i.e. completely isolating the test set, avoiding the similarity bias, verifying statistically significant differences and also allowing reproducibility. The main contributions of the paper are

1. A methodology for machine learning applied to vibration signal analysis, integrating nested cross validation, reproducibility, statistical analysis and avoiding similarity bias. Except for the latter, all these topics have already been applied in an isolated manner in the context to fault diagnosis;
2. An experimental study with synthetic data sets suggests superiority of the nested cross validation approach, especially for a small number of training samples;
3. The identification of common problems of research papers in the area of model-free fault diagnosis;
4. Identification of the overoptimistic bias, due to very similar training and test samples of the same class of machine conditions;
5. A study of the CWRU database, the principal real vibration signal source used in the scientific literature of the area.

The rest of the paper is organized in the following manner: Section 2 reviews conventional performance estimation methods in supervised learning problems. Furthermore, the common cross validation techniques are improved by a two-level evaluation hierarchy that isolates performance estimation from hyperparameter tuning. Synthetic data sets with known Bayesian error bounds are used to juxtapose the conventional and improved evaluation methods, suggesting the mostly overoptimistic results in research papers. Section 3 focuses on the important aspects of the proposed methodology for avoiding the similarity bias, for checking statistical significance of result differences, and also how to guarantee reproducibility. Section 4 critically reviews research works that use the vibration signals on machine learning approaches for fault diagnosis. Special focus is given to the CWRU mechanical data set, emphasizing the applied methods, and highlighting, if applicable, the defects that motivate the elaboration of this study. Moreover, the paper shows how the methodology may be customized for a specific data set. Section 5 and Section 6 present the case study of the CWRU Bearing Data Set. Section 5 defines different experimental modes for fault diagnosis, with gradually increasing difficulties. Section 6 provides an application of the proposed framework for the CWRU data, with four different classifier models, varying the generalization difficulty of the fault diagnosis, and finally the conclusions are drawn in Section 7.

Section snippets

Supervised learning performance evaluation techniques

In this section, a fair performance estimation framework with an outer validation loop is defined and introduced by data sets with analytically known Bayesian error rates.

Important additional aspects of the proposed methodology

This section describes three important aspects of the proposed methodology: requirement of reproducibility, avoiding similarity bias, and verifying the statistical significance of the difference in the results.

Related research in vibration based fault diagnosis

Section 2 and Section 3 present an experimental methodology for evaluating model-free machine learning methods for fault diagnosis based on vibration signals. This methodology requires that experiments should use nested cross validation as their evaluation method, be completely described to allow their reproducibility, avoid the similarity bias problem and apply appropriate statistical tests for showing significant differences between the proposed methods and other state of the art methods from

Proposal for CWRU systematic performance comparison

This section initially describes how the CWRU signal files were used to define the classes of the fault diagnosis problem and generate the patterns used for training and testing the classifiers. It also describes which feature extraction models were applied on each chunk of data. The second part of the section presents three proposals for the distribution of patterns with different degrees of difficulty. The first form of division is proposed without concerning the effects of the similarity

Experiments

In the following we report classification experiment results with two principal objectives. Firstly, the tests are a practical application of the proposed nested performance evaluation. Secondly, a more realistic scenario for fault diagnosis is created by the fusion of several machine conditions into a single class. Moreover, in order to evaluate their generalization capabilities, the classifiers are subjected to data that has never been seen during training. We qualitatively show that the high

Conclusion

This work presents a performance evaluation framework for machine learning approaches for fault diagnosis from vibration signals. One experimental study is performed showing that nested cross validation is more reliable than conventional cross validation techniques. The work also identifies common methodological evaluation drawbacks on machine learning approaches for fault diagnosis. Special attention is given to the identification of the similarity bias problem and how it impacts the folds

Computational framework

In order to facilitate experimental comparisons of methods involving the CWRU data, Python source code and the full experimental results of this work are provided at http://bit.ly/2S0Dnhj. The programming framework allows to evaluate classification, feature extraction and feature selection methods. It is implemented using Numpy, Scikit-learn and Keras, following the design pattern of these libraries. The python source code of the experiments with the synthetic Fukunaga data can be found at

CRediT authorship contribution statement

Thomas Walter Rauber: Writing - original draft, Writing - review & editing. Antonio Luiz da Silva Loca: Software, Resources, Investigation. de Francisco Assis Boldt: Software, Data curation, Resources. Alexandre Loureiros Rodrigues: Writing - review & editing, Software. Flávio Miguel Varejão: Conceptualization, Methodology, Project administration, Supervision, Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References (77)

T. Berredjem et al.
Bearing faults diagnosis using fuzzy expert system relying on an improved range overlaps and similarity method
Expert Systems with Applications
(2018)
M. Diaz et al.
Stability-based system for bearing fault early detection
Expert Systems with Applications
(2017)
M. Gan et al.
Construction of hierarchical diagnosis network based on deep learning and its application in the fault pattern recognition of rolling element bearings
Mechanical Systems and Signal Processing
(2016)
X. Guo et al.
Hierarchical adaptive deep convolution neural network and its application to bearing fault diagnosis
Measurement
(2016)
D.-T. Hoang et al.
Rolling element bearing fault diagnosis using convolutional neural network and vibration image
Cognitive Systems Research
(2019)
S. Kavathekar et al.
Fault classification of ball bearing by rotation forest technique
Procedia Technology
(2016)
X. Li et al.
Intelligent cross-machine fault diagnosis approach with deep auto-encoder and domain adaptation
Neurocomputing
(2020)
X. Li et al.
Understanding and improving deep learning-based rolling bearing fault diagnosis with attention mechanism
Signal Processing
(2019)
Z. Li et al.
Diversified learning for continuous hidden markov models with application to fault diagnosis
Expert Systems with Applications
(2015)
M.A. Marins et al.
Improved similarity-based modeling for the classification of rotating-machine failures
Journal of the Franklin Institute
(2018)

H. Shao et al.

Rolling bearing fault diagnosis using adaptive deep belief network with dual-tree complex wavelet packet

ISA Transactions

(2017)

C. Shen et al.

Fault diagnosis of rotating machinery based on the statistical parameters of wavelet packet paving and a generic support vector regressive classifier

Measurement

(2013)

W.A. Smith et al.

Rolling element bearing diagnostics using the case western reserve university data: A benchmark study

Mechanical Systems and Signal Processing

(2015)

H. Xu et al.

An intelligent fault identification method of rolling bearings based on lssvm optimized by improved pso

Mechanical Systems and Signal Processing

(2013)

C. Yiakopoulos et al.

Rolling element bearing fault detection in industrial environments based on a k-means clustering approach

Expert Systems with Applications

(2011)

J.-B. Yu

Bearing performance degradation assessment using locality preserving projections

Expert Systems with Applications

(2011)

W. Zhang et al.

A deep convolutional neural network with new training methods for bearing fault diagnosis under noisy environment and different working load

Mechanical Systems and Signal Processing

(2018)

Y. Zhang et al.

A new subset based deep feature learning method for intelligent fault diagnosis of bearing

Expert Systems with Applications

(2018)

M. Zhao et al.

Fault diagnosis of rolling element bearings via discriminative subspace learning: Visualization and classification

Expert Systems with Applications

(2014)

X. Zhao et al.

An effective procedure exploiting unlabeled data to build monitoring system

Expert Systems with Applications

(2011)

J.B. Ali et al.

Application of empirical mode decomposition and artificial neural network for automatic bearing fault diagnosis based on vibration signals

Applied Acoustics

(2015)

Bechhoefer, E. (2016). A quick introduction to bearing envelope analysis. MFPT Data, http://www. mfpt....

Y. Bengio et al.

Deep learning

(2017)

Y. Benjamini et al.

Controlling the false discovery rate: A practical and powerful approach to multiple testing

Journal of the Royal Statistical Society: Series B (Methodological)

(1995)

L. Breiman

Random forests

Machine Learning

(2001)

G. Casella et al.

(2002)

G.C. Cawley et al.

On over-fitting in model selection and subsequent selection bias in performance evaluation

Journal of Machine Learning Research

(2010)

L. Chiang et al.

Fault detection and diagnosis in industrial systems. Advanced textbooks in control and signal processing

(2001)

T. Cover et al.

Nearest neighbor pattern classification

Information Theory, IEEE Transactions on

(Jan. 1967)

CWRU (2014). Case Western Reserve University, Bearing Data Center. http://csegroups.case.edu/bearingdatacenter,...

F. de Assis Boldt et al.

A fast feature selection algorithm applied to automatic faults diagnosis of rotating machinery

Journal of Applied Computing Research

(2014)

de Assis Boldt, F., Rauber, T. W., Varejão, F. M. & Ribeiro, M. P. (2015). Fast feature selection using hybrid ranking...

Diaz, M., Henríquez, P., Ferrer, M. A., Alonso, J. B., Pirlo, G. & Impedovo, D. (2015). Novel method for early bearing...

S. Ding

Model-based fault diagnosis techniques: Design schemes, algorithms and tools. Advances in industrial control

(2012)

S.X. Ding

Data-driven design of fault diagnosis and fault-tolerant control systems

(2016)

X. Ding et al.

Energy-fluctuated multiscale feature learning with deep convnet for intelligent spindle bearing fault diagnosis

IEEE Transactions on Instrumentation and Measurement

(2017)

R.O. Duda et al.

Pattern classification

(2012)

L. Eren et al.

A generic intelligent bearing fault diagnosis system using compact adaptive 1d cnn classifier

Journal of Signal Processing Systems

(2019)

Cited by (48)

A synchronization-induced cross-modal contrastive learning strategy for fault diagnosis of electromechanical systems under semi-supervised learning with current signal
2024, Expert Systems with Applications
Electromechanical systems is widely employed in the manufacturing industry, with fault diagnosis being critical for ensuring the reliable operation of them. Vibration signals exhibit distinct fault features, but their acquisition is subject to various limitations. Conversely, current signals, while easily measurable, typically manifest weak fault features. Therefore, selecting a signal for fault diagnosis necessitates a trade-off among cost, performance, and feasibility. To overpass these obstacles and enable flexible fault diagnosis in complex environments, this paper presents a novel contrastive vibration-current (CVC) framework that leverages synchronization information from multiple modalities to enhance the performance of single-modality models, primarily the current model. This allows the choice of any signal for monitoring during the deployment phase. Specifically, we first preprocess current signals using Clarke transformation to highlight their fault information. Subsequently, the vibration model employs semi-supervised learning to make full use of labeled and unlabeled samples. Additionally, a noise-resistant augment Mean Teacher is incorporated to enhance the robustness of the vibration model. Then, using synchronization-induced cross-modal contrastive learning (SICMCL), vibration and current features are aligned. And at the decision level, the current model leverages pseudo-labels derived from vibration. The results of experiments demonstrate that CVC excels when relying solely on single-modal signals, owing to the effectiveness of SICMCL. Moreover, for the current model, SICMCL is more beneficial in improving performance than true labels.
Knowledge-informed deep networks for robust fault diagnosis of rolling bearings
2024, Reliability Engineering and System Safety
Effective fault defection is of critical importance in condition-based maintenance to improve the reliability of engineered systems and reduce operational cost. This paper introduces a knowledge-informed deep learning approach to fuse prior knowledge and critical health information extracted from raw monitoring data for robust fault diagnosis of rolling bearings. A set of knowledge-based features is first extracted based on prior knowledge of engineered systems. A knowledge-informed deep network (KIDN) is then designed to leverage these knowledge-based features with data-driven machine learning for the accurate prediction of bearing faults. To further enhance the generalizability of deep networks for fault diagnosis and alleviate extensive tuning efforts, a novel generalizability-based adaptive network design strategy is developed based on constrained Gaussian process (CGP) to quickly obtain the promising architectures for the development of knowledge-informed deep networks. Specifically, it involves the training of a constrained Gaussian process (CGP) surrogate model to predict the generalizability of KIDN and seeking potential improvements by exploring alternative network architectures within a vast design space. Four experimental case studies are implemented to validate the proposed methodology.
Advancements in condition monitoring and fault diagnosis of rotating machinery: A comprehensive review of image-based intelligent techniques for induction motors
2024, Engineering Applications of Artificial Intelligence
Recently, condition monitoring (CM) and fault detection and diagnosis (FDD) techniques for rotating machinery (RM) have witnessed substantial advancements in recent decades, driven by the increasing demand for enhanced reliability, efficiency, and safety in industrial operations. CM of valuable and high-cost machinery is crucial for performance tracking, reducing maintenance costs, enhancing efficiency and reliability, and minimizing mechanical failures. While various FDD methods for RM have been developed, these predominantly focus on signal processing diagnostics techniques encompassing time, frequency, and time-frequency domains, intelligent diagnostics, image processing, data fusion, data mining, and expert systems. However, there is a noticeable knowledge gap regarding the specific review of image-based CM and FDD. The objective of this research is to address the aforementioned gap in the literature by conducting a comprehensive review of image-based intelligent techniques for CM and fault FDD specifically applied to induction motors (IMs). The focus of the study is to explore the utilization of image-based methods in the context of IMs, providing a thorough examination of the existing literature, methodologies, and applications. Furthermore, the integration of image-based techniques in CM and FDD holds promise for enhanced accuracy, as visual information can provide valuable insights into the physical condition and structural integrity of the IMs, thereby facilitating early FDD and proactive maintenance strategies. The review encompasses the three main faults associated with IMs, namely bearing faults, stator faults, and rotor faults. Furthermore, a thorough assessment is conducted to analyze the benefits and drawbacks associated with each approach, thereby enabling an evaluation of the efficacy of image-based intelligent techniques in the context of CM and FDD. Finally, the paper concludes by highlighting key issues and suggesting potential avenues for future research.
An open source experimental framework and public dataset for vibration-based fault diagnosis of electrical submersible pumps used on offshore oil exploration
2024, Knowledge-Based Systems
An Electrical Submersible Pump (ESP) is an important equipment used in the industry for lifting liquids in various types of wells. An ESP is widely used in the oil industry for offshore exploration. Detecting a faulty ESP before installation is a predictive maintenance measure in order to extend its operational time. Machine learning fault diagnosis is an effective way for performing this task. Machine learning fault diagnosis algorithms are highly dependent of the availability of an appropriate problem dataset. This paper describes in detail the problem of ESP fault diagnosis and the ESPset dataset, a real-world and public dataset for vibration-based fault diagnosis of electrical submersible pumps used on offshore oil exploration. In addition, the paper also proposes an experimental framework for adequately comparing research works based on the ESPset dataset and defines benchmark classifiers and respective results as referential to the fault diagnosis research community. The framework considers the fact that some subset of samples are not drawn independently, and therefore, proposes a cross-validation sampling strategy that mitigates the similarity bias among samples. Indeed, this work shows that a conventional k-fold cross-validation may lead to a clear overestimation of the average performance. This fact is supported by results which show that the best classification model drops from a mean F-measure of 0.887 to 0.733 when removing the similarity bias from the data.
SBR-Extended Kalman Filter model-based fault diagnosis and signal reconstruction for the papermaking wastewater treatment process
2023, Journal of Water Process Engineering
In the study, for an existing paper mill, to realize the fault diagnosis and further fault signal reconstruction for the papermaking SBR wastewater treatment process, a novel model-based SBR-EKF (Extended Kalman Filter) fault diagnosis model has been proposed. Combining the SBR process model and EKF method, using the field 120 sets of normal data of dissolved oxygen (DO) and level (L) from a paper mill, the model-based SBR-EKF fault diagnosis model for the papermaking SBR wastewater treatment process was established, and the weighted sum of squared residuals thresholds (WSSR0) was determined off-line. Subsequently, based on the normal data, four common types of faults, fixed bias, drift bias, total failure, and precision degradation, were generated and applied to the developed SBR-EKF model for on-line monitoring. The fault diagnostic results show that, by comparing the calculated WSSR with the obtained WSSR0, the developed model-based SBR-EKF model demonstrated acceptable fault detection rates for DO and L. Moreover, using the filtered value of the SBR-EKF model, the effective signal reconstructions for L and DO were realized. These investigation results reveal the effectiveness of the proposed SBR-EKF fault diagnosis model, achieving fault diagnosis with acceptable precision and reconstructed signal.
Adaptive fusion based on physics-constrained dictionary learning for fault diagnosis of rotating machinery
2023, Manufacturing Letters
Modern manufacturing systems rely on multiple types of sensors to monitor the manufacturing processes and machine faults to ensure product quality. The costs associated with sensor installation, maintenance, data transmission, and data storage are high. In this paper, a new data fusion method based on physics-constrained dictionary learning is proposed to improve the efficiency of data collection and the accuracy of fault diagnosis. In the proposed dictionary learning method, the measurement, basis, and classification matrices are optimized for the effective use of compressed sensing technique in process monitoring. With the optimized matrices, full-scale high-resolution signals can be reconstructed from a small number of sensor measurements. At the same time, machine states can be identified based on the sparse measurements. An adaptive weight scheme is introduced to combine low-cost sensor data so that the accuracy of fault diagnosis can be improved. The proposed method is tested with an experimental dataset of gearbox vibrations caused by gear cracks and different levels of crack severities are classified. The results show that up to 70% of data collection can be saved with the new approach while more than 95% of diagnosis accuracy is achieved. The sensitivities of the performance with respect to the number of measurements and number of sensors are also studied.

View all citing articles on Scopus

View full text

An experimental methodology to evaluate machine learning methods for fault diagnosis based on vibration signals

Highlights

Abstract

Introduction

Section snippets

Supervised learning performance evaluation techniques

Important additional aspects of the proposed methodology

Related research in vibration based fault diagnosis

Proposal for CWRU systematic performance comparison

Experiments

Conclusion

Computational framework

CRediT authorship contribution statement

Declaration of Competing Interest

Expert Systems with Applications

Expert Systems with Applications

Mechanical Systems and Signal Processing

Measurement

Cognitive Systems Research

Procedia Technology

Neurocomputing

Signal Processing

Expert Systems with Applications

Journal of the Franklin Institute

ISA Transactions

Measurement

Mechanical Systems and Signal Processing

Mechanical Systems and Signal Processing

Expert Systems with Applications

Expert Systems with Applications

Mechanical Systems and Signal Processing

Expert Systems with Applications

Expert Systems with Applications

Expert Systems with Applications

Application of empirical mode decomposition and artificial neural network for automatic bearing fault diagnosis based on vibration signals

Applied Acoustics

Deep learning

Controlling the false discovery rate: A practical and powerful approach to multiple testing

Journal of the Royal Statistical Society: Series B (Methodological)

Random forests

Machine Learning

On over-fitting in model selection and subsequent selection bias in performance evaluation

Journal of Machine Learning Research

Fault detection and diagnosis in industrial systems. Advanced textbooks in control and signal processing

Nearest neighbor pattern classification

Information Theory, IEEE Transactions on

A fast feature selection algorithm applied to automatic faults diagnosis of rotating machinery

Journal of Applied Computing Research

Model-based fault diagnosis techniques: Design schemes, algorithms and tools. Advances in industrial control

Data-driven design of fault diagnosis and fault-tolerant control systems

Energy-fluctuated multiscale feature learning with deep convnet for intelligent spindle bearing fault diagnosis

IEEE Transactions on Instrumentation and Measurement

Pattern classification

A generic intelligent bearing fault diagnosis system using compact adaptive 1d cnn classifier

Journal of Signal Processing Systems