A data-based framework for fault detection and diagnostics of non-linear systems with partial state measurement

doi:10.1016/j.engappai.2012.09.004

Engineering Applications of Artificial Intelligence

Volume 26, Issue 1, January 2013, Pages 446-455

https://doi.org/10.1016/j.engappai.2012.09.004

Abstract

This paper presents a novel framework, based on dynamic neural networks, for data-based process monitoring, fault detection and diagnostics of non-linear systems with partial state measurement. The proposed framework considers three kinds of states in a generic system model: states that can easily be measured in real time and in situ, states that are difficult to measure online but can be measured offline to generate training data, and states that cannot be measured at all. The motivation for this categorization of state variables comes from a wide class of problems in the manufacturing and chemical industries, wherein certain states are not measurable without expensive equipment or offline analysis, while some other states may not be accessible at all. The framework uses a recurrent neural network to model the hidden dynamics of the system from available measurements, and uses this model along with a non-linear observer to augment the information provided by the measured variables. The performance of the proposed method is verified on a synthetic problem as well as a benchmark simulation problem.

Introduction

Process monitoring and fault detection methods may broadly be divided into two classes, signal-based methods and model-based methods, and a large number of applications of both may be found in the literature (Isermann, 2006, Patton et al., 2000). Signal-based methods (Chen and Liao, 2002, Qin, 2003) generally do not need mathematical models of the system but need data from faulty conditions to perform fault detection and diagnostics. Avoiding a model is desirable in many real-world applications, as the process may be too complex to model mathematically (as in manufacturing applications) or the effort required to develop a model may not be economically justifiable. Model-based methods (El-Farra and Ghantasala, 2007, Zarei and Poshtan, 2010, Castillo et al., 2012), on the other hand, make use of the fact that faults may change the relationship between the measured inputs and outputs, and thus allow the detection of deviations in quantities that are not directly observed. With such methods, fault detection may usually be done without data from faulty conditions. Fault diagnosis, however, may still require data from faulty conditions (Uppal et al., 2006), which are difficult to acquire in many applications. Therefore, this paper proposes an approach that tries to retain the benefits of both signal-based and model-based methods by developing a hybrid data-based and model-based framework that can learn dynamic process models from historical data and diagnose a class of faults without the need for data from faulty conditions or complex physics-based models of the system.

For many industrial systems, approximate data-based dynamic process models can be developed from historical process data using techniques such as autoregressive models with exogenous inputs (ARX) and neural networks. Recurrent neural networks (RNNs) in particular have been found to be very effective in modeling non-linear dynamic systems (Hou et al., 2007, Lee et al., 2001) and can approximate any discrete dynamic system that can be represented in state-space form to any desired degree of accuracy (Jin et al., 1995). The resulting data-based model may then be used for fault detection and isolation (FDI).
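As a rough illustration of how a dynamic model can be learned from historical input-output data, the following minimal sketch fits a linear ARX model by ordinary least squares. It is not the RNN training method used in this paper; the function name `fit_arx`, the model orders, and the synthetic "historical" data are all assumptions made for the example.

```python
import numpy as np

def fit_arx(u, y, na=2, nb=2):
    """Least-squares fit of y(k) = sum_i a_i*y(k-i) + sum_j b_j*u(k-j).

    Hypothetical helper: na and nb are the assumed model orders.
    """
    n = max(na, nb)
    rows = []
    for k in range(n, len(y)):
        past_y = [y[k - i] for i in range(1, na + 1)]
        past_u = [u[k - j] for j in range(1, nb + 1)]
        rows.append(past_y + past_u)
    phi = np.asarray(rows)                      # regressor matrix
    theta, *_ = np.linalg.lstsq(phi, np.asarray(y[n:]), rcond=None)
    return theta[:na], theta[na:]               # (a coefficients, b coefficients)

# Synthetic stand-in for historical process data (assumed, noise-free).
rng = np.random.default_rng(0)
u = rng.standard_normal(500)
y = np.zeros(500)
for k in range(2, 500):
    y[k] = 0.6 * y[k - 1] - 0.2 * y[k - 2] + 0.5 * u[k - 1] + 0.1 * u[k - 2]

a, b = fit_arx(u, y)
print("a =", a, "b =", b)
```

A linear ARX model of this kind cannot capture the non-linear behavior that motivates the use of RNNs, but the workflow is the same: build the model from logged data, then use it as the reference for FDI.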

Model-based fault detection and isolation methods are also called analytical redundancy methods, because they compare measured signals with estimates produced by models subject to the same input conditions in order to generate residuals. Many model-based methods have been proposed in the literature, and a short survey of these methods is provided here. Diagnostic information may be extracted from the residuals using simple limit checks or statistical tests (Montgomery, 2008). More robust methods of evaluating residuals, including adaptive threshold evaluation (Patton et al., 2000) and non-linear classifiers (Chen and Patton, 1999), have also been proposed. The generation of residuals, however, remains the focus of most model-based FDI methods, and residuals can be generated in a number of ways. A direct comparison of measured outputs with physics-based models (Moskwa et al., 2001, Song et al., 2003, Conatser et al., 2004) or data-based models (Capriglione et al., 2003, Calado et al., 2006, Uppal et al., 2006, Witczak et al., 2006) is the most straightforward way to generate residuals. Parity relation approaches generate the residual by checking the consistency of system input and output data over a time window (Gertler, 1997). Parameter estimation approaches directly make use of system identification techniques to isolate changes in critical but unmeasurable system parameters (Isermann, 1993). Observer-based methods for deterministic (Edwards et al., 2000, Hou and Patton, 1996, Frank and Ding, 1997) and stochastic systems (Tsai et al., 2007, Kobayashi and Simon, 2006, Li et al., 2005, Xiong et al., 2007) can be used to estimate unmeasured system states or parameters and, with suitably designed gain matrices, to generate residuals that are robust to modeling uncertainties. A bank of dedicated observers can be used to isolate faults in multi-input multi-output systems. Variations in the structure of the observer banks, such as all input one output, all input all but one output, one input all output, etc., can be used to isolate actuator, sensor and component faults (Isermann, 1997). The problem of robust model-based FDI in non-linear process systems, especially for actuators, has received significant attention in recent years (El-Farra and Ghantasala, 2007, Zarei and Poshtan, 2010, Castillo et al., 2012). The parity relation-based, observer-based and parameter estimation-based approaches are all related to each other, and the correspondence between them may be found in Chen and Patton (1999). In this work the observer-based approach is used because of the flexibility it affords. Even with sophisticated training methods, the use of data-based models such as the RNN introduces additional uncertainty in the model predictions, and this should be given due consideration by any model-based fault detection and isolation scheme. While a number of non-linear observers could be considered for the task, such as the Extended Kalman Filter (Reif et al., 1999) or the Unscented Kalman Filter (Julier and Uhlmann, 2004), this paper uses a stochastic non-linear observer called the Adaptive Divided Difference Filter (ADDF), which explicitly accounts for model error and is robust to it (Subrahmanya and Shin, 2009).
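The simplest form of residual generation and evaluation mentioned above, a direct comparison of measurements with model estimates followed by a limit check, can be sketched as follows. This is only an illustration: the function name `limit_check`, the three-sigma band estimated from fault-free residuals, and the sample numbers are assumptions, not the residual evaluation used in the proposed framework.

```python
import numpy as np

def limit_check(measured, estimated, normal_residuals, n_sigma=3.0):
    """Flag samples whose residual leaves a +/- n_sigma band.

    The band is estimated from residuals recorded under normal
    (fault-free) operation; n_sigma = 3 is an assumed choice.
    """
    residual = np.asarray(measured, dtype=float) - np.asarray(estimated, dtype=float)
    mu = np.mean(normal_residuals)
    sigma = np.std(normal_residuals)
    alarms = np.abs(residual - mu) > n_sigma * sigma
    return residual, alarms

# Hypothetical fault-free residuals used to set the threshold.
normal_residuals = np.array([0.1, -0.1, 0.05, -0.05, 0.0])

# Hypothetical model estimates and measurements; the third sample deviates.
estimated = np.array([1.0, 1.0, 1.0, 1.0])
measured = np.array([1.05, 0.95, 1.5, 1.02])

residual, alarms = limit_check(measured, estimated, normal_residuals)
print(residual, alarms)
```

A fixed band of this kind ignores model uncertainty that varies with the operating point, which is one reason adaptive thresholds and robust observers are considered in the literature surveyed above.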

Most model-based methods for FDI developed so far assume the availability of a first-principles model in which the important states of the system are part of the model and may be estimated if necessary. Data-based dynamic models, on the other hand, may not have interpretable states, and if some of the important states of the system need to be estimated online from input-output measurements (for the purpose of monitoring the process), then special care has to be taken to ensure that these states are modeled explicitly. In practice, the instrumentation required to measure all states may not be available, and the data available for modeling would include inputs, outputs and selected states. The proposed framework considers this important practical scenario, in which there are three kinds of states in the system model: states that can easily be measured in real time and in situ, states that are difficult to measure online but can be measured offline to generate training data, and states that cannot be measured at all. The motivation for this categorization of state variables comes from a wide class of problems in the manufacturing and chemical industries, wherein certain states (such as surface roughness in manufacturing or intermediate stream compositions in chemical processes) are not measurable without expensive equipment or offline analysis, while some other states may not be accessible at all. The goal then is to distinguish faults belonging to three classes: actuator faults (assumed to change the influence of an input on the model), component faults (assumed to be detectable and diagnosable by monitoring certain states of the system) and sensor faults (assumed to affect the measured states). While complex systems may exhibit faults that affect multiple elements (inputs, states and outputs), it is believed that the above categorization is still useful for getting a general idea of the location and effect of a fault. To the best of the authors' knowledge, this is the first paper to combine a model-based FDI system built on a data-based model with this practically important categorization of state variables.

A block diagram describing the architecture of the proposed framework is shown in Fig. 1. The methods used in the three major blocks in Fig. 1 will be described in the following sections. It should be noted that our works on various individual modules in Fig. 1 have been reported elsewhere and the main contribution of this work is the combination of these individual modules and the validation of the entire data-based fault detection and diagnostics scheme. First, the use of recurrent neural networks is proposed for the purpose of learning the dynamics of a system and a suitable structure and training algorithm for the RNN model is given (Subrahmanya and Shin, 2010). A description of the adaptive divided difference filter (ADDF), a robust stochastic non-linear observer for discrete-time systems (Subrahmanya and Shin, 2009), is given next. This is followed by a section on the fault detection and isolation logic for input (actuator), state (component) and output (sensor) faults. Finally a couple of examples, one based on a synthetic state-space model and one based on the DAMADICS simulation benchmark (Bartys et al., 2006), are given to demonstrate the feasibility of the proposed method.

Section snippets

System modeling using recurrent neural networks

Although a number of training methods have been proposed for RNNs as mentioned in the introduction, all these methods require a considerable amount of parameter and structure tuning from an experienced user. In order to automate the process of structure and parameter learning for RNNs, the authors recently proposed a constructive training method for RNNs (Subrahmanya and Shin, 2010). This method ensures the stability of an RNN with a single hidden layer throughout the training process as

Adaptive divided difference filter

The adaptive divided difference filter (Subrahmanya and Shin, 2009) is a modification of the divided difference filter (Norgaard et al., 2000) and belongs to the class of robust, non-linear, derivative-free stochastic filters. This section summarizes implementation details of the ADDF to make this document self contained. The interested reader is referred to (Subrahmanya and Shin, 2009) for a detailed description and analysis of the ADDF and its properties. The class of systems considered here

Fault detection and isolation logic

Three different modeling strategies, used to model the three assumed kinds of faults, are described below along with the reason behind choosing these strategies. The RNN model is assumed to have $p$ outputs, $m$ inputs and $h$ hidden nodes. Assume that $V = \begin{bmatrix} v_{1} \\ \vdots \\ v_{p} \end{bmatrix}$ and $B = [b_{1}, \ldots, b_{m}]$, where $v_{j}$ are row vectors and $b_{i}$ are column vectors. At this stage, it is assumed that an RNN model of the plant has been developed using the training method described in Section 2 and can be represented as given in (3):

The

Examples

Example 1:

This section presents results based on applying the proposed framework to a synthetic example, which has specifically been designed to test its capabilities. The dynamics of the system considered in this example is given by

$$
\begin{aligned}
x_{1}(k+1) &= 0.5 \tanh\big(x_{1}(k)\, x_{3}(k)\big) + \left[ 2 + \frac{1.5\, x_{1}(k)\, u_{1}(k)}{1 + x_{1}^{2}(k)\, u_{1}^{2}(k)} \right] u_{1}(k) + x_{4}(k) \\
x_{2}(k+1) &= \big[ 3 + \tanh\big(2 x_{1}(k)\big) \big]\, u_{2}(k) \\
x_{3}(k+1) &= x_{2}(k) \big[ 1 + \tanh\big(4 x_{2}(k)\big) \big] + \left[ \frac{x_{1}(k)}{1 + x_{1}^{2}(k)} \right] \\
x_{4}(k+1) &= \left[ 0.1\, x_{1}(k) + \frac{2 x_{1}(k)}{1 + x_{1}^{2}(k)} \right] u_{2}(k) \\
y(k) &= \begin{bmatrix} x_{1}(k) & x_{2}(k) & x_{3}(k) \end{bmatrix}
\end{aligned}
$$
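For readers who want to experiment with this example, the state equations above can be simulated with a short script. The sketch below transcribes the four state updates and the output map directly; the function names, the zero initial state and the sinusoidal input signals are assumptions made for illustration and are not the excitation signals used to generate the training data in the paper.

```python
import numpy as np

def step(x, u):
    """One step of the synthetic system; x = (x1, x2, x3, x4), u = (u1, u2)."""
    x1, x2, x3, x4 = x
    u1, u2 = u
    x1n = (0.5 * np.tanh(x1 * x3)
           + (2 + 1.5 * x1 * u1 / (1 + x1**2 * u1**2)) * u1
           + x4)
    x2n = (3 + np.tanh(2 * x1)) * u2
    x3n = x2 * (1 + np.tanh(4 * x2)) + x1 / (1 + x1**2)
    x4n = (0.1 * x1 + 2 * x1 / (1 + x1**2)) * u2
    return np.array([x1n, x2n, x3n, x4n])

def simulate(u_seq, x0=(0.0, 0.0, 0.0, 0.0)):
    """Return the state trajectory and the outputs y = [x1, x2, x3]."""
    x = np.asarray(x0, dtype=float)
    states = [x]
    for u in u_seq:
        x = step(x, u)
        states.append(x)
    states = np.array(states)
    return states, states[:, :3]   # x4 is not part of the output

# Hypothetical inputs chosen only to exercise the model.
k = np.arange(50)
u_seq = np.column_stack([0.5 * np.sin(0.1 * k), 0.5 * np.cos(0.05 * k)])
states, outputs = simulate(u_seq)
print(states.shape, outputs.shape)
```

Note that $x_{4}$ influences $x_{1}$ but does not appear in $y(k)$, so it can only be inferred through its effect on the other states.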

The training data was generated by using swept sine waves given by (u₁(k

Conclusions

A novel framework for data-based process monitoring, fault detection and diagnostics of non-linear systems with partial state measurement was presented in this paper. The framework provides specific guidelines for combining multiple modules in a systematic manner to achieve the desired results. The main advantages of the proposed framework are that it is a completely data-based method, which requires data from only normal operating conditions while still providing a certain degree of diagnostic

References (36)

M. Bartys et al.
Introduction to the DAMADICS actuator FDI benchmark study
Control Eng. Pract.
(2006)
J.M.F. Calado et al.
FDI approach to the DAMADICS benchmark problem based on qualitative reasoning coupled with fuzzy neural networks
Control Eng. Pract.
(2006)
J. Chen et al.
Dynamic process fault monitoring based on neural network and PCA
J. Process Control
(2002)
R. Conatser et al.
Diagnosis of automotive electronic throttle control systems
Control Eng. Pract.
(2004)
C. Edwards et al.
Sliding mode observers for fault detection and isolation
Automatica
(2000)
P.M. Frank et al.
Survey of robust residual generation and evaluation methods in observer-based fault detection systems
J. Process Control
(1997)
J. Gertler
Fault detection and isolation using parity relations
Control Eng. Pract.
(1997)
R. Isermann
Fault diagnosis of machines via parameter estimation and knowledge processing—tutorial paper
Automatica
(1993)
R. Isermann
Supervision, fault-detection and fault-diagnosis methods—an introduction
Control Eng. Pract.
(1997)
M. Norgaard et al.
New developments in state estimation for nonlinear systems
Automatica
(2000)

N. Subrahmanya et al.
Adaptive divided difference filtering for simultaneous state and parameter estimation
Automatica
(2009)
F.J. Uppal et al.
A neuro-fuzzy multiple-model observer approach to robust fault diagnosis based on the DAMADICS benchmark problem
Control Eng. Pract.
(2006)
M. Witczak et al.
A GMDH neural network-based approach to robust fault diagnosis: application to the DAMADICS benchmark problem
Control Eng. Pract.
(2006)
D. Capriglione et al.
On-line sensor fault detection, isolation, and accommodation in automotive engines
IEEE Trans. Instrum. Meas.
(2003)
I. Castillo et al.
Robust model-based fault detection and isolation for nonlinear processes using sliding modes
Int. J. Robust Nonlinear Control
(2012)
J. Chen et al.
Robust Model-Based Fault Diagnosis for Dynamic Systems
(1999)
N.H. El-Farra et al.
Actuator fault isolation and reconfiguration in transport-reaction processes
A.I.Ch.E. J.
(2007)

Golub, G.H., van Loan, C.F., 1989. Matrix computations. The Johns Hopkins University Press, Baltimore,...

