A data-based framework for fault detection and diagnostics of non-linear systems with partial state measurement

https://doi.org/10.1016/j.engappai.2012.09.004Get rights and content

Abstract

A novel framework based on the use of dynamic neural networks for data-based process monitoring, fault detection and diagnostics of non-linear systems with partial state measurement is presented in this paper. The proposed framework considers the presence of three kinds of states in a generic system model: states that can easily be measured in real time and in-situ, states that are difficult to measure online but can be measured offline to generate training data, and states that cannot be measured at all. The motivation for such a categorization of state variables comes from a wide class of problems in the manufacturing and chemical industries, wherein certain states are not measurable without expensive equipments or offline analysis while some other states may not be accessible at all. The framework makes use of a recurrent neural network for modeling the hidden dynamics of the system from available measurements and uses this model along with a non-linear observer to augment the information provided by the measured variables. The performance of the proposed method is verified on a synthetic problem as well as a benchmark simulation problem.

Introduction

Process monitoring and fault detection methods may broadly be divided into two classes: signal-based methods and model-based methods, and a large number of applications may be found in literature for both the methods (Isermann, 2006, Patton et al., 2000). Signal-based methods (Chen and Liao, 2002, Qin, 2003) generally do not need mathematical models of the system but need data from faulty conditions to perform fault detection and diagnostics. This is desirable in many real world applications as the process may be too complex to model mathematically (as in manufacturing applications) or the effort required in developing a model may not be justifiable economically. Model-based methods (El-Farra and Ghantasala, 2007, Zarei and Poshtan, 2010, Castillo et al., 2012) on the other hand make use of the fact that faults may change the nature of the relationship between the measured inputs and outputs and thus allow the detection of deviations in quantities that are not directly observed. Fault detection may usually be done without the need for data from faulty conditions. Fault diagnosis, however, may, still require data from faulty conditions (Uppal et al., 2006), which is usually difficult to acquire in many applications. Therefore, this paper proposes an approach that tries to retain the benefits of both signal-based and model-based methods by developing a hybrid data and model-based framework that can learn dynamic process models from historical data and diagnose a class of faults without the need for data from faulty conditions or complex physics-based models of the system.

For many industrial systems, approximate data-based dynamic process models can be developed from historical process data using techniques such as auto regressive models with exogenous inputs (ARX) and neural networks. Recurrent Neural Networks (RNN) in particular have been found to be very effective in modeling non-linear dynamic systems (Hou et al., 2007, Lee et al., 2001) and can be used to approximate any discrete dynamic system, which can be represented in the state space form, to any desired degree of accuracy (Jin et al., 1995). This developed data-based model may then be used for fault detection and isolation (FDI).

Model based fault detection and isolation methods have also been called as analytical redundancy methods as they involve the comparison of measured signals with their estimates, based on models subject to the same input condition, to generate residuals. Many model based methods have been proposed in the literature and a short survey of these methods is provided here. Diagnostic information may be extracted from these residuals using simple limit checks or statistical tests (Montgomery, 2008). More robust methods of evaluating residuals including the use of adaptive threshold evaluation (Patton et al., 2000) and non-linear classifiers (Chen and Patton, 1999) have also been proposed. The generation of residuals, however, remains the focus of most model based FDI methods. Residuals can be generated using any of a number of different methods mentioned below. A direct comparison of measured outputs with physics based models (Moskwa et al., 2001, Song et al., 2003, Conatser et al., 2004) or data based models (Capriglione et al., 2003, Calado et al., 2006, Uppal et al., 2006, Witczak et al., 2006) is the most straight-forward way to generate residuals. Parity relation approaches generate the residual based upon consistency checking on system input and output data over a time window (Gertler, 1997). Parameter estimation approaches directly make use of system identification techniques to isolate changes in critical but unmeasurable system parameters (Isermann, 1993). Observer based methods for deterministic (Edwards et al., 2000, Hou and Patton, 1996, Frank and Ding, 1997) and stochastic systems (Tsai et al., 2007, Kobayashi and Simon, 2006, Li et al., 2005, Xiong et al., 2007) can be used to estimate unmeasured system states/parameters and with suitably designed gain matrices, they can be used to generate residuals which are robust to modeling uncertainties. A bank of dedicated observers can be used to isolate faults for multi-input multi-output systems. The problem of robust model-based FDI in non-linear process systems, especially for actuators, has received significant attention in recent years (El-Farra and Ghantasala, 2007, Zarei and Poshtan, 2010, Castillo et al., 2012). Variations in the structure of the observer banks, such as all input one output, all input all but one output, one input all output, etc. can be used to isolate actuator, sensor and component faults (Isermann, 1997). The parity relation based approach, the observer-based approach and the parameter estimation-based approaches are all related to each other and the correspondence between these approaches may be found in (Chen and Patton, 1999). In this work the observer based approach is used because of the flexibility it affords. In spite of using sophisticated training methods, the use of data-based models, such as the RNN, introduces additional uncertainty regarding model predictions and this should be given due consideration by any model-based fault detection and isolation scheme. While there are a number of non-linear observers that can be considered for the task, such as the Extended Kalman Filter (Reif et al., 1999) or the Unscented Kalman Filter (Julier and Uhlmann, 2004), this paper uses a stochastic non-linear observer called the Adaptive Divided Difference Filter (ADDF), which explicitly accounts for model error and is robust to it (Subrahmanya and Shin, 2009).

Most model-based methods for FDI developed so far assume the availability of a first-principles based model wherein important states of the system are part of the model and may be estimated if necessary. Data-based dynamic models on the other hand may not have interpretable states and if some of the important states of the system need to be estimated on-line (for the purpose of monitoring the process) from input–output measurements, then special care has to be taken to ensure that these states are modeled explicitly. In practice, the instrumentation required to measure all states may not be available and the data available for modeling would include inputs, outputs and selected states. The proposed framework then considers this important practical scenario, where there are three kinds of states in the system model: states that can easily be measured in real time and in situ, states that are difficult to measure online but can be measured offline to generate training data, and states that cannot be measured at all. The motivation for such a categorization of state variables comes from a wide class of problems in the manufacturing and chemical industries, wherein certain states (such as surface roughness in manufacturing or intermediate stream compositions in chemical processes) are not measurable without expensive equipments or offline analysis while some other states may not be accessible at all. The goal then is to distinguish faults belonging to three classes—actuator faults (these are assumed to change the influence of an input on the model), component faults (it is assumed that these faults can be detected and diagnosed by monitoring certain states of the system) and sensor faults (these are assumed to affect the measured states). While it is possible that there may be a number of faults in complex systems, which affect multiple elements (inputs, states and outputs), it is believed that the above categorization is still useful to get a general idea of the location and effect of a fault. To the best of the authors' knowledge, this is the first paper considering the combination of a data-based model-based FDI system with this practically important categorization of state variables.

A block diagram describing the architecture of the proposed framework is shown in Fig. 1. The methods used in the three major blocks in Fig. 1 will be described in the following sections. It should be noted that our works on various individual modules in Fig. 1 have been reported elsewhere and the main contribution of this work is the combination of these individual modules and the validation of the entire data-based fault detection and diagnostics scheme. First, the use of recurrent neural networks is proposed for the purpose of learning the dynamics of a system and a suitable structure and training algorithm for the RNN model is given (Subrahmanya and Shin, 2010). A description of the adaptive divided difference filter (ADDF), a robust stochastic non-linear observer for discrete-time systems (Subrahmanya and Shin, 2009), is given next. This is followed by a section on the fault detection and isolation logic for input (actuator), state (component) and output (sensor) faults. Finally a couple of examples, one based on a synthetic state-space model and one based on the DAMADICS simulation benchmark (Bartys et al., 2006), are given to demonstrate the feasibility of the proposed method.

Section snippets

System modeling using recurrent neural networks

Although a number of training methods have been proposed for RNNs as mentioned in the introduction, all these methods require a considerable amount of parameter and structure tuning from an experienced user. In order to automate the process of structure and parameter learning for RNNs, the authors recently proposed a constructive training method for RNNs (Subrahmanya and Shin, 2010). This method ensures the stability of an RNN with a single hidden layer throughout the training process as

Adaptive divided difference filter

The adaptive divided difference filter (Subrahmanya and Shin, 2009) is a modification of the divided difference filter (Norgaard et al., 2000) and belongs to the class of robust, non-linear, derivative-free stochastic filters. This section summarizes implementation details of the ADDF to make this document self contained. The interested reader is referred to (Subrahmanya and Shin, 2009) for a detailed description and analysis of the ADDF and its properties. The class of systems considered here

Fault detection and isolation logic

Three different modeling strategies, used to model the three assumed kinds of faults, are described below along with the reason behind choosing these strategies. The RNN model is assumed to have p outputs, m inputs and h hidden nodes. Assume that V=[v1vp] and B=[b1,…,bm], where vj are row vectors and bi are column vectors. At this stage, it is assumed that an RNN model of the plant has been developed using the training method described in Section 2 and can be represented as given in (3):

The

Examples

Example 1:

This section presents results based on applying the proposed framework to a synthetic example, which has specifically been designed to test its capabilities. The dynamics of the system considered in this example is given byx1(k+1)=0.5tanh(x1(k)x3(k))+[2+1.5x1(k)u1(k)1+x12(k)u12(k)]u1(k)+x4(k)x2(k+1)=[3+tanh(2x1(k))]u2(k)x3(k+1)=x2(k)[1+tanh(4x2(k))]+[x1(k)1+x12(k)]x4(k+1)=[0.1x1(k)+2x1(k)1+x12(k)]u2(k)y(k)=[x1(k)x2(k)x3(k)]

The training data was generated by using swept sine waves given by (u1(k

Conclusions

A novel framework for data-based process monitoring, fault detection and diagnostics of non-linear systems with partial state measurement was presented in this paper. The framework provides specific guidelines for combining multiple modules in a systematic manner to achieve the desired results. The main advantages of the proposed framework are that it is a completely data-based method, which requires data from only normal operating conditions while still providing a certain degree of diagnostic

References (36)

  • N. Subrahmanya et al.

    Adaptive divided difference filtering for simultaneous state and parameter estimation

    Automatica

    (2009)
  • F.J. Uppal et al.

    A neuro-fuzzy multiple-model observer approach to robust fault diagnosis based on the DAMADICS benchmark problem

    Control Eng. Pract.

    (2006)
  • M. Witczak et al.

    A GMDH neural network-based approach to robust fault diagnosis: application to the DAMADICS benchmark problem

    Control Eng. Pract.

    (2006)
  • D. Capriglione et al.

    On-line sensor fault detection, isolation, and accommodation in automotive engines

    IEEE Trans. Instrum. Meas.

    (2003)
  • I. Castillo et al.

    Robust model-based fault detection and isolation for nonlinear processes using sliding modes

    Int. J. Robust Nonlinear Control

    (2012)
  • J. Chen et al.

    Robust Model-Based Fault Diagnosis for Dynamic Systems

    (1999)
  • N.H. El-Farra et al.

    Actuator fault isolation and reconfiguration in transport-reaction processes

    A.I.Ch.E. J.

    (2007)
  • Golub, G.H., van Loan, C.F., 1989. Matrix computations. The Johns Hopkins University Press, Baltimore,...
  • Cited by (28)

    • Deep learning of complex process data for fault classification based on sparse probabilistic dynamic network

      2022, Journal of the Taiwan Institute of Chemical Engineers
      Citation Excerpt :

      During the decades of development, process monitoring has continuously integrated emerging technologies such as machine learning, pattern recognition and data mining, which greatly improve the performance of process monitoring. [4–6]. However, nonlinearity modeling among process variables and capture of dynamic data characteristics are two difficult tasks but both important for practical applications [7,8]. In order to achieve better performance of process monitoring, we need to further study on the basis of the proposed methods for these two tasks.

    • Parallel quality-related dynamic principal component regression method for chemical process monitoring

      2019, Journal of Process Control
      Citation Excerpt :

      With the rapid development of industrial process, a reliable process monitoring system is necessary. In recent decades, multivariate statistical process monitoring (MSPM) has received great attention and has been applied to many industrial cases successfully [1–6]. However, part of MSPM methods such as PCA only use process variables, which is unable to reveal the changes in quality variables accurately [7–9].

    • Efficient faulty variable selection and parsimonious reconstruction modelling for fault isolation

      2016, Journal of Process Control
      Citation Excerpt :

      Online fault detection and isolation in modern industrial processes has been an extremely important practice driven by plant safety, product quality improvement, plant economics, etc., over the last few decades. Various methods [1–13] have been developed and reported in practical applications. Multivariate statistical process control (MSPC) charts based on multivariate statistical projection methods such as principal component analysis (PCA) [14] and partial least squares (PLS) [15,16] have been shown to be successful for online monitoring of new processes.

    View all citing articles on Scopus
    View full text