Multi-agent based collaborative fault detection and identification in chemical processes

https://doi.org/10.1016/j.engappai.2010.01.026Get rights and content

Abstract

Fault detection and identification (FDI) has received significant attention in literature. Popular methods for FDI include principal component analysis, neural-networks, and signal processing methods. However, each of these methods inherit certain strengths and shortcomings. A method that works well under one circumstance might not work well under another when different features of the underlying process come to the fore. In this paper, we show that a collaborative FDI approach that combines the strengths of various heterogeneous FDI methods is able to maximize diagnostic performance. A multi-agent framework is proposed to realize such collaboration in practice where different FDI methods, i.e: principal component analysis, self-organizing maps, non-parametric approaches, or neural-networks are combined. Since the results produced by different FDI agents might be in conflict, we use decision fusion methods to combine FDI results. Two different methods – voting-based fusion and Bayesian probability fusion are studied here. Most monitoring and fault diagnosis algorithms are computationally complex, but their results are often needed in real-time. One advantage of the multi-agent framework is that it provides an efficient means for speeding up the execution time of the various FDI methods through seamless deployment in a large-scale grid. The proposed multi-agent approach is illustrated through fault diagnosis of the startup of a lab-scale distillation unit and the Tennessee Eastman Challenge problem. Extensive testing of the proposed method shows that combining diagnostic classifiers of different types can significantly improve diagnostic performance.

Introduction

Diagnosis of process faults in chemical processes has been an active area of research (Srinivasan, 2007). Successful identification of process faults at an early stage can increase the success rate of fault recovery during operations and prevent unnecessary shutdowns. Also, automatic detection and diagnosis of faults are necessary to prevent costly accidents by providing time critical diagnostic information to plant operators. In literature, several fault diagnosis methodologies have been proposed for fault detection and identification (FDI) in chemical processes (Venkatasubramanian et al., 2003b, Venkatasubramanian et al., 2003a; Srinivasan et al., 2005a, Srinivasan et al., 2005b, Ng and Srinivasan, 2009a). Each FDI method has its strengths and shortcomings, which are process and fault dependant. A method that works well under one circumstance might not work well under another when different features of the process come to the fore. Combining FDI methods of different types is hence an attractive solution for monitoring processes operating under a wide range of operating conditions.

In addition to being adaptive towards different operating conditions, combining different FDI methods can also achieve higher diagnostic resolution by combining the strengths of existing FDI methods. It has already been shown in the pattern recognition literature that a judicious combination of classifiers generally outperforms a single one (Rahman and Fairhurst, 1999; Lin et al., 2003; McArthur et al., 2004). The main reason for combination of classifiers is that different types of classifiers can often complement one another and improve performance as a result of collaboration. When diagnosing faults in complex processes, designing a perfect classifier for all possible scenarios can be difficult, and combining different fault diagnostic methods is shown to be a good alternative wherein different features of heterogeneous diagnostic classifiers can be synergistically consolidated. To facilitate the integration of heterogeneous diagnostic classifiers, a multi-agent system is proposed in this paper to integrate various FDI methods. The organization of this paper is as follows: Section 2 provides the review of some previous work in the fields of FDI, decision fusion and agent-based methods. Section 3 describes the proposed multi-agent approach for collaborative FDI and the underlying decision fusion strategies. The proposed multi-agent with its decision fusion strategies are tested with two case studies, namely startup of a lab-scale distillation column and the Tennessee Eastman Challenge problem in Section 4 and Section 5, respectively.

In general, existing FDI methods can be broadly classified into two categories namely qualitative model-based and quantitative model-based methods. Qualitative model-based methods include techniques such as trend analysis and expert systems. Trend analysis is based on the abstraction of process data into a set of trends (Cheung and Stephanopoulos, 1990). Monitoring is then performed on the identified trends, which are made up of primitives that describe the qualitative behavior of the process variables. Classical trend analysis approaches are based on monitoring an ordered set of primitives that describe the evolution of a process variable. When a fault occurs, process variables vary from their nominal ranges and exhibit trends that are characteristic of the fault. Hence, different faults can be mapped to their characteristic trend signatures. Extension of trend analysis through fuzzy reasoning is reported in Dash et al. (2003).

During transitions, each variable might display a different trend during different phases of the transition. There are also occasions where process exhibits different trends during transitions due to normal operating variations, thus complicating trend comparison. Classical trend analysis is therefore not sufficient to monitor transitions adequately. Sundarraman and Srinivasan, 2003a, Sundarraman and Srinivasan, 2003b overcome the above problems through enhanced trends. Three types of matching degrees – shape matching degree, magnitude matching degree, and duration matching degree – were introduced to facilitate trend comparison during transition. The main shortcoming of trend analysis is that it is designed for monitoring individual variables. It does not take into account the correlation between the variables in the process.

Expert systems, or rule-based systems, use rules to perform monitoring. They are best suited to situations where plant operators have a good knowledge regarding the nuances of the transitions and the underlying process. Honda and Kobayashi (2000) used a fuzzy rule-based inference system for the direct control of batch operations. The process phase is first recognized by fuzzy inference, and then a fuzzy neural-network based control system is used to control the batch process. They illustrated their methods on mevalotin precursor production, vitamin B2 production, and sake mashing processes. In Muthuswamy and Srinivasan (2003), a rule-based expert system is developed for automation and supervisory control of semi-batch fermentation processes. They characterized transitions using features in process variables and represented them as multivariate rules. These rules track the process across phases and automatically detect the current active phase using online data. Different monitoring rules are formulated for each phase of a transition. The rule-based transition characterization method was shown to be robust to measurement noise and easily comprehendible to the operators. Nevertheless, rule-based systems are process specific; at times, it is hard to extract rules to adequately model complex processes.

First-principle models, statistical models, signal processing models, and neural-networks are clustered under quantitative model-based systems. Extensive coverage of quantitative model-based approaches for monitoring and diagnosing faults during steady-state can be found in Chen and Patton (1999) and Venkatasubramanian et al. (2003c). Quantitative models are built either from first-principles knowledge or from using input–output data. In Bhagwat et al. (2003a), a non-linear model-based approach was proposed to monitor process transitions. Estimation of process states and residuals was achieved through open-loop observers and Kalman filters. To address the issues arising from the discontinuous nature of transition, the scheme uses knowledge of the standard operating procedure and divides each transition into phases. For monitoring, each phase is associated with a model component and different filters and observers are selected for fault detection in that phase. However, accurate models of highly complex processes operating in multiple regimes are seldom available and difficult to develop, thus limiting their practical applicability. Multiple model-based approaches have therefore been used to model, control, and monitor transitions. In Bhagwat et al. (2003b), a multi-linear model-based fault detection scheme was proposed based on decomposition of operation of a non-linear process into multiple locally linear regimes. Kalman filters and open-loop observers were used for state estimation and residuals generation in each regime. Analysis of residuals using thresholds, faults maps, and logic-charts enabled on-line detection and isolation of faults.

Signal processing methods can be applied to analyze the normal/abnormal status of a process by comparing the online profile of process variables with those of previously known runs. The underlying methods perform time synchronization between process signals from different runs before comparing them based on predefined similarity metrics. Methods for signal processing include dynamic time warping (DTW) and dynamic programming (DP). Applications of DTW for process monitoring can be found in Gollmer and Posten (1996) and Kassidas et al., 1998a, Kassidas et al., 1998b. One known shortcoming of DTW is its high computational cost, which grows exponentially with the length of process data. This can be minimized by using landmarks such as peaks or local minima in the signals to reduce the complexity of signal comparison (Srinivasan and Qian, 2005, Srinivasan and Qian, 2007). These landmarks, called singular points, can be used to decompose a long continuous signal into multiple, short, semi-continuous ones. However, one known shortcoming of DTW algorithm is the essential requirement that the starting and ending points of the signals to be compared should coincide. Such shortcomings obviate their direct practice for online applications since the points in the historical database that should be matched with the starting and ending points of the online signal are unknown. To overcome these shortcomings, Srinivasan and Qian (2006) proposed dynamic locus analysis which is an extension of Smith and Waterman (1981) discrete sequence comparison algorithm for online signals comparison.

With the increasing availability of inexpensive sensors, the number of measured variables for most industrial processes easily ranges in thousands. This has lead to the popularity of multivariate statistical methods, which bring forth powerful means to monitor transitions. Principal components analysis (PCA) is one such multivariate dimensionality reduction technique that is widely used for developing data-driven models (Jackson, 1991). Applications of PCA and its variants for process monitoring can be found in MacGregor and Kourti (1995) and Chen and Liu (2002). Most of the reported work in multivariate statistical analysis is directed to processes where the correlation between the process variables remains the same. These approaches are not directly applicable to transitions due to statistical non-stationarity and time-varying dynamics. In order to overcome this, an extension called dynamic PCA (DPCA) has been proposed (Ku et al., 1995). In Srinivasan et al. (2004), DPCA has been used to classify process states based on historical operating data. Process data is first segmented into modes and transitions. Steady-state modes are identified by using a moving window approach which is capable of rejecting outliers. A DPCA-based similarity factor is used to compare transitions with historical data, which can be used for online FDI. Since the run-length variations common across different instances of transient operations restrict the application of time-wise unfolding methods with PCA, multiple models can be used to overcome such shortcoming. Doan and Srinivasan (2008) and Ng and Srinivasan (2009b), and multiple PCA models are used for monitoring transient operations.

Neural-network based approaches are another popular area for fault diagnosis in continuous processes (Kavuri and Venkatasubramanian, 1993). They have been popular for classification and function approximation. In Fabro et al. (2005), recurrent neural-networks were used to identify process states and predict process behavior. Control actions for different phases of transition are provided through sets of fuzzy controllers. They illustrated their approach through a distillation-column startup case study. Theoretically, artificial neural-networks can approximate any well-defined non-linear function with arbitrary accuracy. Unfortunately, there is no universal criterion for selecting a specific structure of neural-network for a practical application. Usually the structure of the network is decided based on the input dimensionality and the complexity of the underlying classes. The construction of an accurate neural classifier for such multivariate, multi-class temporal classification problem suffers from the “curse of dimensionality”. To overcome the above drawbacks, Srinivasan et al. (2005c) proposed the use of two new neural network architectures, namely one-variable-one-network (OVON) and one-class-one-network (OCON). In both structures, the original classification problem is decomposed into a number of simpler classification problems. The new neural-networks architectures are hence simpler in structure, faster to train, and yield substantial improvement in classification accuracy compared to classical neural-network structure. However, a priori knowledge of the sub-states of each variable is needed to derive the sub-state identification layer of OVON, which can be cumbersome for processes with large number of variables. High misclassification rate is also reported during state change when there is no clear separation between the states.

Though there exist a variety of FDI methods for process monitoring and fault diagnosis, it is worth noting that each FDI approach has its corresponding strengths and shortcomings, which are process dependant. A method that works well under one circumstance might not work well under another when different features of the process come to the fore. A comparison of the strengths and shortcomings of different FDI methods is shown in Table 1. Since no single FDI method is able to address the numerous facets of process monitoring and fault diagnosis, collaboration among heterogeneous methods is needed to bring forth the benefits of each method to improve monitoring resolution and robustness of the FDI system. The rationale of such an approach is based on the precept that the strengths of various methods can be brought together to bear on the problem and the drawbacks of an individual method can be overcome through collaboration.

When multiple FDI methods are used for monitoring and fault diagnosis, the integration between the heterogeneous methods becomes a challenge. Distributed computing methods such as multi-agent systems, which facilitate collaboration among distributed entities, are an attractive approach to combine various FDI methods. An agent wrapper can be used to encapsulate a FDI method and its results made available to other agents via messages. Next, we briefly review previous application of multi-agent approach in the domain of monitoring and fault diagnosis.

Section snippets

Multi-agent systems

The term ‘agent’ is defined as a computer system that is situated in some environment, and is capable of performing autonomous actions in that environment in order to meet its design objectives (Wooldridge and Jennings, 1995). An agent can thus be viewed as an computational entity that automates some aspect of task management or decision making to benefit its end user. Agent-based approaches offer opportunities to solve complex problems collaboratively using heterogeneous methods. An agent is

Collaborative agents for managing efficient operations

An agent-based framework called collaborative agents for managing efficient operations (CAMEO) is described here to render effective management of process operations possible. The proposed framework is abstracted hierarchically into environment, host, and agents. An environment in CAMEO is a neighborhood that supports any plant operation. All entities within the plant are part of the environment, i.e., software, hardware, controllers, human operators, etc. An agent environment might contain one

Case study I: fault diagnosis of Tennessee Eastman challenge Problem

In this section, the proposed FDI method is tested for online disturbance identification on the Tennessee Eastman (TE) industrial challenge problem (Downs and Vogel, 1993). The TE process produces two products (G and H) and a byproduct (F) from reactants A, C, D, and E (Fig. 6). The control structure of Lyman and Georgakis (1995), as implemented by Chiang and Braatz (2003) is used here. The process has five units, namely: a two-phase reactor, a product condenser, a flash separator, a recycle

Case study II: fault diagnosis during distillation unit startup

In this section, the proposed method is tested on a lab-scale distillation unit. The schematic of the distillation unit is shown in Fig. 10. The distillation column is of 2 m height and 20 cm width and has 10 trays; the feed enters at tray 4. The system is well integrated with a control console and data acquisition system. Nineteen variables – all tray temperatures, reboiler and condenser temperature, reflux ratio, top and bottom column temperature, feed pump power, reboiler heat duty, cooling

Conclusions

A novel multi-agent based framework has been developed for detecting and diagnosing faults in the process industries. It offers a mean to integrate seamlessly various fault detection and identification techniques. The framework, called collaborative agents for managing efficient operations (CAMEO), consists of different monitoring methods, each modeled as a software agent, which observe the process in real-time and flag abnormalities independently. A decision fusion strategy forms the bedrock

References (62)

  • J.A. Fabro et al.

    Startup of a distillation column using intelligent control techniques

    Computers and Chemical Engineering

    (2005)
  • P. Foggia et al.

    Multiclassification: reject criteria for the Bayesian combiner

    Pattern Recognition

    (1999)
  • K. Gollmer et al.

    Supervision of bioprocess using a dynamic time warping algorithm

    Control Engineering Practice

    (1996)
  • H. Honda et al.

    Fuzzy control of bioprocess

    Journal of Bioscience and Bioengineering

    (2000)
  • A. Kassidas et al.

    Off-line diagnosis of deterministic faults in continuous dynamic multivariable processes using speech recognition methods

    Journal of Process Control

    (1998)
  • S.N. Kavuri et al.

    Representing bounded fault classes using neural networks with ellipsoidal functions

    Computers and Chemical Engineering

    (1993)
  • J. Kepner et al.

    MatlabMPI

    Journal of Parallel and Distributed Computing

    (2004)
  • W. Ku et al.

    Disturbance detection and isolation by dynamic principal component analysis

    Chemometrics and Intelligent Laboratory Systems

    (1995)
  • X. Lin et al.

    Performance analysis of pattern classifier combination by plurality voting

    Pattern Recognition Letters

    (2003)
  • P.R. Lyman et al.

    Plant-wide control of the Tennessee Eastman problem

    Computers and Chemical Engineering

    (1995)
  • J.F. MacGregor et al.

    Statistical process control of multivariate processes

    Control Engineering Practice

    (1995)
  • F.P. Maturana et al.

    Distributed multi-agent architecture for automation systems

    Expert Systems with Applications

    (2004)
  • K. Muthuswamy et al.

    Phase-based supervisory control for fermentation process development

    Journal of Process Control

    (2003)
  • Y.S. Ng et al.

    An adjoined multi-model approach for monitoring batch and transient operations

    Computers and Chemical Engineering

    (2009)
  • B. Özyurt et al.

    A hybrid hierarchical neural network-fuzzy expert system approach to chemical process fault diagnosis

    Fuzzy Sets and Systems

    (1996)
  • T.F. Smith et al.

    Identification of common molecular subsequences

    Journal of Molecular Biology

    (1981)
  • R. Srinivasan et al.

    Online fault diagnosis and state identification using dynamic locus analysis

    Chemical Engineering Science

    (2006)
  • R. Srinivasan et al.

    A framework for managing transitions in chemical plants

    Computers and Chemical Engineering

    (2005)
  • R. Srinivasan et al.

    Context-based recognition of process states using neural networks

    Chemical Engineering Science

    (2005)
  • R. Srinivasan et al.

    Neural network systems for multi-dimensional temporal pattern classification

    Computers and Chemical Engineering

    (2005)
  • A. Sundarraman et al.

    Monitoring transitions in chemical plants using enhanced trend analysis

    Computers and Chemical Engineering

    (2003)
  • Cited by (59)

    • Data-driven process monitoring and fault analysis of reformer units in hydrogen plants: Industrial application and perspectives

      2020, Computers and Chemical Engineering
      Citation Excerpt :

      As discussed above, a plethora of data-driven techniques is available for the development of a process monitoring tool. However, each method has its advantages and shortcomings; a method that works well for one system might not exhibit satisfactory performance for another (Dash and Venkatasubramanian, 2000; Ge et al., 2013 Ng and Srinivasan, 2010; Perk et al., 2010). have proposed combining multiple FDD methods in a multi-agent system for process monitoring; these multi-agent systems, however, do not make selection of specific FDD techniques any easier.

    • Fuzzy decision fusion system for fault classification with analytic hierarchy process approach

      2017, Chemometrics and Intelligent Laboratory Systems
      Citation Excerpt :

      The representative utility-based method is voting-based method [18–20], and the evidence-based methods include Bayesian fusion method [21], Dempster-Shafer method [22], decision templates [23], Borda count [24], etc [25]. Numbers of literature on fault detection and classification are about decision fusion method [21,25–30]. However, most fusion strategies have not considered the performance of each method.

    View all citing articles on Scopus
    View full text