Multi-agent based collaborative fault detection and identification in chemical processes

doi:10.1016/j.engappai.2010.01.026

Engineering Applications of Artificial Intelligence

Volume 23, Issue 6, September 2010, Pages 934-949

https://doi.org/10.1016/j.engappai.2010.01.026 Get rights and content

Abstract

Fault detection and identification (FDI) has received significant attention in literature. Popular methods for FDI include principal component analysis, neural-networks, and signal processing methods. However, each of these methods inherit certain strengths and shortcomings. A method that works well under one circumstance might not work well under another when different features of the underlying process come to the fore. In this paper, we show that a collaborative FDI approach that combines the strengths of various heterogeneous FDI methods is able to maximize diagnostic performance. A multi-agent framework is proposed to realize such collaboration in practice where different FDI methods, i.e: principal component analysis, self-organizing maps, non-parametric approaches, or neural-networks are combined. Since the results produced by different FDI agents might be in conflict, we use decision fusion methods to combine FDI results. Two different methods – voting-based fusion and Bayesian probability fusion are studied here. Most monitoring and fault diagnosis algorithms are computationally complex, but their results are often needed in real-time. One advantage of the multi-agent framework is that it provides an efficient means for speeding up the execution time of the various FDI methods through seamless deployment in a large-scale grid. The proposed multi-agent approach is illustrated through fault diagnosis of the startup of a lab-scale distillation unit and the Tennessee Eastman Challenge problem. Extensive testing of the proposed method shows that combining diagnostic classifiers of different types can significantly improve diagnostic performance.

Introduction

Diagnosis of process faults in chemical processes has been an active area of research (Srinivasan, 2007). Successful identification of process faults at an early stage can increase the success rate of fault recovery during operations and prevent unnecessary shutdowns. Also, automatic detection and diagnosis of faults are necessary to prevent costly accidents by providing time critical diagnostic information to plant operators. In literature, several fault diagnosis methodologies have been proposed for fault detection and identification (FDI) in chemical processes (Venkatasubramanian et al., 2003b, Venkatasubramanian et al., 2003a; Srinivasan et al., 2005a, Srinivasan et al., 2005b, Ng and Srinivasan, 2009a). Each FDI method has its strengths and shortcomings, which are process and fault dependant. A method that works well under one circumstance might not work well under another when different features of the process come to the fore. Combining FDI methods of different types is hence an attractive solution for monitoring processes operating under a wide range of operating conditions.

In addition to being adaptive towards different operating conditions, combining different FDI methods can also achieve higher diagnostic resolution by combining the strengths of existing FDI methods. It has already been shown in the pattern recognition literature that a judicious combination of classifiers generally outperforms a single one (Rahman and Fairhurst, 1999; Lin et al., 2003; McArthur et al., 2004). The main reason for combination of classifiers is that different types of classifiers can often complement one another and improve performance as a result of collaboration. When diagnosing faults in complex processes, designing a perfect classifier for all possible scenarios can be difficult, and combining different fault diagnostic methods is shown to be a good alternative wherein different features of heterogeneous diagnostic classifiers can be synergistically consolidated. To facilitate the integration of heterogeneous diagnostic classifiers, a multi-agent system is proposed in this paper to integrate various FDI methods. The organization of this paper is as follows: Section 2 provides the review of some previous work in the fields of FDI, decision fusion and agent-based methods. Section 3 describes the proposed multi-agent approach for collaborative FDI and the underlying decision fusion strategies. The proposed multi-agent with its decision fusion strategies are tested with two case studies, namely startup of a lab-scale distillation column and the Tennessee Eastman Challenge problem in Section 4 and Section 5, respectively.

In general, existing FDI methods can be broadly classified into two categories namely qualitative model-based and quantitative model-based methods. Qualitative model-based methods include techniques such as trend analysis and expert systems. Trend analysis is based on the abstraction of process data into a set of trends (Cheung and Stephanopoulos, 1990). Monitoring is then performed on the identified trends, which are made up of primitives that describe the qualitative behavior of the process variables. Classical trend analysis approaches are based on monitoring an ordered set of primitives that describe the evolution of a process variable. When a fault occurs, process variables vary from their nominal ranges and exhibit trends that are characteristic of the fault. Hence, different faults can be mapped to their characteristic trend signatures. Extension of trend analysis through fuzzy reasoning is reported in Dash et al. (2003).

During transitions, each variable might display a different trend during different phases of the transition. There are also occasions where process exhibits different trends during transitions due to normal operating variations, thus complicating trend comparison. Classical trend analysis is therefore not sufficient to monitor transitions adequately. Sundarraman and Srinivasan, 2003a, Sundarraman and Srinivasan, 2003b overcome the above problems through enhanced trends. Three types of matching degrees – shape matching degree, magnitude matching degree, and duration matching degree – were introduced to facilitate trend comparison during transition. The main shortcoming of trend analysis is that it is designed for monitoring individual variables. It does not take into account the correlation between the variables in the process.

Expert systems, or rule-based systems, use rules to perform monitoring. They are best suited to situations where plant operators have a good knowledge regarding the nuances of the transitions and the underlying process. Honda and Kobayashi (2000) used a fuzzy rule-based inference system for the direct control of batch operations. The process phase is first recognized by fuzzy inference, and then a fuzzy neural-network based control system is used to control the batch process. They illustrated their methods on mevalotin precursor production, vitamin B2 production, and sake mashing processes. In Muthuswamy and Srinivasan (2003), a rule-based expert system is developed for automation and supervisory control of semi-batch fermentation processes. They characterized transitions using features in process variables and represented them as multivariate rules. These rules track the process across phases and automatically detect the current active phase using online data. Different monitoring rules are formulated for each phase of a transition. The rule-based transition characterization method was shown to be robust to measurement noise and easily comprehendible to the operators. Nevertheless, rule-based systems are process specific; at times, it is hard to extract rules to adequately model complex processes.

First-principle models, statistical models, signal processing models, and neural-networks are clustered under quantitative model-based systems. Extensive coverage of quantitative model-based approaches for monitoring and diagnosing faults during steady-state can be found in Chen and Patton (1999) and Venkatasubramanian et al. (2003c). Quantitative models are built either from first-principles knowledge or from using input–output data. In Bhagwat et al. (2003a), a non-linear model-based approach was proposed to monitor process transitions. Estimation of process states and residuals was achieved through open-loop observers and Kalman filters. To address the issues arising from the discontinuous nature of transition, the scheme uses knowledge of the standard operating procedure and divides each transition into phases. For monitoring, each phase is associated with a model component and different filters and observers are selected for fault detection in that phase. However, accurate models of highly complex processes operating in multiple regimes are seldom available and difficult to develop, thus limiting their practical applicability. Multiple model-based approaches have therefore been used to model, control, and monitor transitions. In Bhagwat et al. (2003b), a multi-linear model-based fault detection scheme was proposed based on decomposition of operation of a non-linear process into multiple locally linear regimes. Kalman filters and open-loop observers were used for state estimation and residuals generation in each regime. Analysis of residuals using thresholds, faults maps, and logic-charts enabled on-line detection and isolation of faults.

Signal processing methods can be applied to analyze the normal/abnormal status of a process by comparing the online profile of process variables with those of previously known runs. The underlying methods perform time synchronization between process signals from different runs before comparing them based on predefined similarity metrics. Methods for signal processing include dynamic time warping (DTW) and dynamic programming (DP). Applications of DTW for process monitoring can be found in Gollmer and Posten (1996) and Kassidas et al., 1998a, Kassidas et al., 1998b. One known shortcoming of DTW is its high computational cost, which grows exponentially with the length of process data. This can be minimized by using landmarks such as peaks or local minima in the signals to reduce the complexity of signal comparison (Srinivasan and Qian, 2005, Srinivasan and Qian, 2007). These landmarks, called singular points, can be used to decompose a long continuous signal into multiple, short, semi-continuous ones. However, one known shortcoming of DTW algorithm is the essential requirement that the starting and ending points of the signals to be compared should coincide. Such shortcomings obviate their direct practice for online applications since the points in the historical database that should be matched with the starting and ending points of the online signal are unknown. To overcome these shortcomings, Srinivasan and Qian (2006) proposed dynamic locus analysis which is an extension of Smith and Waterman (1981) discrete sequence comparison algorithm for online signals comparison.

With the increasing availability of inexpensive sensors, the number of measured variables for most industrial processes easily ranges in thousands. This has lead to the popularity of multivariate statistical methods, which bring forth powerful means to monitor transitions. Principal components analysis (PCA) is one such multivariate dimensionality reduction technique that is widely used for developing data-driven models (Jackson, 1991). Applications of PCA and its variants for process monitoring can be found in MacGregor and Kourti (1995) and Chen and Liu (2002). Most of the reported work in multivariate statistical analysis is directed to processes where the correlation between the process variables remains the same. These approaches are not directly applicable to transitions due to statistical non-stationarity and time-varying dynamics. In order to overcome this, an extension called dynamic PCA (DPCA) has been proposed (Ku et al., 1995). In Srinivasan et al. (2004), DPCA has been used to classify process states based on historical operating data. Process data is first segmented into modes and transitions. Steady-state modes are identified by using a moving window approach which is capable of rejecting outliers. A DPCA-based similarity factor is used to compare transitions with historical data, which can be used for online FDI. Since the run-length variations common across different instances of transient operations restrict the application of time-wise unfolding methods with PCA, multiple models can be used to overcome such shortcoming. Doan and Srinivasan (2008) and Ng and Srinivasan (2009b), and multiple PCA models are used for monitoring transient operations.

Neural-network based approaches are another popular area for fault diagnosis in continuous processes (Kavuri and Venkatasubramanian, 1993). They have been popular for classification and function approximation. In Fabro et al. (2005), recurrent neural-networks were used to identify process states and predict process behavior. Control actions for different phases of transition are provided through sets of fuzzy controllers. They illustrated their approach through a distillation-column startup case study. Theoretically, artificial neural-networks can approximate any well-defined non-linear function with arbitrary accuracy. Unfortunately, there is no universal criterion for selecting a specific structure of neural-network for a practical application. Usually the structure of the network is decided based on the input dimensionality and the complexity of the underlying classes. The construction of an accurate neural classifier for such multivariate, multi-class temporal classification problem suffers from the “curse of dimensionality”. To overcome the above drawbacks, Srinivasan et al. (2005c) proposed the use of two new neural network architectures, namely one-variable-one-network (OVON) and one-class-one-network (OCON). In both structures, the original classification problem is decomposed into a number of simpler classification problems. The new neural-networks architectures are hence simpler in structure, faster to train, and yield substantial improvement in classification accuracy compared to classical neural-network structure. However, a priori knowledge of the sub-states of each variable is needed to derive the sub-state identification layer of OVON, which can be cumbersome for processes with large number of variables. High misclassification rate is also reported during state change when there is no clear separation between the states.

Though there exist a variety of FDI methods for process monitoring and fault diagnosis, it is worth noting that each FDI approach has its corresponding strengths and shortcomings, which are process dependant. A method that works well under one circumstance might not work well under another when different features of the process come to the fore. A comparison of the strengths and shortcomings of different FDI methods is shown in Table 1. Since no single FDI method is able to address the numerous facets of process monitoring and fault diagnosis, collaboration among heterogeneous methods is needed to bring forth the benefits of each method to improve monitoring resolution and robustness of the FDI system. The rationale of such an approach is based on the precept that the strengths of various methods can be brought together to bear on the problem and the drawbacks of an individual method can be overcome through collaboration.

When multiple FDI methods are used for monitoring and fault diagnosis, the integration between the heterogeneous methods becomes a challenge. Distributed computing methods such as multi-agent systems, which facilitate collaboration among distributed entities, are an attractive approach to combine various FDI methods. An agent wrapper can be used to encapsulate a FDI method and its results made available to other agents via messages. Next, we briefly review previous application of multi-agent approach in the domain of monitoring and fault diagnosis.

Section snippets

Multi-agent systems

The term ‘agent’ is defined as a computer system that is situated in some environment, and is capable of performing autonomous actions in that environment in order to meet its design objectives (Wooldridge and Jennings, 1995). An agent can thus be viewed as an computational entity that automates some aspect of task management or decision making to benefit its end user. Agent-based approaches offer opportunities to solve complex problems collaboratively using heterogeneous methods. An agent is

Collaborative agents for managing efficient operations

An agent-based framework called collaborative agents for managing efficient operations (CAMEO) is described here to render effective management of process operations possible. The proposed framework is abstracted hierarchically into environment, host, and agents. An environment in CAMEO is a neighborhood that supports any plant operation. All entities within the plant are part of the environment, i.e., software, hardware, controllers, human operators, etc. An agent environment might contain one

Case study I: fault diagnosis of Tennessee Eastman challenge Problem

In this section, the proposed FDI method is tested for online disturbance identification on the Tennessee Eastman (TE) industrial challenge problem (Downs and Vogel, 1993). The TE process produces two products (G and H) and a byproduct (F) from reactants A, C, D, and E (Fig. 6). The control structure of Lyman and Georgakis (1995), as implemented by Chiang and Braatz (2003) is used here. The process has five units, namely: a two-phase reactor, a product condenser, a flash separator, a recycle

Case study II: fault diagnosis during distillation unit startup

In this section, the proposed method is tested on a lab-scale distillation unit. The schematic of the distillation unit is shown in Fig. 10. The distillation column is of 2 m height and 20 cm width and has 10 trays; the feed enters at tray 4. The system is well integrated with a control console and data acquisition system. Nineteen variables – all tray temperatures, reboiler and condenser temperature, reflux ratio, top and bottom column temperature, feed pump power, reboiler heat duty, cooling

Conclusions

A novel multi-agent based framework has been developed for detecting and diagnosing faults in the process industries. It offers a mean to integrate seamlessly various fault detection and identification techniques. The framework, called collaborative agents for managing efficient operations (CAMEO), consists of different monitoring methods, each modeled as a software agent, which observe the process in real-time and flag abnormalities independently. A decision fusion strategy forms the bedrock

References (62)

A. Bhagwat et al.
Fault detection during process transitions: a model-based approach
Chemical Engineering Science
(2003)
A. Bhagwat et al.
Multi-linear model-based fault detection during process transitions
Chemical Engineering Science
(2003)
J. Chen et al.
On-line batch process monitoring using dynamic PCA and dynamic PLS models
Chemical Engineering Science
(2002)
J.T. Cheung et al.
Representation of process trends part I. A formal representation framework
Computers and Chemical Engineering
(1990)
L.H. Chiang et al.
Process monitoring using causal map and multivariate statistics: fault detection and identification
Chemometrics and Intelligent Laboratory Systems
(2003)
K.-J. Cho et al.
A study on the classified model and the agent collaboration model for network configuration fault management
Knowledge-Based Systems
(2003)
R.T. Clemen
Combining forecasts: a review and annotated bibliography
International Journal of Forecasting
(1989)
S. Dash et al.
Fuzzy-logic based trend classification for fault diagnosis of chemical processes
Computers and Chemical Engineering
(2003)
X.-T. Doan et al.
Online monitoring of multi-phase batch processes using phase-based multivariate statistical process control
Computers and Chemical Engineering
(2008)
J.J. Downs et al.
A plant-wide industrial process control problem
Computers and Chemical Engineering
(1993)

J.A. Fabro et al.

Startup of a distillation column using intelligent control techniques

Computers and Chemical Engineering

(2005)

P. Foggia et al.

Multiclassification: reject criteria for the Bayesian combiner

Pattern Recognition

(1999)

K. Gollmer et al.

Supervision of bioprocess using a dynamic time warping algorithm

Control Engineering Practice

(1996)

H. Honda et al.

Fuzzy control of bioprocess

Journal of Bioscience and Bioengineering

(2000)

A. Kassidas et al.

Off-line diagnosis of deterministic faults in continuous dynamic multivariable processes using speech recognition methods

Journal of Process Control

(1998)

S.N. Kavuri et al.

Representing bounded fault classes using neural networks with ellipsoidal functions

Computers and Chemical Engineering

(1993)

J. Kepner et al.

MatlabMPI

Journal of Parallel and Distributed Computing

(2004)

W. Ku et al.

Disturbance detection and isolation by dynamic principal component analysis

Chemometrics and Intelligent Laboratory Systems

(1995)

X. Lin et al.

Performance analysis of pattern classifier combination by plurality voting

Pattern Recognition Letters

(2003)

P.R. Lyman et al.

Plant-wide control of the Tennessee Eastman problem

Computers and Chemical Engineering

(1995)

J.F. MacGregor et al.

Statistical process control of multivariate processes

Control Engineering Practice

(1995)

F.P. Maturana et al.

Distributed multi-agent architecture for automation systems

Expert Systems with Applications

(2004)

K. Muthuswamy et al.

Phase-based supervisory control for fermentation process development

Journal of Process Control

(2003)

Y.S. Ng et al.

An adjoined multi-model approach for monitoring batch and transient operations

Computers and Chemical Engineering

(2009)

B. Özyurt et al.

A hybrid hierarchical neural network-fuzzy expert system approach to chemical process fault diagnosis

Fuzzy Sets and Systems

(1996)

T.F. Smith et al.

Identification of common molecular subsequences

Journal of Molecular Biology

(1981)

R. Srinivasan et al.

Online fault diagnosis and state identification using dynamic locus analysis

Chemical Engineering Science

(2006)

R. Srinivasan et al.

A framework for managing transitions in chemical plants

Computers and Chemical Engineering

(2005)

R. Srinivasan et al.

Context-based recognition of process states using neural networks

Chemical Engineering Science

(2005)

R. Srinivasan et al.

Neural network systems for multi-dimensional temporal pattern classification

Computers and Chemical Engineering

(2005)

A. Sundarraman et al.

Monitoring transitions in chemical plants using enhanced trend analysis

Computers and Chemical Engineering

(2003)

Cited by (59)

Industrial process fault detection and diagnosis framework based on enhanced supervised kernel entropy component analysis
2022, Measurement: Journal of the International Measurement Confederation
Most existing industrial process fault detection and diagnosis (FDD) techniques operate on data collected at a single scale and focus only on known faults. However, actual process data are inherently multiscale and unknown faults are always inevitable during system running. Therefore, they may perform unsatisfactorily. To tackle this problem, this paper develops a decentralized industrial process FDD framework using multiple enhanced supervised kernel entropy component analysis (enhanced SKECA) models, where each model acts as a fault indicator for one specific fault. Faults can be easily diagnosed by monitoring the outputs of all models within the framework. In particular, when new faults are identified, the framework can update itself only by adding the corresponding enhanced SKECA models without a complete rebuilding process. The monitoring results for the continuous stirred tank reactor (CSTR) process show that the proposed framework is effective in diagnosing both known and unknown faults.
Data-driven process monitoring and fault analysis of reformer units in hydrogen plants: Industrial application and perspectives
2020, Computers and Chemical Engineering
Citation Excerpt :
As discussed above, a plethora of data-driven techniques is available for the development of a process monitoring tool. However, each method has its advantages and shortcomings; a method that works well for one system might not exhibit satisfactory performance for another (Dash and Venkatasubramanian, 2000; Ge et al., 2013 Ng and Srinivasan, 2010; Perk et al., 2010). have proposed combining multiple FDD methods in a multi-agent system for process monitoring; these multi-agent systems, however, do not make selection of specific FDD techniques any easier.
Reformer boxes are complex, integrated, and high-temperature units, subject to various failures during continuous operations for extended time periods. Challenges in the development of high-fidelity first principle models, despite easy availability of process measurements motivated the development of data-driven, automated fault detection (FD) systems. Paucity of plant-wide implementation of FD technologies in the chemical industry, accentuates the absence of relevant practical guidelines and best practices. In this paper, a trivially replicable FD system has been developed for large-scale industrial reformer boxes of hydrogen manufacturing units. Actual process data from plant historian has been used for training and validation of a novel model, developed using a combination of partial least squares regression and principal components analysis. Abnormalities based on several important measurements around the reformer were identified. Explicit algorithmic details and insights obtained during development of the expert system have been provided for ease of replication and adaptability.
Scalable reinforcement learning for plant-wide control of vinyl acetate monomer process
2020, Control Engineering Practice
This paper explores a reinforcement learning (RL) approach that designs automatic control strategies in a large-scale chemical process control scenario as the first step for leveraging an RL method to intelligently control real-world chemical plants. The huge number of units for chemical reactions as well as feeding and recycling the materials of a typical chemical process induces a vast amount of samples and subsequent prohibitive computation complexity in RL for deriving a suitable control policy due to high-dimensional state and action spaces. To tackle this problem, a novel RL algorithm: Factorial Fast-food Dynamic Policy Programming (FFDPP) is proposed. By introducing a factorial framework that efficiently factorizes the action space, Fast-food kernel approximation that alleviates the curse of dimensionality caused by the high dimensionality of state space, into Dynamic Policy Programming (DPP) that achieves stable learning even with insufficient samples. FFDPP is evaluated in a commercial chemical plant simulator for a Vinyl Acetate Monomer (VAM) process. Experimental results demonstrate that without any knowledge of the model, the proposed method successfully learned a stable policy with reasonable computation resources to produce a larger amount of VAM product with comparative performance to a state-of-the-art model-based control.
Review on data-driven modeling and monitoring for plant-wide industrial processes
2017, Chemometrics and Intelligent Laboratory Systems
Data-driven modeling and applications in plant-wide processes have recently caught much attention in both academy and industry. This paper provides a systematic review on data-driven modeling and monitoring for plant-wide processes. First, methodologies of commonly used data processing and modeling procedures for the plant-wide process are presented. Detailed research statuses on various aspects for plant-wide process monitoring are reviewed since 2000. After that, extensions, opportunities, and challenges on data-driven modeling for plant-wide process monitoring are discussed and highlighted for future research.
Fuzzy decision fusion system for fault classification with analytic hierarchy process approach
2017, Chemometrics and Intelligent Laboratory Systems
Citation Excerpt :
The representative utility-based method is voting-based method [18–20], and the evidence-based methods include Bayesian fusion method [21], Dempster-Shafer method [22], decision templates [23], Borda count [24], etc [25]. Numbers of literature on fault detection and classification are about decision fusion method [21,25–30]. However, most fusion strategies have not considered the performance of each method.
Performance of the most existing fault detection and classification methods can only be guaranteed when each of their own assumptions are met. In other words, a method works well in one condition may not perform well in another. In this paper, a new analytic hierarchy process (AHP) based fuzzy decision fusion system is proposed to tackle the fault classification problem. The AHP approach is introduced to determine the priorities of different classifiers, which are further utilized as the weights in ensemble system. Comparing to conventional equal weighted fusion system, the proposed fuzzy fusion system is able to provide more rational and convincing fault classification result. Effectiveness of the proposed fuzzy fusion system with model evaluation is verified through the Tennessee Eastman (TE) benchmark process.
Abnormal situation management: Challenges and opportunities in the big data era
2016, Computers and Chemical Engineering
Although modern chemical processes are highly automatic, abnormal situation management (ASM) still heavily relies on human operators. Process fault detection and diagnosis (FDD) are one of the most important issues of ASM but few FDD systems have been satisfactorily applied in real chemical processes since the concept of FDD was proposed about 40 years ago. In this paper, developments of chemical process FDD are briefly reviewed. The reason why FDD has not been widely implemented in the chemical process industry is discussed. One of the insights gained is that some basic problems in FDD such as how to define faults and how many faults to diagnose have not even been addressed well while researchers tirelessly try to invent new methods to diagnose fault. A new framework is proposed based on the big data in a cloud computing environment of a big chemical corporation for addressing the challenging issues in ASM.

View all citing articles on Scopus

View full text

Multi-agent based collaborative fault detection and identification in chemical processes

Abstract

Introduction

Section snippets

Multi-agent systems

Collaborative agents for managing efficient operations

Case study I: fault diagnosis of Tennessee Eastman challenge Problem

Case study II: fault diagnosis during distillation unit startup

Conclusions

Chemical Engineering Science

Chemical Engineering Science

Chemical Engineering Science

Computers and Chemical Engineering

Chemometrics and Intelligent Laboratory Systems

Knowledge-Based Systems

International Journal of Forecasting

Computers and Chemical Engineering

Computers and Chemical Engineering

Computers and Chemical Engineering

Computers and Chemical Engineering

Pattern Recognition

Control Engineering Practice

Journal of Bioscience and Bioengineering

Journal of Process Control

Computers and Chemical Engineering

Journal of Parallel and Distributed Computing

Chemometrics and Intelligent Laboratory Systems

Pattern Recognition Letters

Computers and Chemical Engineering

Control Engineering Practice

Expert Systems with Applications

Journal of Process Control

Computers and Chemical Engineering

Fuzzy Sets and Systems

Journal of Molecular Biology

Chemical Engineering Science

Computers and Chemical Engineering

Chemical Engineering Science

Computers and Chemical Engineering

Computers and Chemical Engineering