Multi-level information fusion for spatiotemporal monitoring in water distribution networks

https://doi.org/10.1016/j.eswa.2014.11.014Get rights and content

Highlights

  • Multi-level information fusion offers improved water quality monitoring under uncertainty.

  • The NF-BRB and high-level BRB systems provide flexible inference frameworks based on WQP features.

  • Hybrid learning and dynamic fusion algorithms significantly enhance the detection performance.

  • Validation strategy with projecting anomaly signal patterns allows performance evaluation.

Abstract

This paper deals with online water quality monitoring in distribution networks based on surrogate water quality parameters (WQPs). The present strategy is based on multi-level information fusion using hierarchical belief rule-based (BRB) systems. Networked fuzzy belief rule-based (NF-BRB) and high-level BRB systems are introduced for information fusion at the feature level. Primary and secondary features are extracted from online WQP signals. Primary features are analyzed using the NF-BRB system that is built through knowledge elicitation from experts. Secondary features are interpreted through the high-level BRB system that employs a fuzzy partitioning on the feature sets and a hybrid learning strategy for its rule base construction. Finally, the dynamic fuzzy evidential fusion is introduced to aggregate the local and spatial assessments in each analysis window. As an important contribution of this paper, we propose a new validation method for event detection in the water distribution network (WDN) based on adaptive projection of the signal patterns attributed to anomaly events, obtained through contamination experiments in a pilot facility, to the real WQP signals measured across the WDN. Single and composite contamination events based on several biological and chemical contaminants are simulated to evaluate the performance of the proposed framework in event detection. The proposed multi-level information fusion framework obtains a high detection rate and a reduced number of false negative and positive results.

Introduction

Online monitoring of distributed infrastructure systems such as water distribution networks is necessary to assure the safety of water supply systems and management of health risks. Since water quality can deteriorate after leaving the treatment plant, monitoring policies are required at the distribution network level. Often various patterns of uncertainty and information deficiency about spatial relationships and interdependency of distributed observations impede effective monitoring and data interpretation in distributed systems. As a result, monitoring of distributed complex systems requires specific signal processing, data interpretation, and event analysis methods. One major shortcoming of the current water quality monitoring systems is the lack of inference mechanisms that provide interpretable operational information to the operators and facility management team.

The emerging sensor technology allows online measurements of surrogate WQPs (Ernest et al., 2008, WERF, 2008). Successful event detection and alarming in water treatment plants and distribution networks is largely dependent on the employed computational data analysis and their discriminatory ability. The current methods to detect anomalies in the WDN can be categorized in three groups including statistical (Allgeier et al., 2005, McKenna et al., 2007, Murray et al., 2010, Shang et al., 2007), empirical AI-based (Allgeier et al., 2005, Raciti et al., 2012), and data mining (Koch and McKenna, 2011, McKenna et al., 2007, Murray et al., 2010, Yang et al., 2008) methods. Some of these event detection systems are only suited for data collected in a single monitoring station to indicate the occurrence of a contamination event (Byer and Carlson, 2005, Cook et al., 2006, Murray et al., 2010, Yang et al., 2009) while other methods are based on two monitoring locations to improve the aggregation results using one of the nodes as the reference to compensate for the calibration error of the other node, varying time delays, and background noise (Yang et al., 2008, McKenna et al., 2007). Kumar et al. (2007) studied some of these event detection algorithms and demonstrated that tested algorithms do not provide unique responses in similar situations. Many of these problems are attributed to the inconsistency in the indications of WQPs and nonlinearity in the inference process. In order to improve the event detection performance, specific data fusion methods were introduced to incorporate other types of information including operational data (Hart, Mckenna, Murray, & Haxton, 2010), sampling location specific features (Raciti et al., 2012), and multiple sampling locations data (Hou et al., 2013, Koch and McKenna, 2011, Murray et al., 2010).

Due to the complex nature of water quality degradation process, various hypotheses can be associated with the deviation of WQPs from their regulatory ranges (Adams, 2009, Dawsey et al., 2006, Francisque et al., 2009, Lu and Huang, 2009, Mounce et al., 2009). While above methods use empirical AI-based or data mining algorithms, they do not employ a framework that allow incorporating various types of uncertainty in causal spatiotemporal relationships of WQPs to water quality events. Thus, more flexible knowledge representation frameworks that allow a set of hypotheses as the plausible indications of significant variations in WQPs can provide a promising solution.

The belief rule-based (BRB) system with an embedded belief structure in its consequent can model nonspecificity when the expert cannot make strong judgments. Alternatively, imprecise and incomplete data may be used to build the rule base that governs the BRB system (Chen et al., 2013, Liu et al., 2004, Yang et al., 2006). A well-known BRB system called “rule base inference methodology using evidential reasoning (RIMER)” has been employed in various applications that involve uncertainty and incompleteness (Wang et al., 2006, Xu et al., 2007, Yang and Xu, 2002a, Yang and Xu, 2002b). BRB systems were successfully employed for improved water quality assessment in one sampling location (Aghaarabi et al., 2014, Aminravan et al., 2011, Aminravan et al., 2012). However, BRB systems have never been adopted for spatiotemporal monitoring in the WDN.

This work involves the application of BRB systems and evidential reasoning algorithms to a real municipal problem: the determination of water quality degradation events through the fusion of surrogate online monitored WQPs. This problem is formulated as nonlinear function approximations of features extracted from surrogate WQPs, spatial variations of the monitored WQPs, and imprecise temporal relationships between monitored WQPs at different nodes. A multi-level information fusion framework is proposed to alleviate some deficiencies in data interpretation in previous research works for distributed information fusion in the WDN. The multi-level information fusion framework efficiently performs data aggregation in the WDN at measurement, feature, and decision levels. The multi-level information fusion framework encompasses three sub-modules: (i) spatiotemporal sensor data preprocessing, (ii) primary and secondary (parametric) feature extraction, and (iii) a flexible dynamic hierarchical BRB system. The schematic of the multi-level information fusion is presented in Fig. 1.

The first two modules prepare the initial WQP signals and extract features indicative of anomalies to be used as inputs to the hierarchical BRB system. Sensor data preprocessing module adaptively transforms the WQP sensor measurements and effectively reduces the effects of baseline variations and other data discrepancies. Then, primary and secondary feature extraction methods quantify significant deviation of preprocessed spatiotemporal data from regulatory patterns. The vectors of quantified features are mapped into water quality levels using the hierarchical BRB system. To aggregate the assessed quality levels in each analysis window a dynamic fuzzy evidential fusion algorithm is introduced. Extensive simulated water quality degradation events, originated from real dataset of the Quebec City’s main WDN and contamination tests in a pilot distribution facility, were exploited to validate the efficacy of the proposed framework when employed for event detection in the WDN.

Section snippets

Problem statement

In this section, the problem of spatiotemporal monitoring in a distributed system such as the WDN is introduced to demonstrate the motivations for the proposed multi-level information fusion. Fig. 2 presents a simple schematic diagram of a spanning tree G = (V, E) defined between three nodes v1, v2 and v3 in one stratum of the WDN. It is assumed that several surrogate WQPs at each node of the network are monitored with different sampling frequencies. Assuming that the average retention time (RT)

Evidence theory basics

This section presents some main aspects of the Dempster–Shafer theory (DST) (Dempster, 1967, Shafer, 1976) required to introduce the hierarchical BRB system. Let Θ be the frame of discernment of all the elements Ai, i = 1, …, N and its corresponding power set denoted by 2Θ is such thatΘ={A1,A2,,AN}2Θ={B|BΘ}={,A1,A2,,AN,{A1,A2},,{A2,A3},,Θ}The veracity of proposition B of 2Θ is characterized by a basic belief assignment (BBA) m defined asm:2Θ[0,1],AiΘm(Ai)=1,AiΘ,m(Ai)0Aiisafocalelement.

Feature extraction for anomaly detection

In general, the features extracted from online WQPs can be classified into four categories: statistical, temporal, signal shape, and composite features (Hou et al., 2013). Most of the time, a significant and persistent deviation from the regulatory ranges can be attributed to certain anomaly events. These deviations are quantified using the primary and secondary feature extraction methods summarized in the following.

Distributed multi-level information fusion: the hierarchical belief rule-based system

This section presents a flexible multi-level information fusion framework that can incorporate both distributed WQP sensor measurements, and imprecise and deficient subjective knowledge on the indication of each surrogate WQP into the fusion process. For illustration, the schematic of distributed multi-level information fusion framework for five nodes that belong to two strata in the WDN is shown in Fig. 3.

In the rest, the structure of hierarchical BRB system, the dynamic fuzzy evidential

Problem setting for validation

In order to validate the results obtained through the proposed multi-level information fusion framework, a case study of a monitoring problem of six sampling locations in the Quebec City’s main WDN is considered. Online measurements of common WQPs including residual Chlorine (Clf) concentration and turbidity (Tur) are available. Information fusion in one sampling location of the Quebec City’s main WDN using fuzzy evidential reasoning and fuzzy rule-based systems has been previously addressed (

Distributed monitoring

Often a variation from the baseline can be attributed to an anomaly only if evidence exists that support occurrence of an unusual event. The secondary conditions that should be satisfied to warrant an anomaly event are based on the predefined patterns in the online WQP signals. These patterns are quantified using the secondary feature extraction method presented in Section 2 and are used to train the high-level BRB system through the hybrid learning method.

The unprocessed sensor measurements

Conclusions

This paper introduced a multi-level information fusion framework with several configurations for online water quality monitoring in the WDN. The strategy adopted to fuse the information coming from multiple WQP sensors is based on a three-level fusion architecture composed of a local feature combination (at single time steps), a dynamic fuzzy evidential fusion for assessment combination in one analysis window and a spatial fusion algorithm for data aggregation across the nodes. The benefits of

Acknowledgment

The research reported in this manuscript was fully supported by Natural Sciences and Engineering Research Council (NSERC) Canada under the Strategic Project Grant (SPG) program.

References (54)

  • Z.J. Zhou et al.

    A sequential learning algorithm for online constructing belief-rule-based systems

    Expert Systems with Applications

    (2010)
  • J.A. Adams

    Shedding light on continuous water monitoring

    Journal of American Water Works Association

    (2009)
  • E. Aghaarabi et al.

    Comparative study of fuzzy evidential reasoning and fuzzy rule-based approaches: An illustration for water quality assessment in the water distribution network

    Stochastic Environmental Research and Risk Assessment

    (2014)
  • S. Allgeier et al.

    Overview of event detection systems for watersentinel

    (2005)
  • F. Aminravan et al.

    Evidential reasoning using extended fuzzy Dempster–Shafer theory for handling various facets of information deficiency

    International Journal of Intelligent Systems

    (2011)
  • Aminravan, F., Sadiq, R., Hoorfar, M., Rodriguez, M.J., & Najjaran, H. (2012). Multicriteria information fusion using a...
  • Aminravan F., Sadiq R., Hoorfar M., Rodriguez M.J., Francisque A., & Najjaran H. (2013). In Networked fuzzy belief...
  • D. Byer et al.

    Real-time detection of intentional chemical contamination in the distribution system

    Journal of the American Water Works Association

    (2005)
  • J.B. Cook et al.

    Distribution system monitoring research at charleston water system

  • Chu, D. Deshpande A., Hellerstein J.M., & Hong W. (2006). Approximate data collection in sensor networks using...
  • W.J. Dawsey et al.

    Bayesian belief networks to integrate monitoring evidence of water distribution system contamination

    Journal of Water Resources Planning and Management

    (2006)
  • A.P. Dempster

    Upper and lower probabilities induced by a multivariate mapping

    Annals of Mathematical Statistics

    (1967)
  • Ernest, A., Andrew, K., Chang, N., & Cheng, C. (2008). Spatial rule-based expert system for sensor network planning in...
  • A. Francisque et al.

    Prioritizing monitoring locations in a water distribution network: A fuzzy risk approach

    Journal of Water Supply: Research and Technology. AQUA

    (2009)
  • S. Guillaume et al.

    Generating an interpretable family of fuzzy partition from data

    IEEE Transactions on Fuzzy Systems

    (2004)
  • M. Guo et al.

    Evidential reasoning approach for multiattribute decision analysis under both fuzzy and interval uncertainty

    IEEE Transactions on Fuzzy Systems

    (2009)
  • Hart, D.B., Mckenna, S.A., Murray, R. & Haxton, T. (2010) Combining water quality and operational data for improved...
  • Cited by (12)

    • A belief rule-base approach to the assessment and improvement of seismic resilience of high-speed railway station buildings

      2023, Soil Dynamics and Earthquake Engineering
      Citation Excerpt :

      Knowledge expression is realized using the BRB expert system, and knowledge inference is realized using the ER algorithm. Hierarchical BRB was first proposed by Ref. [41] and further applied to complex systems [37,42]. The main idea of hierarchical BRB, taking a three hierarchical BRB in Fig. 1 as an example, is to first combine the evidence at the lowest level (D, E, and F) using sub-BRB-1 and sub-BRB-2, and then obtain the inference result B and C as the input to the aspect of the upper layer A using sub-BRB-3, such that the transfer of information from the bottom layer to the target layer can be achieved.

    • Optimization of selective withdrawal systems in hydropower reservoir considering water quality and quantity aspects

      2021, Expert Systems with Applications
      Citation Excerpt :

      International modeling and management studies have also extensively employed CE-QUAL-W2. Surrogate models as compact scalable analytical models emulate the behavior of complex systems based on limited data of computationally expensive simulations (Johnson & Rogers, 2002; Zou et al., 2007; Aminravan et al., 2015; Datta & Regis, 2016; Tsoukalas et al., 2016). Consider ℳ as a process-based simulation model with an accurate representation of natural processes, which takes a vector of decision variables Ҳ and generator output Ƴ, such that Ƴ= ℳ (Ҳ).

    View all citing articles on Scopus
    View full text