Constrained abductive reasoning with fuzzy parameters in Bayesian networks

https://doi.org/10.1016/S0305-0548(03)00204-1Get rights and content

Abstract

This work proposes a novel approach for solving abductive reasoning problems in Bayesian networks involving fuzzy parameters and extra constraints. The proposed method formulates abduction problems using nonlinear programming. To maximize the sum of the fuzzy membership functions subjected to various constraints, such as boundary, dependency and disjunctive conditions, unknown node belief propagation is completed. The model developed here can be built on any exact propagation methods, including clustering, joint tree decomposition, etc.

Introduction

Bayesian networks are widely used knowledge representation and reasoning tools for various domains under uncertainty [1], [2], [3], [4], [5], [6], [7]. Since an expert system requires both predictive and diagnostic information, two types of reasoning commonly are employed, namely deduction and abduction. Deduction is a logical process from a hypothesis to deduce evidence where probabilistic relationships are involved, and abduction is a logical process that hypothetically explains experimental observations [7].

Several methods have been developed for solving abductive reasoning problems in Bayesian networks. Exact methods exploit the independence structure contained in the network to efficiently propagate uncertainty [1], [6], [7]. Meanwhile, stochastic simulation methods provide an alternative approach suitable for highly connected networks, in which exact algorithms can be inefficient [7]. Recently, search-based approximate algorithms, which search for high probability configurations through a space of possible values, have emerged as a new alternative [8]. On the other hand, two key approaches have been proposed for symbolic inference in Bayesian networks, namely: the symbolic probabilistic inference algorithm (SPI) [9] and symbolic calculations based on slight modifications of standard numerical propagation algorithms [1], [10].

The above methods have two limitations for abductive reasoning:

  • (i)

    All relevant parameters are assumed to be crisp.

  • (ii)

    Extra constraints or knowledge regarding belief propagation in Bayesian networks are difficult to embed.

Those limitations restrict the usefulness of reasoning in Bayesian networks. First, the conditional probabilities between a node and its parents could be fuzzy parameters because of the difficulties of learning accurately the causal relationships among the nodes. Additionally, knowledge workers often acquire additional information regarding inferences in Bayesian networks, particularly when facing diverse diagnostic scenarios. This information can relate to boundary, dependency or disjunctive conditions. The above limitations are illustrated below using an example from Pearl [7].

Metastatic cancer is a possible cause of a brain tumor and is an explanation for increased total serum calcium. Either of these could explain a patient falling into a coma. Severe headache is also possibly associated with a brain tumor.

Fig. 1(a) shows a Bayesian network representing the above cause and effect relationships. Table 1 lists the causal influences in terms of conditional probability distributions. Each variable is characterized by the probability given the state of its parents. For instance: C∈{1,0} represents the dichotomy between having a brain tumor and not having one, +c denotes the assertion C=1 or “Brain tumor is present”, and −c is the negation of +c, namely, C=0. The root node, A, which has no parent, is characterized by its prior probability distribution. The above information can be used to solve the following reasoning problems.

Problem 1

Compute the posterior probability of every A, B, and C, given the conditional probabilities in Table 1, and a situation involving a patient who is suffering from a severe headache (E=1) but has not fallen into a coma (D=0); that is, compute P(a|−d,+e), P(b|−d,+e) and P(c|−d,+e).

Current abductive reasoning methods [1], [6], [7], [8], [9], [10] can solve Problem 1 successfully. However, if the parameters in Table 1 are fuzzy numbers, conventional methods may have difficulty in answering the queries. For instance, P(+b|+a) cannot be 0.8 but rather is a fuzzy number, say x̃1, where x̃1=P(+b|+a), and is associated with a membership function μ1(x̃1), represented as follows (see Fig. 2).μ̃1(x̃1)=5(x̃1−0.6)−5(|x̃1−0.8|+x̃1−0.8),0.6⩽x̃1⩽1,where “|∗|” denotes the absolute value of a term .

The above expression and Fig. 2 mean that the interval of x̃1 is between 0.6 and 1.0. If x̃1=0.8 then μ1(x̃1)=1, implying that x̃1=0.8 is the most possible situation. If x̃1⩽0.6 or x̃1⩾1 then μ1(x̃1)=0, the least-possible manifestation of x̃1. If x̃1=0.7, then μ1(x̃1)=0.5.

Fuzzy membership functions can be expressed in various ways. For example, denote x̃7=P(+d|+b,−c) and express μ7(x̃7) as the following function (Fig. 3).μ7(x̃7)=10(x̃7−0.7)−5(|x̃7−0.8|+x̃7−0.8)−5(|x̃7−0.85|+x̃7−0.85),0.7⩽x̃7⩽0.95.Here μ7(x̃7) is a trapezoid membership function and comprises four line segments, where 0.8⩽x̃7⩽0.85 has the maximal membership. A fuzzy membership function is frequently a concave function.

This work defines the fuzzy parameters x̃i, i=1,2,…,8, where P(+b|+a)=x̃1, P(+b|−a)=x̃2, P(+c|+a)=x̃3, P(+c|−a)=x̃4, P(+d|+b,+c)=x̃5, P(+d|−b,+c)=x̃6, P(+d|+b,−c)=x̃7, and P(+d|−b,−c)=x̃8. Table 2 lists the membership functions of the fuzzy parameters, among which μ7(x̃7) and μ8(x̃8) are trapezoid membership functions while the remainder are triangular functions. Next, consider Problem 2.

Problem 2

Compute the belief distributions P(a|−d,+c), P(b|−d,+c), and P(c|−d,+c), given the fuzzy membership functions in Table 2 and some constraints related to belief propagation.

Current abductive reasoning methods have difficulties in solving Problem 2 since it involves fuzzy information and extra constraints.

Consider abductive reasoning with constraints. For a given Bayesian network, knowledge workers (such as clinicians) may have professional judgments regarding the features of certain nodes and the relationships among them in particular diagnostic backgrounds. These features and relationships can take the form of various constraints.

  • (i)

    Boundary constraints: From additional information or observations, clinicians can infer that the posterior probability of A given E=1 and D=0 should be higher than 0.1 but lower than 0.3, which is expressed as0.1⩽P(+a|−d,+e)⩽0.3.

  • (ii)

    Functional dependency: The beliefs of certain nodes are functionally dependent. For example, clinicians can judge that the posterior probability of B is roughly a certain multiple of that of A given E=1 and D=0, which is expressed asP(+a|−d,+e)⩽2P(+b|−d,+e).

  • (iii)

    Disjunctive constraints: Sometimes disjunction may occur between nodes. For example, a doctor may estimate that either P(+a|−d,+e) or P(+b|−d,+e) is equal to or below 0.2, which is expressed asEitherP(+a|−d,+e)⩽0.2orP(+b|−d,+e)⩽0.2.

By introducing these constraints into the reasoning system, the following problems are formulated.

Problem 2.1

Compute the belief distributions P(a|−d,+c), P(b|−d,+c), and P(c|−d,+c), given the fuzzy membership functions in Table 2 and the following constraints.0.1⩽P(+a|−d,+e)⩽0.3,P(+b|−d,+e)⩽2P(+c|−d,+e)EitherP(+a|−d,+e)⩽0.2orP(+b|−d,+e)⩽0.2.

Problem 2.1 is more complicated and difficult than Problem 1 when solved using current propagation methods.

This study develops a new approach for solving Problem 2.1 based on optimization techniques. Section 2 first presents the mathematical expressions of reasoning with fuzzy parameters as well as the techniques for linearizing the nonlinear absolute terms. Section 3 then formulates Problem 2.1 as a nonlinear program and introduces some constraints relating to belief propagation. Next, Section 4 illustrates several numerical examples. The final section presents some concluding remarks.

Section snippets

Posterior probabilities with fuzzy parameters

First this study reviews the conventional methods for computing the posterior probabilities with crisp parameters. Consider the Bayesian network in Fig. 1(a) with the crisp information in Table 1. Clustering [1], [7] can transform Fig. 1(a) into an equivalent tree structure in Fig. 1(b), where nodes B and C are collapsed into a compound node Z=B&C. Let Z={z1,z2,z3,z4} be a set of cardinalities of Z and z1=(+b,+c), z2=(−b,+c), z3=(+b,−c), and z4=(−b,−c). Moreover, let WY denote the state of all

Abductive models with fuzzy parameters

Building upon Section 2, the abductive model for solving Problem 2.1 is formulated below.


Model 1.maxi=18μi(x̃i)s.t.(2.4),0.1⩽P(+a|−d,+e)⩽0.3,P(+b|−d,+e)⩽2P(+c|−d,+e),EitherP(+a|−d,+e)⩽0.2orP(+b|−d,+e)⩽0.2,where the objective function maximizes the sum of all fuzzy membership functions. Since (2.4) contains numerous nonseparate nonlinear terms, Model 1 is a highly nonlinear and nonconvex program. This work deals with the disjunctive constraint first and takes care of the nonlinear issue in

Numerical examples

Abductive reasoning problems in certain applications are solved below using the proposed constrained optimization approach.

Example 1

Problem 2.1 is solved using the following program:maxi=18μi(x̃i)s.t.μ1(x̃1)=5(x̃1−0.6)−5(|x̃1−0.8|+x̃1−0.8),μ2(x̃2)=10(x̃2−0.1)−10(|x̃2−0.2|+x̃2−0.2),μ3(x̃3)=10(x̃3−0.1)−15(|x̃3−0.2|+x̃3−0.2),μ4(x̃4)=25(x̃4−0.01)−17.5(|x̃4−0.05|+x̃4−0.05),μ5(x̃5)=5(x̃5−0.6)−5(|x̃5−0.8|+x̃5−0.8),μ6(x̃6)=10(x̃6−0.7)−10(|x̃6−0.8|+x̃6−0.8),μ7(x̃7)=10(x̃7−0.7)−5(|x̃7−0.8|+x̃7−0.8)−5(|x̃7−0.85|+x

Conclusions

This study develops a nonlinear programming model for dealing with constrained abductive reasoning on Bayesian networks. This model can be built on any exact propagation methods in Bayesian networks. The present study involves some fuzzy parameters and certain extra constraints. Optimization techniques, including piecewise linearization, are adopted to solve this nonlinear programming model and obtain optimal solutions to the abductive reasoning problems. Since the constraints in this model are

References (15)

There are more references available in the full text version of this article.

Cited by (0)

View full text