Elsevier

Information Fusion

Volume 6, Issue 2, June 2005, Pages 175-182
Information Fusion

Parameter estimation for Choquet fuzzy integral based on Takagi–Sugeno fuzzy model

https://doi.org/10.1016/j.inffus.2004.11.002Get rights and content

Abstract

Currently, both Choquet fuzzy integral and Takagi–Sugeno fuzzy models are popular synthetic evaluation and fuzzy modeling tools. In this paper, we prove that Choquet fuzzy integral is a special version of Takagi–Suguno fuzzy model in the sense of structure, thus the learning algorithm of the latter is used to develop a parameter estimation procedure for the former. The parameter estimation procedure actually is performed in each ordinal subspace of input space, in which all input data have unique ordering of components. The proposed approach in this paper has been proven to possess better performance than the existing ones by not only theoretical analysis but also experiments.

Introduction

The traditional tool of aggregation for information fusion is the weighted average method that is basically a linear integral (Lebesgue integral). It is assumed that all attributes are non-interactive and, hence, their weighted effects are viewed as additive ones. This assumption is not realistic in many applications [1]. To overcome this problem, a new mathematical tool, fuzzy integral [1], [2], has been introduced with respect to the non-additive fuzzy measure in place of the traditional weighted average. Aggregation functions given by the Choquet fuzzy integral (CFI) and the Sugeno fuzzy integral play an important role in multicriteria decision-making. Some well-known aggregation techniques such as the weighted arithmetic mean and OWA aggregation operators are just special cases of them [3]. In this paper, we focus on CFI that is the most popular aggregating tool for information fusion at present.

Despite the success of CFI in real applications, the practical use of fuzzy measures could be difficult, because for n attributes, one has to identify 2n parameters in order to define a fuzzy measure. This identification step is very important since all the knowledge concerning these attributes is embedded into the fuzzy measure. In nature, there are essentially three approaches [3]: identification based on the semantics, identification based on learning data, and combining semantics and learning data. However, this first approach is far from being a well-established methodology since it involves a non-negligible amount of experience, and there exist the same defect on the third one. By contrast, the second approach has been addressed in many applications. Especially, the issue from a set of input–output data to determine the fuzzy measure of fuzzy integral was considered as an inverse problem on the synthetic evaluation, and some approaches for this purpose have been described in literature [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14]. Genetic algorithm (GA) [4], [5], [6], [7] is a popular choice in this category, and some gradient descend-based and least square (LS)-based approaches [8], [9], [10], [11] also have shown their success to a certain extent. Grabisch [8] presented a heuristic method called HLMS for identifying parameters of CFI in the insufficient information, where we will address it carefully later in this paper. Chiang [12] developed a learning algorithm for CFI parameters by means of fuzzy neurocomputation, in which a λ-fuzzy measure has been shown powerful feasibility by gradient-descent method. Chen et al. [13], [14] proposed an efficient approach for general fuzzy measure in CFI. Interesting enough, Sugeno [15] and Fujimoto [16] gave a necessary and sufficient condition for a CFI to be decomposable into a group of piecewise linear functions [17]. These work enlighten our research in different views.

Takagi–Sugeno (TS) fuzzy model is very popular additive fuzzy system [18]. It can represent arbitrary non-linear functions by using a few linear functions to certain extent, thus being widely researched and achieving many results [19]. In addition to specific characters of CFI, we shall verify that it is only the special version of TS fuzzy model in the sense of structure. This means that the methods used TS fuzzy model help solve these problems in CFI.

This paper is organized as follows. Section 2 introduces several major characteristics of the CFI and the TS fuzzy models. Then a new TS-based approach of parameter estimation for CFI is discussed in Section 3. Two experiments are used to verify the feasibility and efficiency of the new approach in Section 4, and conclusions are presented in Section 5.

Section snippets

Major properties of CFI

Let X = {x1 ,x2,  , xn} be a finite set associated with n attributes on information source space, and denote P(X) as the power set X consisting of all subsets of X. A fuzzy measure over a set X is a function [1], [5] g:P(X)  [0, 1] such that

  • (1)

    g(Φ) = 0, g(X) = 1;

  • (2)

    g(A)  g(B) if A, B  P(X).


CFI is a fuzzy integral based on any fuzzy measure that provides alternative computational scheme for aggregating information. Assume h(x1), h(x2),  , h(xn) are the evidence provided by attributes, x1, x2,  , xn, respectively, then we

Computational structure of CFI

According to Eq. (1), when computing CFI, the input space H needs to be divided into different subspaces such that the components of all point vectors inside each subspace possess unique magnitude order. Each subspace above is called ordinal subspace in this paper. In total there exist n! different orders for n attributes, denoted by Dt, t = 1, 2,  , n! then each of them can be represented by Cn2 inequalities asDt=(h(xt1),h(xt2),,h(xtp),,h(xtq),,h(xtn)h(xtp)h(xtq),p=1,,n-1;q=p+1,,n).For

Experiments

Experimental conditions: Win2000, Matlab6.5, Celeron700M, 128M memory.

Conclusions

This paper analyzes the interrelation of two most popular models in data mining area and obtains a feasible approach for determining CFI parameters. In contrast of the existing approaches, the proposed approach is based on the popular and convinced WRLS algorithm. Its principle is simple yet has high accuracy. Likewise the WRLS is well known by most of engineers and many software products are available for handling various possible cases in application.

When the number of attributes in input

Acknowledgements

We would like to thank reviewers for their suggestions and for pointing out the errors in the initial version of our paper. These suggestions have been helpful to our work to a great extent and have been incorporated in the new version.

References (23)

  • M. Gribisch

    A graphical interpretation of the Choquet integral

    IEEE Trans. Fuzzy Syst.

    (2000)
  • Cited by (6)

    • Set characterization-selection towards classification based on interaction index

      2015, Fuzzy Sets and Systems
      Citation Excerpt :

      To our knowledge, the best classification accuracy results have been obtained with IncNet (97.1%) [30]. With CI classifiers, the best reported classification accuracy is 91.5% [31]. As shown in Table 10, GI feature selection and CI classification yield a classification accuracy of 97.5%.

    • On the fundamental convergence in the (C) mean in problems of information fusion

      2009, Journal of Mathematical Analysis and Applications
    • The mechanism based on fuzzy integral in multi-attribute optimal routing

      2011, Proceedings - 2011 4th IEEE International Symposium on Microwave, Antenna, Propagation and EMC Technologies for Wireless Communications, MAPE 2011
    • 2-order additive fuzzy measures identification method based on maximum entropy principle

      2010, Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics
    • Nonlinear integrals and their applications in data mining

      2010, Nonlinear Integrals and their Applications in Data Mining
    View full text