A generalized TSK model with a novel rule antecedent structure: Structure identification and parameter estimation

https://doi.org/10.1016/j.compchemeng.2010.01.007Get rights and content

Abstract

TSK fuzzy models are convenient tools for describing complex nonlinear behavior. However, the existing combinatorial antecedent structure in TSK models makes them substantially suffer from the curse of dimensionality. In this work, a novel rule antecedent structure is proposed to design an efficient generalized TSK (GTSK) model by using fewer rules. The new rule antecedent only uses nonlinear variables. Additionally, one more degree of freedom is introduced to design antecedents to cover an antecedent space more efficiently, which further reduces the number of rules. The resultant GTSK model is identified in two stages. A novel recursive estimation based on spatially rearranged data is used to determine the consequent and antecedent variables. Model parameter values are obtained from partitioned antecedent space, which is the result of solving a series of splitting and regression problems.

Introduction

TSK (Takagi–Sugeno–Kang) fuzzy models gained popularity in fields including modeling (Chen, 2009, Cococcioni et al., 2007, Jacquin and Shamseldin, 2006), control (Guelton et al., 2009, Khiar et al., 2007), forecasting (Chang & Liu, 2008), and classification (Zhang, Zhou, Liu, & Harrington, 2006). The popularity is partially due to the interpretability rendered by the linguistic terms such as ‘High’, ‘Fast’, etc., used to construct rules. Also, the cause-and-effect format in an IF … THEN structure is friendly for human understanding and interpretation. Interpretability of a TSK model is enhanced by the divide-and-conquer concept inherited in the model, where each rule describes a part of model behavior, and rules work together for a complete description. One then understands a complex model by understanding each part of it. More importantly, the popularity is rooted in a fundamental reason: a TSK fuzzy model is a universal approximator (Kosko, 1994), which guaranties its ability to describe almost any nonlinear behavior given a sufficiently flexible structure.

The following presentation will be based upon a single-input–single-output (SISO) model. The extension to multiple-input–multiple-output (MIMO) models will be addressed when necessary. Eq. (1) is a general expression of SISO dynamic systems with dynamic orders ny, nu, pure time delay d, and an additive disturbance e(t)y(t)=fy(t1),,y(tny),u(td),,u(tnud)+e(t)where y is the system response and u is the system input.

A corresponding rule (the rth rule in a TSK model) to Eq. (1) is described byIF(y(t1) is A1rANDANDu(tnud) is Any+nu+1r)THENAr(z1)y(t)=kr+Br(z1)u(td)Ar(z1)=1+a1rz1++anyrznyBr(z1)=b0r+b1rz1++bnurznuwhere z is the backshift operator, and the expression y(t1) is A1rANDANDu(tnud) is Any+nu+1r is the antecedent of the rule. The variables y(t  1), …, y(t  ny), u(t  d), …, u(t  nu  d) are antecedent variables and A1r is the fuzzy subset for y(t  1) in the rule. The consequent of the rule is a local linear model Ar(z−1)y(t) = kr + Br(z−1)u(t  d) with dynamic orders ny and nu, pure time delay d and a constant kr.

A TSK model is defined by a collection of rules in Eq. (2). The identification of a TSK model needs to determine antecedent and consequent structures, and to estimate parameter values.

An important structure configuration in a TSK model is the number of rules, which should be set to balance model complexity with accuracy. Trials are often needed in practice to obtain the right number. Heuristics based on clustering are used to recognize the prototype rules (Dickerson and Kosko, 1996, Vernieuwe et al., 2006, Wang and Yang, 2009), which automatically results the number of rules. In Nelles (2001) the number of rules progressively grows in each step when an equal division in a dimension is conducted. The number of rules might also be determined in a backward fashion (Yen & Wang, 1999), where the TSK model initially has a large number of rules. In the end, a more compact model results by eliminating redundant rules. The computational burden of these techniques increases geometrically with dimensions (ny and nu). Additionally, heuristic-based stochastic procedures exist to gain both model structures and parameter values simultaneously (Du and Zhang, 2008, Guenounou et al., 2009, Lin, 2008, Lin and Xu, 2006), which however require even more computation resources.

The GTSK models in this work are for dynamic systems. Therefore, dynamic orders such as ny, nu and pure time delay, d in Eq. (1) are also important structure configurations. Dynamic order determination is well developed for linear systems where a preliminary analysis using autocorrelation and partial autocorrelation is able to estimate dynamic orders (Brockwell & Davis, 1998). For static linear systems, subset selection methods (Miller, 1990) are able to find influential regressors. Analysis of variance can also be used for regressor analysis (Lind & Liung, 2008). For nonlinear dynamic systems with unknown nonlinearities, there is no general method, and order analysis falls into two categories. One approach accepts either known or assumed nonlinear structures. There are various choices of nonlinear structures such as bilinear, Wiener, Hammerstein structures or their combinations. With known nonlinear structures, analysis might be conducted rigorously. Another approach does not depend on a predefined nonlinear structure. The geometric method (Molina, Sampson, Fitzgerald, & Niranjan, 1996), False Nearest Neighbor (Rhodes & Morari, 1995), and Lipschitz Quotient (He & Asada, 1993) all belong to the second category. These methods can be roughly argued upon the first-order Taylor series expansion. However, these methods are sensitive to noise (Nelles, 2001).

The number of rules is coupled with dimensions, ny and nu and more rules are required for a higher dimension. Reducing the number of rules in a TSK model is possible by allowing a dimension difference between antecedents and consequents. The difference is implicitly expressed in (Kawamoto, 1992, Takagi and Sugeno, 1985, Tanaka and Wang, 2001), where a TSK model is used to approximate a known nonlinear state-space model and the antecedent variables are defined as functions of system states. This results in a TSK model with different dimensions in its antecedent and consequent. Antecedent variable selection is explicitly mentioned in (Leith and Leithead, 1999, Shorten et al., 1999), where examples are given to select antecedent variables as nonlinear states from a known nonlinear state-space model. However, neither is applicable to the situation where only input–output data are available and the system model is unknown. In Pomares, Rojas, González, and Prieto (2002), variable selection was proposed in constructing a model for function approximation. However, selected variables are included in both antecedent and consequent, which creates difficulty with a high dimension problem.

In addition to the antecedent dimension affecting the number of rules, the geometric structure in the antecedent also has a significant impact. The antecedent in Eq. (2) assumes a combinatorial structure. Its permutations lead to many rule possibilities.

Once the model structure is determined, the next task is to determine model parameter values. Parameters for a dynamic TSK model include both antecedent and consequent parameters. Parameter estimation could be pursued by solving optimization problems. Steepest decent (Dickerson & Kosko, 1996) and Levenberg–Marquardt (Moreno-Velo, Baturone, Barriga, & Sánchez-Solano, 2007) are popular choices. They all provide local optimal solutions, and trials have to be made from different initial guesses in order to increase the probability of obtaining a global optimal solution (Iyer & Rhinehart, 1999). Genetic algorithms (Cordon et al., 2004) are also used for parameter estimation for a better local solution.

This work proposes several novelties in the successive stages of developing models.

In this work, the dynamic orders, ny, nu, and delay d are determined by selecting influential variables based on proposed recursive estimation, which rearranges the raw data spatially to organize nonlinearity (Section 3). The antecedent variables are then selected as a subset of the determined influential variables, as those which have nonlinear influence on the model output, and are identified by the proposed recursive estimation, again (Section 3E).

Also, in this work, the proposed rule structure has different dimensions in antecedents and consequents. The antecedent dimension is generally lower than the consequent dimension because it only includes variables which have a nonlinear impact on the output, argued to be necessary and sufficient. In addition, the antecedent is designed with one more degree of freedom to include variable interactions to efficiently occupy an antecedent space. The novel rule and antecedent structure reduce the number of rules required.

The number of rules, locations and shapes of local regions in the antecedents are determined by recursively partitioning the antecedent space (Section 4). Once the antecedent space is fully partitioned, the antecedent and consequent parameter values are estimated. The partition is conducted by solving a series of splitting and regression problems.

Section 2 presents the method to reduce the number of rules by using a more general antecedent structure. It concludes with a GTSK model consisting of rules with the new antecedent structure. The variable selection for a nonlinear dynamic process is presented in Section 3. It selects the important regressors defining the overall model dimension and the variables to be used in antecedents. Section 4 presents the details on estimating parameter values based on antecedent space partition. The steps for model development are summarized in Section 5. Several testing and applications are presented in Section 6.

Section snippets

Antecedent dimensions

The direct approach to reduce the number of rules is to control the system dimension, which is unfortunately determined by the nature of the problem but not by users. However, dimension reduction in the antecedent is still possible for a given dimension system by excluding variables that appear linearly.

To illustrate dimension reduction, consider the following dynamic model with three regressors, [y(t  1)y(t  2)u(t  1)]y(t)=y(t1)[y(t2)+2.5]+y(t1)2u(t1)

Using the rule structure in Eq. (2), the

Order determination and antecedent variable selection

The first step in modeling is variable selection. This work first determines the orders, ny and nu, and delay d for a nonlinear dynamic system as defined in Eq. (1). The value of ny, nu and d give the set of consequent variables in Eq. (12). Antecedent variables are then selected from the consequent variables.

Estimation of parameter values

The rule structure is determined now that (c1, …, cnc) and (x1, …, xnx) are selected. The next task is to determine the number of rules as well as the parameter values in each rule.

Procedure summary

The above procedure for converting input–output data to a GTSK model is summarized:

Step 1

Determine dynamic orders, ny, nu and delay d by using the SNNR (Section 3.2) to rearrange data, recursive estimation (Eq. (16)) to process rearranged data, and the modified FPE (Eq. (25)) to evaluate a particular choice of regressor set.

Step 2

Determine antecedent variables (c1, …, cnc) from consequent variables (x1, …, xnx) due to the selected (ny, nu, d) in Step 1 (Section 3.5).

Step 3

Recursively partition the antecedent

Testing models

Models used to test the proposed structure determination are:

  • Model 1 (Narendra & Parthasarathy, 1990):y(t)=0.3y(t1)+0.6y(t2)+0.6sin(πu(t1))+0.3sin(3πu(t1))+0.1sin(5πu(t1))+e(t)where e(t)  N(0,0.52).

  • Model 2 (Narendra & Parthasarathy, 1990):y(t)=y(t1)y(t2)(y(t1)+2.5)1+y(t1)2+y(t2)2+u(t1)+e(t)where e(t)  N(0,0.52)

  • Model 3 (Narendra & Parthasarathy, 1990):y(t)=y(t1)y(t2)y(t3)u(t2)(y(t3)1)+u(t1)1+y(t3)2+y(t2)2+e(t)where e(t)  N(0,0.052).

  • Model 4 (Narendra & Parthasarathy, 1990):y(t)

Conclusions

The proposed rule antecedent structure is able to substantially reduce the complexity in a TSK model.

Instead of directly estimating model parameters, the proposed approach solves a series of splitting and regression problems to partition the antecedent space as well as compute the antecedent and consequent parameters. The resultant antecedent partition is meaningful. The boundaries divide an antecedent space into regions, within which the process behavior is relatively linear.

The proposed

Acknowledgement

This work was supported in part by the Edward E. and Helen Turner Bartlett Foundation.

References (45)

  • C. Lin et al.

    A hybrid evolutionary learning algorithm for TSK-type fuzzy model design

    Mathematical and Computer Modelling

    (2006)
  • I. Lind et al.

    Regressor and structure selection in NARX models using a structured ANOVA approach

    Automatica

    (2008)
  • F.J. Moreno-Velo et al.

    Automatic tuning of complex fuzzy systems with Xfuzzy

    Fuzzy Sets and Systems

    (2007)
  • J. Ou et al.

    ISA Transactions

    Grouped neural network modeling for model predictive control

    (2002)
  • H. Vernieuwe et al.

    Comparison of clustering algorithms in the identification of Takagi–Sugeno models: A hydrological case study

    Fuzzy Sets and Systems

    (2006)
  • N. Wang et al.

    A fuzzy modeling method via Enhanced Objective Cluster Analysis for designing TSK model

    Expert Systems with Applications

    (2009)
  • Z. Zhang et al.

    An application of Takagi–Sugeno fuzzy system to the classification of cancer patients based on elemental contents in serum samples

    Chemometrics and Intelligent Laboratory Systems

    (2006)
  • S.P. Boyd et al.

    Convex optimization

    (2004)
  • L. Breiman et al.

    Classification and regression trees

    (1984)
  • P.J. Brockwell et al.

    Time series: Theory and methods

    (1998)
  • J.A. Dickerson et al.

    Fuzzy function approximation with ellipsoidal rules

    IEEE Transactions on Systems, Man, and Cybernetics—Part B: Cybernetics

    (1996)
  • B. Hartmann et al.

    On the smoothness in local model networks

  • Cited by (9)

    • MGP-INTACTSKY: Multitree Genetic Programming-based learning of INTerpretable and ACcurate TSK sYstems for dynamic portfolio trading

      2015, Applied Soft Computing Journal
      Citation Excerpt :

      In this way, the expressive power that each rule can provide is increased and therefore a more accurate system can be developed [15]. Furthermore, the dimension difference between antecedents and consequents leads to a system having a smaller number of rules and a more compact rule base [45]. To learn such a rule base, the algorithm must have the ability to represent variable sized rules with different structures.

    • Forecasting semi-dynamic response of natural gas networks to nodal gas consumptions using genetic fuzzy systems

      2015, Energy
      Citation Excerpt :

      A method is reported for reducing number of rules required in the antecedent part of the system using nonlinear variables. Antecedent and consequent variables are determined in that approach using recursive estimation and parameters of the system are identified after solving a series of splitting and regression problems [20]. Fuzzy neural network based on TSK model is used for obtaining reasonable fuzzy space partition after applying a subtractive clustering algorithm to find cluster centers.

    • Optimization of fuzzy model using genetic algorithm for process control application

      2011, Journal of the Franklin Institute
      Citation Excerpt :

      This ability of the TS fuzzy model provides a better alternative way in solving the critical weakness found in conventional techniques when facing with nonlinear system identification. The recursive least squares estimation can be easily used for constructing a linear system as a means for tuning the TS fuzzy model [2,12,13]. Passino and Yurkovich had also described the potential of using the least squares estimation method to solve an array of linear equations at the consequent part of the fuzzy model [14].

    • Takagi-Sugeno Fuzzy Logic Systems

      2017, Nonlinear Physical Science
    • Nonlinear regression modeling for engineering applications: Modeling, model validation, and enabling design of experiments

      2016, Nonlinear Regression Modeling for Engineering Applications: Modeling, Model Validation, and Enabling Design of Experiments
    View all citing articles on Scopus
    View full text