A generalized TSK model with a novel rule antecedent structure: Structure identification and parameter estimation

doi:10.1016/j.compchemeng.2010.01.007

Computers & Chemical Engineering

Volume 34, Issue 8, 9 August 2010, Pages 1199-1219

https://doi.org/10.1016/j.compchemeng.2010.01.007

Abstract

TSK fuzzy models are convenient tools for describing complex nonlinear behavior. However, the combinatorial antecedent structure of existing TSK models makes them suffer substantially from the curse of dimensionality. In this work, a novel rule antecedent structure is proposed to design an efficient generalized TSK (GTSK) model that uses fewer rules. The new rule antecedent uses only nonlinear variables. Additionally, one more degree of freedom is introduced in the antecedent design so that the antecedents cover the antecedent space more efficiently, which further reduces the number of rules. The resulting GTSK model is identified in two stages. A novel recursive estimation based on spatially rearranged data is used to determine the consequent and antecedent variables. Model parameter values are then obtained from the partitioned antecedent space, which is the result of solving a series of splitting and regression problems.

Introduction

TSK (Takagi–Sugeno–Kang) fuzzy models have gained popularity in fields including modeling (Chen, 2009; Cococcioni et al., 2007; Jacquin and Shamseldin, 2006), control (Guelton et al., 2009; Khiar et al., 2007), forecasting (Chang & Liu, 2008), and classification (Zhang, Zhou, Liu, & Harrington, 2006). The popularity is partly due to the interpretability rendered by linguistic terms such as ‘High’ and ‘Fast’ that are used to construct rules. The cause-and-effect format of an IF … THEN structure is also easy for humans to understand and interpret. Interpretability of a TSK model is further enhanced by the divide-and-conquer concept inherent in the model: each rule describes a part of the model behavior, and the rules work together to give a complete description, so a complex model can be understood by understanding each of its parts. More importantly, the popularity is rooted in a fundamental property: a TSK fuzzy model is a universal approximator (Kosko, 1994), which guarantees its ability to describe almost any nonlinear behavior given a sufficiently flexible structure.

The following presentation is based on a single-input–single-output (SISO) model. The extension to multiple-input–multiple-output (MIMO) models is addressed where necessary. Eq. (1) is a general expression of a SISO dynamic system with dynamic orders ny and nu, pure time delay d, and an additive disturbance e(t): $y(t) = f\big(y(t-1), \dots, y(t-n_y),\; u(t-d), \dots, u(t-n_u-d)\big) + e(t)$ where y is the system response and u is the system input.
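To make the regressor set of Eq. (1) concrete, the following minimal sketch assembles the lagged values y(t − 1), …, y(t − ny), u(t − d), …, u(t − nu − d) from recorded sequences. The function name `build_regressors` and the row layout are assumptions made for illustration; they do not come from the paper.

```python
import numpy as np

def build_regressors(y, u, ny, nu, d):
    """Stack rows [y(t-1..t-ny), u(t-d..t-nu-d)] and the matching targets y(t).

    Hypothetical helper: the name and layout are illustrative assumptions.
    """
    y = np.asarray(y, dtype=float)
    u = np.asarray(u, dtype=float)
    start = max(ny, nu + d)            # first t for which every lag exists
    rows = []
    for t in range(start, len(y)):
        past_y = [y[t - i] for i in range(1, ny + 1)]
        past_u = [u[t - d - j] for j in range(0, nu + 1)]
        rows.append(past_y + past_u)
    return np.array(rows), y[start:]

# Small usage example with synthetic sequences.
X, target = build_regressors(list(range(10)), [10 * k for k in range(10)],
                             ny=2, nu=1, d=1)
```

Each row has ny + nu + 1 entries, which matches the number of antecedent variables that appears in the rule of Eq. (2).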

The rule corresponding to Eq. (1) (the rth rule in a TSK model) is described by $\begin{array}{l} \text{IF } \big(y(t-1) \text{ is } A_{1}^{r} \text{ AND } \dots \text{ AND } u(t-n_u-d) \text{ is } A_{n_y+n_u+1}^{r}\big) \\ \text{THEN } A^{r}(z^{-1})\, y(t) = k^{r} + B^{r}(z^{-1})\, u(t-d) \\ A^{r}(z^{-1}) = 1 + a_{1}^{r} z^{-1} + \dots + a_{n_y}^{r} z^{-n_y} \\ B^{r}(z^{-1}) = b_{0}^{r} + b_{1}^{r} z^{-1} + \dots + b_{n_u}^{r} z^{-n_u} \end{array}$ where z⁻¹ is the backshift operator, and the expression $y(t-1) \text{ is } A_{1}^{r} \text{ AND } \dots \text{ AND } u(t-n_u-d) \text{ is } A_{n_y+n_u+1}^{r}$ is the antecedent of the rule. The variables y(t − 1), …, y(t − ny), u(t − d), …, u(t − nu − d) are the antecedent variables, and $A_{1}^{r}$ is the fuzzy subset for y(t − 1) in the rule. The consequent of the rule is a local linear model A^r(z⁻¹)y(t) = k^r + B^r(z⁻¹)u(t − d) with dynamic orders ny and nu, pure time delay d, and a constant k^r.
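As an illustration of how a set of rules of the form of Eq. (2) can produce a one-step prediction, the sketch below evaluates a small TSK model. The source does not specify the membership functions or how rule outputs are combined, so Gaussian memberships, a product for AND, and a normalized weighted average are assumptions here. Each row of `thetas` is assumed to hold [k^r, −a₁^r, …, −a_ny^r, b₀^r, …, b_nu^r], which follows from solving the consequent for y(t).

```python
import numpy as np

def tsk_predict(phi, centers, widths, thetas):
    """One-step TSK prediction for a regressor vector phi.

    centers, widths: (R, n) Gaussian membership parameters (assumed form).
    thetas: (R, n + 1) local linear coefficients, constant term first.
    """
    phi = np.asarray(phi, dtype=float)
    # Product of per-variable Gaussian memberships implements the AND.
    firing = np.exp(-0.5 * np.sum(((phi - centers) / widths) ** 2, axis=1))
    weights = firing / firing.sum()              # normalized rule weights
    local = thetas[:, 0] + thetas[:, 1:] @ phi   # output of each local model
    return float(weights @ local)

# Two rules on a single antecedent variable, evaluated midway between them.
centers = np.array([[-1.0], [1.0]])
widths = np.array([[1.0], [1.0]])
thetas = np.array([[0.0, 1.0], [2.0, 0.0]])
y_hat = tsk_predict([0.0], centers, widths, thetas)
```

At the midpoint both rules fire equally, so the prediction is the plain average of the two local model outputs. Far from every rule center the firing strengths can underflow, which a practical implementation would need to guard against.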

A TSK model is defined by a collection of rules of the form of Eq. (2). Identifying a TSK model requires determining the antecedent and consequent structures and estimating the parameter values.

An important structural choice in a TSK model is the number of rules, which should balance model complexity against accuracy. In practice, trials are often needed to find a suitable number. Heuristics based on clustering are used to recognize prototype rules (Dickerson and Kosko, 1996; Vernieuwe et al., 2006; Wang and Yang, 2009), which automatically yields the number of rules. In Nelles (2001), the number of rules grows at each step as one dimension is divided equally. The number of rules may also be determined in a backward fashion (Yen & Wang, 1999): the TSK model initially has a large number of rules, and a more compact model results from eliminating redundant ones. The computational burden of these techniques increases geometrically with the dimensions (ny and nu). Additionally, heuristic-based stochastic procedures exist that determine model structure and parameter values simultaneously (Du and Zhang, 2008; Guenounou et al., 2009; Lin, 2008; Lin and Xu, 2006), but these require even more computational resources.

The GTSK models in this work are for dynamic systems. Therefore, the dynamic orders ny and nu and the pure time delay d in Eq. (1) are also important structural choices. Dynamic order determination is well developed for linear systems, where a preliminary analysis using autocorrelation and partial autocorrelation can estimate the dynamic orders (Brockwell & Davis, 1998). For static linear systems, subset selection methods (Miller, 1990) can find influential regressors. Analysis of variance can also be used for regressor analysis (Lind & Ljung, 2008). For nonlinear dynamic systems with unknown nonlinearities there is no general method, and order analysis falls into two categories. The first approach accepts either known or assumed nonlinear structures, such as bilinear, Wiener, or Hammerstein structures or their combinations. With a known nonlinear structure, the analysis can be conducted rigorously. The second approach does not depend on a predefined nonlinear structure. The geometric method (Molina, Sampson, Fitzgerald, & Niranjan, 1996), False Nearest Neighbor (Rhodes & Morari, 1995), and Lipschitz Quotient (He & Asada, 1993) all belong to this second category. These methods can be loosely justified by a first-order Taylor series expansion. However, they are sensitive to noise (Nelles, 2001).

The number of rules is coupled with the dimensions ny and nu, and more rules are required for a higher dimension. Reducing the number of rules in a TSK model is possible by allowing a dimension difference between antecedents and consequents. The difference is implicit in Kawamoto (1992), Takagi and Sugeno (1985), and Tanaka and Wang (2001), where a TSK model is used to approximate a known nonlinear state-space model and the antecedent variables are defined as functions of the system states. This results in a TSK model with different dimensions in its antecedent and consequent. Antecedent variable selection is explicitly mentioned in Leith and Leithead (1999) and Shorten et al. (1999), where examples are given of selecting antecedent variables as the nonlinear states of a known nonlinear state-space model. However, neither approach is applicable when only input–output data are available and the system model is unknown. In Pomares, Rojas, González, and Prieto (2002), variable selection was proposed for constructing a model for function approximation. However, the selected variables are included in both antecedent and consequent, which creates difficulty for high-dimensional problems.

In addition to the antecedent dimension affecting the number of rules, the geometric structure in the antecedent also has a significant impact. The antecedent in Eq. (2) assumes a combinatorial structure. Its permutations lead to many rule possibilities.
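As a rough illustration of this growth, assume a grid partition in which each antecedent variable is covered by $m$ fuzzy subsets (the symbol $m$ and the grid assumption are introduced here for illustration and are not taken from the source). The antecedent of Eq. (2) has $n_y + n_u + 1$ variables, so the full combinatorial rule base contains $R = m^{\,n_y + n_u + 1}$ rules. With $m = 3$ and $n_y = n_u = 2$ this already gives $R = 3^{5} = 243$ rules, whereas an antecedent restricted to two of those variables would need at most $3^{2} = 9$ grid cells.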

Once the model structure is determined, the next task is to determine the model parameter values. Parameters of a dynamic TSK model include both antecedent and consequent parameters. Parameter estimation can be pursued by solving optimization problems. Steepest descent (Dickerson & Kosko, 1996) and Levenberg–Marquardt (Moreno-Velo, Baturone, Barriga, & Sánchez-Solano, 2007) are popular choices. Both provide only locally optimal solutions, so trials from different initial guesses are needed to increase the probability of obtaining a globally optimal solution (Iyer & Rhinehart, 1999). Genetic algorithms (Cordon et al., 2004) are also used in parameter estimation to obtain a better local solution.

This work proposes several novelties in the successive stages of developing models.

In this work, the dynamic orders ny and nu and the delay d are determined by selecting influential variables with a proposed recursive estimation, which rearranges the raw data spatially to organize the nonlinearity (Section 3). The antecedent variables are then selected as the subset of the influential variables that have a nonlinear influence on the model output; they are again identified by the proposed recursive estimation (Section 3.5).

Also, in this work, the proposed rule structure has different dimensions in antecedents and consequents. The antecedent dimension is generally lower than the consequent dimension because it only includes variables which have a nonlinear impact on the output, argued to be necessary and sufficient. In addition, the antecedent is designed with one more degree of freedom to include variable interactions to efficiently occupy an antecedent space. The novel rule and antecedent structure reduce the number of rules required.

The number of rules, locations and shapes of local regions in the antecedents are determined by recursively partitioning the antecedent space (Section 4). Once the antecedent space is fully partitioned, the antecedent and consequent parameter values are estimated. The partition is conducted by solving a series of splitting and regression problems.
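The idea of alternating a splitting problem with a regression problem can be sketched generically. The example below is not the paper's procedure: it is a simplified stand-in that searches only axis-orthogonal thresholds and scores each candidate by the summed squared error of two local affine least-squares fits. The names `fit_affine` and `best_split` and the `min_points` guard are assumptions for illustration.

```python
import numpy as np

def fit_affine(X, y):
    """Least-squares affine fit; returns (coefficients, sum of squared errors)."""
    A = np.column_stack([np.ones(len(X)), X])
    theta, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ theta
    return theta, float(resid @ resid)

def best_split(Z, X, y, min_points=5):
    """Search axis-orthogonal splits of the antecedent data Z.

    Z: (N, nc) antecedent variables, X: (N, nx) consequent variables.
    Returns (sse, dimension, threshold) of the best split, or None.
    """
    best = None
    for dim in range(Z.shape[1]):
        for thr in np.unique(Z[:, dim])[:-1]:
            left = Z[:, dim] <= thr
            if left.sum() < min_points or (~left).sum() < min_points:
                continue
            sse = (fit_affine(X[left], y[left])[1]
                   + fit_affine(X[~left], y[~left])[1])
            if best is None or sse < best[0]:
                best = (sse, dim, float(thr))
    return best

# Piecewise-linear toy data with a kink at zero.
z = np.linspace(-1.0, 1.0, 40)
Z = z.reshape(-1, 1)
split = best_split(Z, Z, np.abs(z))
```

Applying `best_split` again inside each resulting region would give a recursive partition. Stopping criteria and the shape of the region boundaries are design choices that this sketch leaves open.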

Section 2 presents the method to reduce the number of rules by using a more general antecedent structure. It concludes with a GTSK model consisting of rules with the new antecedent structure. Variable selection for a nonlinear dynamic process is presented in Section 3. It selects the important regressors that define the overall model dimension and the variables to be used in the antecedents. Section 4 presents the details of estimating parameter values based on a partition of the antecedent space. The steps for model development are summarized in Section 5. Several tests and applications are presented in Section 6.


Antecedent dimensions

The direct approach to reducing the number of rules is to control the system dimension, which unfortunately is determined by the nature of the problem rather than by the user. However, for a system of given dimension, the antecedent dimension can still be reduced by excluding variables that appear linearly.

To illustrate dimension reduction, consider the following dynamic model with three regressors, [y(t − 1), y(t − 2), u(t − 1)]: $y(t) = y(t-1)\,[y(t-2) + 2.5] + y(t-1)^{2}\, u(t-1)$

Using the rule structure in Eq. (2), the
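For the three-regressor example above, a quick numerical check shows which regressors enter nonlinearly. The sketch below is an illustrative check and not part of the paper's method: it takes a central second difference along each regressor while holding the others fixed. A zero result means the model is affine in that regressor at the chosen point. The operating point and step size are arbitrary assumptions.

```python
def f(y1, y2, u1):
    """Right-hand side of the example: y1 = y(t-1), y2 = y(t-2), u1 = u(t-1)."""
    return y1 * (y2 + 2.5) + y1 ** 2 * u1

def second_difference(g, x, h=0.5):
    """Central second difference; zero (up to rounding) when g is affine in x."""
    return g(x + h) - 2.0 * g(x) + g(x - h)

y1, y2, u1 = 0.7, -0.3, 1.2   # arbitrary operating point (assumption)
curv_y1 = second_difference(lambda v: f(v, y2, u1), y1)
curv_y2 = second_difference(lambda v: f(y1, v, u1), y2)
curv_u1 = second_difference(lambda v: f(y1, y2, v), u1)
print(curv_y1, curv_y2, curv_u1)
```

Only the difference along y(t − 1) is nonzero. Once y(t − 1) is fixed, the model is affine in y(t − 2) and u(t − 1), which suggests that y(t − 1) alone is a candidate antecedent variable under the criterion of excluding variables that appear linearly.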

Order determination and antecedent variable selection

The first step in modeling is variable selection. This work first determines the orders ny and nu and the delay d for a nonlinear dynamic system as defined in Eq. (1). The values of ny, nu and d give the set of consequent variables in Eq. (12). Antecedent variables are then selected from the consequent variables.

Estimation of parameter values

The rule structure is determined now that (c₁, …, c_nc) and (x₁, …, x_nx) are selected. The next task is to determine the number of rules as well as the parameter values in each rule.

Procedure summary

The above procedure for converting input–output data to a GTSK model is summarized:

Step 1

Determine dynamic orders, ny, nu and delay d by using the SNNR (Section 3.2) to rearrange data, recursive estimation (Eq. (16)) to process rearranged data, and the modified FPE (Eq. (25)) to evaluate a particular choice of regressor set.

Step 2

Determine the antecedent variables (c₁, …, c_nc) from the consequent variables (x₁, …, x_nx) implied by the (ny, nu, d) selected in Step 1 (Section 3.5).

Step 3

Recursively partition the antecedent

Testing models

Models used to test the proposed structure determination are:

Model 1 (Narendra & Parthasarathy, 1990): $y(t) = 0.3\, y(t-1) + 0.6\, y(t-2) + 0.6 \sin(\pi u(t-1)) + 0.3 \sin(3 \pi u(t-1)) + 0.1 \sin(5 \pi u(t-1)) + e(t)$ where e(t) ∼ N(0, 0.5²).
Model 2 (Narendra & Parthasarathy, 1990): $y(t) = \frac{y(t-1)\, y(t-2)\, (y(t-1) + 2.5)}{1 + y(t-1)^{2} + y(t-2)^{2}} + u(t-1) + e(t)$ where e(t) ∼ N(0, 0.5²).
Model 3 (Narendra & Parthasarathy, 1990): $y(t) = \frac{y(t-1)\, y(t-2)\, y(t-3)\, u(t-2)\, (y(t-3) - 1) + u(t-1)}{1 + y(t-3)^{2} + y(t-2)^{2}} + e(t)$ where e(t) ∼ N(0, 0.05²).
Model 4 (Narendra & Parthasarathy, 1990): $y (t)$
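
To show how input–output data for such a benchmark could be generated, the following sketch simulates Model 1. The excitation signal (uniform on [−1, 1]), the zero initial conditions, the sequence length, and the random seeds are assumptions, since the snippet above does not state them.

```python
import numpy as np

def simulate_model1(u, noise_std=0.5, seed=0):
    """Simulate Model 1 for an input sequence u, starting from y(0) = y(1) = 0."""
    rng = np.random.default_rng(seed)
    u = np.asarray(u, dtype=float)
    y = np.zeros(len(u))
    for t in range(2, len(u)):
        g = (0.6 * np.sin(np.pi * u[t - 1])
             + 0.3 * np.sin(3 * np.pi * u[t - 1])
             + 0.1 * np.sin(5 * np.pi * u[t - 1]))
        # e(t) ~ N(0, noise_std^2), matching N(0, 0.5^2) by default
        y[t] = 0.3 * y[t - 1] + 0.6 * y[t - 2] + g + rng.normal(0.0, noise_std)
    return y

u = np.random.default_rng(1).uniform(-1.0, 1.0, size=500)  # assumed excitation
y = simulate_model1(u)
```

Setting `noise_std=0.0` gives the noise-free response, which is convenient for checking an identified model against the true dynamics.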

Conclusions

The proposed rule antecedent structure is able to substantially reduce the complexity in a TSK model.

Instead of directly estimating model parameters, the proposed approach solves a series of splitting and regression problems to partition the antecedent space as well as compute the antecedent and consequent parameters. The resultant antecedent partition is meaningful. The boundaries divide an antecedent space into regions, within which the process behavior is relatively linear.

The proposed

Acknowledgement

This work was supported in part by the Edward E. and Helen Turner Bartlett Foundation.

References (45)

P.-C. Chang et al., A TSK type fuzzy rule based system for stock price prediction, Expert Systems with Applications (2008)
R. Chen, RFM-based eco-efficiency analysis using Takagi–Sugeno fuzzy and AHP approach, Environmental Impact Assessment Review (2009)
M. Cococcioni et al., Estimating the concentration of optically active constituents of sea water by Takagi–Sugeno models with quadratic rule consequents, Pattern Recognition (2007)
O. Cordon et al., Ten years of genetic fuzzy systems: current framework and new trends, Fuzzy Sets and Systems (2004)
H. Du et al., Application of evolving Takagi–Sugeno fuzzy model to nonlinear system identification, Applied Soft Computing (2008)
K. Guelton et al., Robust dynamic output feedback fuzzy Lyapunov stabilization of Takagi–Sugeno systems: a descriptor redundancy approach, Fuzzy Sets and Systems (2009)
O. Guenounou et al., Multi-objective optimization of TSK fuzzy models, Expert Systems with Applications (2009)
A.P. Jacquin et al., Development of rainfall–runoff models using Takagi–Sugeno fuzzy inference systems, Journal of Hydrology (2006)
D. Khiar et al., Robust Takagi–Sugeno fuzzy control of a spark ignition engine, Control Engineering Practice (2007)
C. Lin, An efficient immune-based symbiotic particle swarm optimization learning algorithm for TSK-type neuro-fuzzy networks design, Fuzzy Sets and Systems (2008)
C. Lin et al., A hybrid evolutionary learning algorithm for TSK-type fuzzy model design, Mathematical and Computer Modelling (2006)
I. Lind et al., Regressor and structure selection in NARX models using a structured ANOVA approach, Automatica (2008)
F.J. Moreno-Velo et al., Automatic tuning of complex fuzzy systems with Xfuzzy, Fuzzy Sets and Systems (2007)
J. Ou et al., Grouped neural network modeling for model predictive control, ISA Transactions (2002)
H. Vernieuwe et al., Comparison of clustering algorithms in the identification of Takagi–Sugeno models: A hydrological case study, Fuzzy Sets and Systems (2006)
N. Wang et al., A fuzzy modeling method via Enhanced Objective Cluster Analysis for designing TSK model, Expert Systems with Applications (2009)
Z. Zhang et al., An application of Takagi–Sugeno fuzzy system to the classification of cancer patients based on elemental contents in serum samples, Chemometrics and Intelligent Laboratory Systems (2006)
S.P. Boyd et al., Convex optimization (2004)
L. Breiman et al., Classification and regression trees (1984)
P.J. Brockwell et al., Time series: Theory and methods (1998)
J.A. Dickerson et al., Fuzzy function approximation with ellipsoidal rules, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics (1996)
B. Hartmann et al., On the smoothness in local model networks
