A robust multiple regression model based on fuzzy random variables

https://doi.org/10.1016/j.cam.2020.113270Get rights and content

Abstract

In the present paper, a novel robust multiple regression model with fuzzy intercepts and non-fuzzy regression coefficients was proposed. A two-stage robust procedure adopted with fuzzy random variables and α-values of LR-fuzzy was also introduced to estimate the components of the model. Some common goodness-of-fit criteria were also used to evaluate the performance of the proposed method. The effectiveness of the proposed method was compared to some common fuzzy robust regression models through three numerical examples including a simulation study. The numerical results indicated the lower sensitivity of the proposed model to outliers and its higher precision compared to the other existing robust regression methods.

Introduction

Multiple regression analysis is the most basic and commonly used statistical technique to estimate the relationships between some independent predictor variables and a dependent response variable. Such techniques often rely on the least square errors. However, the least square estimations may misbehave when the data set involves some outliers. One approach to overcome this problem is to remove the influential observations from the least-square fit [1]. A modified approach, known as robust regression, employs a fitting criterion with no such susceptibility to these unusual data as much as the least squares. The most common general method of robust regression is M-estimation introduced by Huber [2]. It generally gives better accuracies compared to the least-square method as it utilizes a weighting mechanism to weigh down the influential observations. However, the traditional regression analysis often relies on exact information such as data or coefficients. Since its introduction by Tanaka et al. [3], the fuzzy regression methods have gained considerable attention in real-life applications. These methods can be categorized into two classes: the observations of the predictors can be either fuzzy numbers [4], [5], [6], [7], [8], [9], [10], [11], [12] or real value quantities [13], [14], [15], [16], [17].

Several outlier robust methods have been proposed based on fuzzy information [18], [19], [20], [21], [22], [23]. In this paper, a common and popular robust estimation method (called M-estimation [2]) was employed in cases where both predictors and responses are reported as random fuzzy quantities. For this purpose, a novel idea was proposed for the fuzzy multiple regression model. To do so, a two-stage procedure was proposed to estimate the fuzzy intercept and exact regression coefficients of the model at the presence of outliers. The performance of the developed method was then compared with several already-available fuzzy robust regression models in terms of some common goodness-of-fit criteria. For practical reasons, the proposed method was further evaluated through a simulated study and two applied examples. The numerical results indicated the sufficient accuracy of the proposed method (compared to others) at the presence of outliers in the data set.

The rest of this paper is organized as follows: Section 2 reviews some concepts including α-values of fuzzy numbers and fuzzy random variables. The methodology is proposed in Section 3 to estimate the fuzzy intercept and non-fuzzy coefficients of a fuzzy multiple regression model with fuzzy predictors and fuzzy responses. Section 4 illustrates three numerical examples to assess the effectiveness and performance of the proposed method relative to the other fuzzy multiple regression methods in terms of some common performance measures. Finally, the main contributions of this paper are summarized in Section 5.

Section snippets

Fuzzy numbers

This section reviews some basic definitions of fuzzy numbers based on [9].

A fuzzy set [24] A˜ of R (the real line) can be defined by its membership function μA˜:R[0,1]. For each α(0,1], the subset {xRμA˜(x)α} is called the α-cut of A˜ and is denoted by A˜[α]. The set A˜[0]= {xR:μA˜(x)>0}¯ is called the support of A˜ [25] where A¯ shows the closure of A. The lower and upper bounds of A˜[α], α[0,1] are denoted by A˜L[α] and A˜U[α], respectively. Moreover, a fuzzy set A˜ of R is called a

A robust multiple regression model based on fuzzy random variables

A multiple regression model with fuzzy predictors and fuzzy responses is introduced in this section at the presence of outliers in the data set. Assume that the observed data on n FRVs are denoted by Y˜i,(X˜i1,X˜i2,,X˜ik). Based on the aforementioned data set, the following fuzzy multiple linear regression model can be considered: Y˜i=β˜0j=1k(βjX˜ij)ε˜i,i=1,2,,n,where

  • 1.

    Y˜iF(R) denotes the fuzzy response,

  • 2.

    X˜ijF(R) are fuzzy predictors,

  • 3.

    β˜0F(R) presents the fuzzy intercept,

  • 4.

    βjR stands for the

Numerical examples

This section compares the feasibility and effectiveness of the proposed fuzzy multivariate regression model with several robust fuzzy regression models. The main focus was on some common fuzzy multiple robust regression models introduced by Akbari and Hesamian [4], Chachi et al. [19], and D’Urso et al. [13]. To conduct a comparative study, the measures explained in Remark 3.1 were applied to calculate the goodness-of-fit criteria.

Example 4.1 A Simulation Study

Here, a set of m=10 simulated data of size n=100 was generated

Conclusion

A novel and robust methodology was developed in this study for a fuzzy multiple regression model with fuzzy predictors, response variable, and intercept, and non-fuzzy coefficients. To this end, fuzzy statistics relevant to fuzzy random variables and the conventional M-estimator were employed to estimate the known exact regression coefficients and control the effect of outliers on the regression performances. In this context, a generalized subtraction and a modified defuzzified method were

Acknowledgments

The authors would like to thank the editor and anonymous reviewer for their constructive suggestions and comments, which improved the presentation of this work.

References (36)

  • ZengW. et al.

    Fuzzy least absolute linear regression

    Appl. Soft Comput.

    (2017)
  • D’UrsoP. et al.

    Robust fuzzy regression analysis

    Inform. Sci.

    (2011)
  • ZadehL.A.

    Fuzzy sets as a basis for a theory of possibility

    Fuzzy Sets and Systems

    (1978)
  • StefaniniL.

    A generalization of Hukuhara difference and division for interval and fuzzy arithmetic

    Fuzzy Sets and Systems

    (2010)
  • PuriM.L. et al.

    Differentials of fuzzy functions

    J. Math. Anal. Appl.

    (1983)
  • ZwickR. et al.

    Measures of similarity amongst fuzzy concepts: A comparative analysis

    Internat. J. Approx. Reason.

    (1987)
  • MaronnaR.A. et al.

    Robust Statistics: Theory and Methods

    (2006)
  • HuberP.

    Robust Statistics

    (1981)
  • Cited by (7)

    • Fuzzy regression analysis based on M-estimates

      2022, Expert Systems with Applications
    • An OWA-based approach to quantile fuzzy regression

      2021, Computers and Industrial Engineering
      Citation Excerpt :

      Hesamian & Akbari (2020a) investigated a robust varying coefficient approach to fuzzy multiple regression model. Hesamian & Akbari (2020b) adopted a two-stage robust procedure to propose and estimate the components of a robust multiple regression model with fuzzy intercepts and crisp coefficient. Khammar, Arefi, & Akbari (2020, 2021) presented general approaches to fit fuzzy regression models crisp/fuzzy input and fuzzy output.

    • Quantile Fuzzy Varying Coefficient Regression based on kernel function

      2021, Applied Soft Computing
      Citation Excerpt :

      There are several robust approaches in fuzzy regression theory. For example, we refer to Arefi [1], Chang and Lee [2], Oussalsh and Schutter [3], Sanli and Apaydin [4], Shon [5], Choi and Buckley [6], Nasrabadi and Hashemi [7], Kula et al. [8], D’Urso and Massari [9], Chachi and Taheri [10], Taheri and Kelkinnama [11], Zeng et al. [12], Chachi [13], Khammar et al. [14], Taheri and Chachi [15], Hesamian and Akbari [16] and Khammar et al. [17]. Shen et al. [18] proposed a generalized fuzzy regression model (called fuzzy varying coefficient model) in which the fuzzy coefficients of a model are dependent upon the covariate variables.

    • Estimating Probability Values Based on Naïve Bayes for Fuzzy Random Regression Model

      2023, International Journal of Advanced Computer Science and Applications
    View all citing articles on Scopus
    View full text