On robust fuzzy c-regression models

doi:10.1016/j.fss.2014.12.004

Fuzzy Sets and Systems

Volume 279, 15 November 2015, Pages 112-129

https://doi.org/10.1016/j.fss.2014.12.004 Get rights and content

Abstract

One of the most popular clustering methods based on minimization of a criterion function is the fuzzy c-means one. Its generalization by application of hyperplane shaped prototypes of the clusters is known as the Fuzzy C-Regression Models (FCRM) method. Although with this generalization many new applications of clustering emerged, it appeared to be rather sensitive to poor initialization and to the presence of noise and outliers in data. In this paper we introduce a new objective function, using the Huber's M-estimators and the Yager's OWA operators to overcome the disadvantages of the approach considered. We derive and describe an algorithm for minimization of the objective function defined. We have called it the Fuzzy C-Ordered-Regression Models (FCORM) clustering algorithm. The algorithm is compared to a few other important reference ones. To this end experiments on synthetic data with various types of noise and different numbers of outliers are carried out. We investigate the methods performance in the conditions that can be encountered in signal analysis. Large-scale simulations demonstrate the competitiveness and usefulness of the method proposed.

Introduction

The unsupervised classification of data into groups is called clustering. The method plays an important role in many engineering fields, such as pattern recognition, computer vision, machine learning, image analysis, communication, knowledge discovery, data mining and so on [1], [10], [19]. In the traditional so-called hard clustering, the groups (clusters) are disjoint. Each data item belongs to one cluster only. In [43] Zadeh introduced the notion of a fuzzy membership function. It allowed to associate with each data item and each cluster a real number in the interval [0,1] representing the “grade of membership” of this item in the cluster considered. This way Zadeh formed the basis for the development of fuzzy (or soft) clustering. However, the idea itself has been introduced by Ruspini [35] and Dunn [9], and generalized by Bezdek who developed an approach based on criterion function minimization [1].

One of the most popular clustering methods based on criterion function minimization is the Fuzzy C-Means (FCM) method which has successfully been applied to a wide variety of problems [1], [5], [6], [21], [24], [25], [33]. Many modifications of the FCM method have been proposed in the literature. Most of them rely on an inclusion of additional information into the clustering process. In an important group of FCM modifications, the information about the shapes of clusters prototypes is exploited. This information is relayed to the algorithms as the constraints on these shapes: in [1] and [26] the prototypes were constrained to linear varieties or linear elliptotypes in a feature space, in [12] to hyperellipsoidal ones and in [11] to hyperspheres in a feature space. This work deals with a modification of the Fuzzy C-Regression Models (FCRM) method, which is also called as the method of Fuzzy Switching Regression Models (FSRM)¹ [13], where the prototypes are constrained to functions (usually but not necessarily the linear ones). Many modifications of the above algorithm have recently appeared in the literature. The method proposed by Menard introduces additionally a so called noise cluster and, besides, it takes into account uncertainty with regard to data membership in linear regression models, near the models intersection [29]. In [23] ε-insensitive loss function was applied to the determination of data distances from the regression lines. A combination of clustering methods and the method of support vector regression was presented in [3]. The concept of α-cut implementation of the fuzzy c-regression models was introduced by Yang et al. [39]. To obtain robustness against bad initialization, the mountain clustering method was used [38] for the determination of linear regression parameters. In [40] support vector machines method was combined with clustering in the kernel space to improve robustness in classification problems.

In another group of fuzzy clustering modifications, the information about the data non-Gaussian distribution and the presence of noise and outliers is taken into account. This group includes: possibilistic clustering [8], [19], fuzzy noise clustering [7], $L_{p}$ ( $0 < p < 1$ ) norm clustering [14], $L_{1}$ and $L_{\infty}$ norm clustering [2], [16], [17], fuzzy c-ordered-means clustering [27], time-domain-constrained clustering [25], ε-insensitive fuzzy clustering [20] and ε-insensitive fuzzy c-regression models clustering [23].

Two different approaches to clustering algorithms robustifying against outliers are worth emphasizing. They are: application of reweighting scenario based Huber's M-estimators [15] and the use of the ordered weighted averaging (OWA) operation [41]. In [27] both approaches were combined for the first time (the aim was to robustify a modification of the FCM method). The goal of this paper is to show that the Huber's M-estimators and the Yager's OWA operators can be used together to improve robustness of the method of the fuzzy c-regression models significantly. The second goal of this work is to investigate the performance of the proposed method when applied to data in the presence of noise and outliers. We will first present the proposed fuzzy c-ordered-regression models method, which exploits both approaches to be more robust, and then we will investigate its performance.

The investigations are aimed not only to present the proposed method competitiveness with respect to the reference methods but also to show its high practical value. To this end its application to the Q wave onset determination in the ECG signals can be considered. The Q wave begins the depolarization of the heart ventricles. Depolarization is followed by repolarization which ends at the offset of the T wave. Thus the both phases of the heart contraction are covered by a so called QT interval, presented in Fig. 1. Precise determination of the interval borders is of high clinical importance, yet it is rather difficult when the analyzed signals are noisy [18]. Fitting two linear models to the signal segments preceding and following the Q wave, we can determine the wave onset at the point where the lines intersect. Thus the switching regression models can help us to solve this important problem from the field of biomedical signal processing.

The paper is organized as follows: A detailed description of the Fuzzy C-Ordered Regression Models (FCORM) method is presented in Section 2. The experiments on different synthetic data, simulating the real life signals, are presented and discussed in Section 3. Finally, conclusions are drawn in Section 4.

Section snippets

Fuzzy c-ordered regression models

Suppose we have a set, ${Tr}^{(N)} = {(x_{k}, y_{k})}_{k = 1}^{N}$ , where each independent datum $x_{k} \in R^{t}$ has a corresponding dependent datum $y_{k} \in R$ , and N is dataset cardinality. Although the data pairs $(x_{k}, y_{k})$ are unlabeled, we assume that they are drawn from the switching regression models, which consist of c models in the following linear form: $y_{k} = w_{0}^{(i)} + {\tilde{w}}^{(i) ⊤} x_{k} + e_{k}^{(i)}$ for $k = 1, 2, \dots, N$ , where $e_{k}^{(i)}$ represents uncertainty (with zero mean) for the kth pair and the ith model, and $w^{(i)} = {[w_{0}^{(i)}, {\tilde{w}}^{(i) ⊤}]}^{⊤} \in R^{t + 1}$ are the parameters

Numerical experiments and discussion

All experiments were performed in MATLAB™ 7.5 environment on NTT Intel® Core™ i5-3570 CPU @ 3.40 GHz with 6 GB RAM, running Windows 8. The Huber's and the sigmoidal loss functions were applied with the following values of their parameters $δ = 1.0$ , and $α = 8.0$ , $β = 1.0$ , and the weighting functions (SOWA and PLOWA) with $p_{c} = 0.9$ , $p_{l} = 0.1$ and $p_{a} = 0.1$ . The iterations were stopped as soon as the Frobenius norm of the difference between the successive U matrices decreased below $10^{- 3}$ . The uniform and Gaussian

Conclusions

Dealing with the unknown data, we must be aware that they can contain different types of noise and even some outliers. In order to obtain reasonable results in such conditions, we should use robust processing methods. Therefore we have proposed two modifications aimed to robustify the well known fuzzy c-regression models method. We have combined the Huber's M-estimators, based on different loss functions, with the Yager's OWA operators to develop a new method. Incorporation of the above

Acknowledgements

The authors are grateful to the anonymous referees for their constructive comments that have helped to improve the paper. The work was performed using the infrastructure supported by POIG.02.03.01-24-099/13 grant: GeCONiI – Upper Silesian Center for Computational Science and Engineering.

References (43)

R.N. Davé
Characterization and detection of noise in clustering
Pattern Recognit. Lett.
(1991)
K. Jajuga
$L_{1}$ -norm based fuzzy clustering
Fuzzy Sets Syst.
(1991)
M. Kotas
Projective filtering of time-aligned ECG beats for repolarization duration measurement
Comput. Methods Programs Biomed.
(2007)
J.M. Leski
Towards a robust fuzzy clustering
Fuzzy Sets Syst.
(2003)
J.M. Leski
Neuro-fuzzy system with learning tolerant to imprecision
Fuzzy Sets Syst.
(2003)
J.M. Leski et al.
A time-domain-constrained fuzzy clustering method and its application to signal analysis
Fuzzy Sets Syst.
(2005)
M. Menard
Fuzzy clustering and switching regression models using ambiguity and distance rejects
Fuzzy Sets Syst.
(2001)
W. Pedrycz
Conditional fuzzy c-means
Pattern Recognit. Lett.
(1996)
E.H. Ruspini
A new approach to clustering
Inf. Control
(1969)
K.-L. Wu et al.
Mountain c-regressions method
Pattern Recognit.
(2010)

L.A. Zadeh

Fuzzy sets

Inf. Control

(1965)

J.C. Bezdek

Pattern Recognition with Fuzzy Objective Function Algorithms

(1982)

L. Bobrowski et al.

c-Means clustering with the $L_{1}$ and $l_{\infty}$ norms

IEEE Trans. Syst. Man Cybern.

(1991)

C.-C. Chuang

Fuzzy weighted support vector regression with a fuzzy partition

IEEE Trans. Syst. Man Cybern., Part B, Cybern.

(2007)

R.D. Cook

Detection of influential observation in linear regression

Technometrics

(1977)

R. Czabanski et al.

Predicting the risk of low-fetal birth weight from cardiotocographic signals using ANBLIR system with deterministic annealing and ε-insensitive learning

IEEE Trans. Inf. Technol. Biomed.

(2010)

E. Czogala et al.

Fuzzy and Neuro-Fuzzy Intelligent Systems

(2000)

R.N. Davé et al.

Robust clustering methods: a unified view

IEEE Trans. Fuzzy Syst.

(1997)

J.C. Dunn

A fuzzy relative of the ISODATA process and its use in detecting compact well-separated cluster

J. Cybern.

(1973)

B.S. Everitt

Cluster Analysis

(1993)

M. Girolami

Mercer Kernel-based clustering in feature space

IEEE Trans. Neural Netw.

(2002)

Cited by (0)

View full text

On robust fuzzy c-regression models

Abstract

Introduction

Section snippets

Fuzzy c-ordered regression models

Numerical experiments and discussion

Conclusions

Acknowledgements

Pattern Recognit. Lett.

Fuzzy Sets Syst.

Comput. Methods Programs Biomed.

Fuzzy Sets Syst.

Fuzzy Sets Syst.

Fuzzy Sets Syst.

Fuzzy Sets Syst.

Pattern Recognit. Lett.

Inf. Control

Pattern Recognit.

Inf. Control

Pattern Recognition with Fuzzy Objective Function Algorithms

c-Means clustering with the L1 and l∞ norms

IEEE Trans. Syst. Man Cybern.

Fuzzy weighted support vector regression with a fuzzy partition

IEEE Trans. Syst. Man Cybern., Part B, Cybern.

Detection of influential observation in linear regression

Technometrics

Predicting the risk of low-fetal birth weight from cardiotocographic signals using ANBLIR system with deterministic annealing and ε-insensitive learning

IEEE Trans. Inf. Technol. Biomed.

Fuzzy and Neuro-Fuzzy Intelligent Systems

Robust clustering methods: a unified view

IEEE Trans. Fuzzy Syst.

A fuzzy relative of the ISODATA process and its use in detecting compact well-separated cluster

J. Cybern.

Cluster Analysis

Mercer Kernel-based clustering in feature space

IEEE Trans. Neural Netw.

c-Means clustering with the $L_{1}$ and $l_{\infty}$ norms