Improvements on ν-Twin Support Vector Machine
Introduction
Support vector machine (SVM) has evolved as an efficient machine learning tool for binary classification problems (Cortes & Vapnik, 1995). SVM has its foundation in statistical learning theory, and its formulation is based on the structural risk minimization (SRM) principle (Burges, 1998, Vapnik, 1999, Vapnik, 2000). Its optimization task involves minimizing a convex quadratic function subject to linear inequality constraints. SVM was initially developed to solve classification problems but was later extended to regression problems. Over the past few decades, various amendments to SVM have been suggested, such as the Lagrangian support vector machine (LSVM) (Mangasarian & Musicant, 2001), the smooth support vector machine (SSVM) for classification (Lee & Mangasarian, 2001), the least squares support vector machine (LS-SVM) (Suykens & Vandewalle, 1999), the proximal support vector machine (PSVM) (Mangasarian & Wild, 2001), and the generalized eigenvalue proximal SVM (GEPSVM) (Mangasarian & Wild, 2006).
GEPSVM is a nonparallel-plane classifier that generates two hyperplanes instead of one, as opposed to SVM. Taking motivation from GEPSVM, Jayadeva et al. proposed the twin support vector machine (TWSVM) (Jayadeva et al., 2007, Khemchandani, 2008). TWSVM is a binary classifier that attempts to generate two nonparallel hyperplanes such that each plane is closer to its own class and as far as possible from the other class. Thus, TWSVM comprises a pair of quadratic programming problems (QPPs) such that, in each QPP, the objective function corresponds to a particular class and the constraints are determined by the patterns of the other class. TWSVM therefore solves two smaller-sized QPPs, which makes it almost four times faster than a standard SVM. Many extensions of TWSVM have been proposed, as discussed in the survey paper by Tian and Qi (2014). Shao, Zhang, Wang, and Deng (2011) proposed the twin bounded support vector machine (TBSVM), which tries to minimize the structural risk by adding a regularization term, with the idea of maximizing the margin. Similar to TBSVM, Tian, Ju, Qi, and Shi (2014) proposed the improved TWSVM (ITWSVM). Recently, Khemchandani, Goyal, and Chandra (2016) proposed TWSVR for regression problems using the TWSVM framework.
Schölkopf et al. proposed a new support vector machine (ν-SVM) for classification and regression (Schölkopf et al., 1999, Schölkopf et al., 2000), which is a modification of SVM. ν-SVM introduced an a priori chosen parameter ν that determines an upper bound on the training error and a lower bound on the number of support vectors (SVs). Recently, Peng extended the concept of ν-SVM to TWSVM and proposed ν-TWSVM (Peng, 2010). In TWSVM, the patterns of one class are kept at least a unit distance away from the hyperplane of the other class; this might increase the number of SVs, which may lead to poor generalization ability. The parameter ν in ν-TWSVM controls the bounds on the number of SVs, similar to ν-SVM, and the unit distance of TWSVM is replaced by a variable ρ, which is optimized in the primal problem involved therein. Further, ν-TWSVM can be interpreted as a pair of minimum generalized Mahalanobis-norm problems on two reduced convex hulls (RCHs).
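The role of ν can be illustrated with scikit-learn's NuSVC, which implements Schölkopf et al.'s ν-SVM (this is an illustrative sketch, not code from the paper; the dataset and parameter values are our own choices):

```python
# Sketch of the nu parameter's role in Schölkopf et al.'s nu-SVM:
# nu upper-bounds the fraction of margin errors and lower-bounds the
# fraction of support vectors in the solution.
from sklearn.datasets import make_blobs
from sklearn.svm import NuSVC

X, y = make_blobs(n_samples=200, centers=2, random_state=0)
for nu in (0.1, 0.5):
    clf = NuSVC(nu=nu, kernel="linear").fit(X, y)
    frac_sv = clf.support_.size / len(X)
    print(f"nu={nu}: fraction of support vectors = {frac_sv:.2f}")
    assert frac_sv >= nu - 1e-9  # nu lower-bounds the SV fraction
```

Larger ν forces more training points to become support vectors, trading a wider margin for more margin errors.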
In this paper, we propose Iν-TWSVM, which generates two nonparallel hyperplanes; its key features are listed below:
- Iν-TWSVM solves a smaller-sized QPP and an unconstrained minimization problem (UMP), instead of the pair of QPPs solved by ν-TWSVM and other TWSVM-based classifiers. Therefore, the two implementations of Iν-TWSVM have more efficient training times than ν-TWSVM.
- For the linear case, the hyperplane for one of the twin problems of Iν-TWSVM is obtained by solving a UMP in the feature dimension, while ν-TWSVM solves a QPP whose constraints are defined by the number of data points in the other class. Hence, Iν-TWSVM solves a simpler optimization problem.
- Unlike ν-TWSVM and TWSVM, the formulation of Iν-TWSVM is based on the SRM principle and hence has good generalization ability, with the added advantage that Iν-TWSVM is much faster than both of these classifiers.
- Iν-TWSVM uses a single parameter ν to control the bounds on the training error and the number of support vectors, whereas ν-TWSVM uses two such parameters, ν1 and ν2.
- In Iν-TWSVM (Fast), we modify the first problem of Iν-TWSVM into the minimization of a unimodal function, for which line-search methods can be used; this avoids solving the QPP altogether. The other problem is formulated as a UMP, as in Iν-TWSVM. Hence, Iν-TWSVM (Fast) is a faster version of our proposed work and is experimentally shown to be more time-efficient than Iν-TWSVM and ν-TWSVM.
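The line-search idea behind the Fast variant can be sketched generically: once a problem is reduced to minimizing a unimodal function of a single variable, a derivative-free method such as golden-section search locates the minimizer without solving any QPP. The function below is our own illustration, not the paper's implementation:

```python
def golden_section_min(f, a, b, tol=1e-8):
    """Minimize a unimodal function f on [a, b] by golden-section search."""
    invphi = (5 ** 0.5 - 1) / 2              # 1/phi, about 0.618
    c, d = b - invphi * (b - a), a + invphi * (b - a)
    while b - a > tol:
        if f(c) < f(d):                      # minimizer lies in [a, d]
            b, d = d, c
            c = b - invphi * (b - a)
        else:                                # minimizer lies in [c, b]
            a, c = c, d
            d = a + invphi * (b - a)
    return (a + b) / 2

# Example: the minimizer of (x - 2)^2 + 1 on [0, 5] is x = 2
x_star = golden_section_min(lambda x: (x - 2) ** 2 + 1, 0.0, 5.0)
```

Each iteration shrinks the bracketing interval by a constant factor, so the cost is logarithmic in the required precision, much cheaper than a QPP solve.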
The paper is organized as follows: Section 2 gives a brief description of TWSVM and ν-TWSVM and explains the notation used in the rest of the paper. Section 3 introduces "Improvements on ν-Twin Support Vector Machine", followed by experimental results on benchmark datasets in Section 4. The performance of Iν-TWSVM for pixel classification is also investigated in that section. Finally, the paper is concluded in Section 5.
Twin support vector machines (TWSVM)
TWSVM (Jayadeva et al., 2007, Khemchandani, 2008) is a binary classifier that determines two nonparallel hyperplanes by solving two related SVM-type problems, each of which is smaller than the problem in a conventional SVM. The nonparallel hyperplanes of TWSVM are given by xᵀw1 + b1 = 0 and xᵀw2 + b2 = 0.
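The TWSVM decision rule assigns a new point to the class whose hyperplane is nearer in perpendicular distance. A minimal sketch (the function and variable names are ours, not the paper's):

```python
import numpy as np

def twsvm_predict(X, w1, b1, w2, b2):
    """Assign each row of X to class +1 or -1 by the nearer TWSVM plane."""
    d1 = np.abs(X @ w1 + b1) / np.linalg.norm(w1)   # distance to plane 1
    d2 = np.abs(X @ w2 + b2) / np.linalg.norm(w2)   # distance to plane 2
    return np.where(d1 <= d2, 1, -1)                # nearer plane wins

# Toy planes: x1 = 0 for class +1 and x1 = 4 for class -1
w1, b1 = np.array([1.0, 0.0]), 0.0
w2, b2 = np.array([1.0, 0.0]), -4.0
preds = twsvm_predict(np.array([[1.0, 0.0], [3.5, 2.0]]), w1, b1, w2, b2)
print(preds)  # [ 1 -1]
```

The point (1, 0) lies one unit from the first plane and three from the second, so it is labeled +1; the point (3.5, 2) is nearer the second plane and is labeled −1.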
The formulation of the pair of QPPs in TWSVM is similar to that of a typical SVM, except that not all patterns appear in the constraints of either problem at the same time. Let the data points belonging to
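For reference, the standard TWSVM pair from Jayadeva et al. (2007) can be written as follows, where A and B are the matrices whose rows are the patterns of classes +1 and −1, e₁ and e₂ are vectors of ones of appropriate dimension, and c₁, c₂ > 0 are trade-off parameters:

```latex
\min_{w^{(1)},\, b^{(1)},\, q}\;\; \tfrac{1}{2}\,\bigl\|A w^{(1)} + e_1 b^{(1)}\bigr\|^2 + c_1\, e_2^{\top} q
\quad \text{s.t.} \quad -\bigl(B w^{(1)} + e_2 b^{(1)}\bigr) + q \ge e_2,\;\; q \ge 0,

\min_{w^{(2)},\, b^{(2)},\, q}\;\; \tfrac{1}{2}\,\bigl\|B w^{(2)} + e_2 b^{(2)}\bigr\|^2 + c_2\, e_1^{\top} q
\quad \text{s.t.} \quad \bigl(A w^{(2)} + e_1 b^{(2)}\bigr) + q \ge e_1,\;\; q \ge 0.
```

Each objective keeps one class close to its own plane, while the constraints push the other class at least a unit distance away, with slack q penalized linearly.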
Improvements on ν-twin support vector machine
In this section, we propose two novel classifiers, "Improvements on ν-Twin Support Vector Machine", namely Iν-TWSVM and Iν-TWSVM (Fast), developed on the lines of TWSVM (Jayadeva et al., 2007) and further based on ν-TWSVM (Peng, 2010). (From this point onwards, we refer to the first implementation as Iν-TWSVM and the second as Iν-TWSVM (Fast).) Unlike TWSVM, Iν-TWSVM solves a smaller-sized QPP and a UMP, as compared to solving a related pair of QPPs, for obtaining the two nonparallel hyperplanes. The
Numerical experiments
To evaluate the performance of the proposed work, we compare Iν-TWSVM and Iν-TWSVM (Fast) with TBSVM (Shao et al., 2011) and ν-TWSVM (Peng, 2010). Performance is measured in terms of the classification accuracy and computational efficiency of these algorithms. The experiments are performed in MATLAB version 8.0 under a Microsoft Windows environment on a machine with a 3.40 GHz CPU and 16 GB RAM.
Conclusions
In this paper, we have proposed two novel classifiers as "Improvements on ν-Twin Support Vector Machine: Iν-TWSVM and Iν-TWSVM (Fast)", which improve the learning time of twin support vector machine (TWSVM)-based classifiers, specifically ν-TWSVM. In Iν-TWSVM, we solve a smaller-sized quadratic programming problem (QPP) and an unconstrained minimization problem (UMP), whereas TWSVM-based classifiers solve a pair of QPPs. Hence, Iν-TWSVM is computationally faster than TBSVM and ν-TWSVM and has
Acknowledgments
The authors would like to thank the editor and anonymous reviewers whose valuable comments and feedback have helped us to improve the content and presentation of the paper.
References (37)
- Data clustering: 50 years beyond K-means. Pattern Recognition Letters (2010).
- et al. A customized Gabor filter for unsupervised color image segmentation. Image and Vision Computing (2009).
- et al. TWSVR: Regression via twin support vector machine. Neural Networks (2016).
- A ν-twin support vector machine (ν-TSVM) classifier and its geometric algorithms. Information Sciences (2010).
- et al. Laplacian twin support vector machine for semi-supervised classification. Neural Networks (2012).
- Arbelaez, P., Fowlkes, C., & Martin, D. (2007). The Berkeley segmentation dataset and benchmark. See...
- Blake, C., & Merz, C. J. (1998). UCI repository of machine learning databases. URL:...
- A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery (1998).
- et al. Numerical optimization with applications (2009).
- et al. Support-vector networks. Machine Learning (1995).
- Statistical comparisons of classifiers over multiple data sets. The Journal of Machine Learning Research.
- Pattern classification.
- Support vector machines for classification and regression.
- A survey of outlier detection methodologies. Artificial Intelligence Review.
- A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics.
- Twin support vector machines for pattern classification. IEEE Transactions on Pattern Analysis and Machine Intelligence.
- Lognormal distributions.
- Mathematical programming applications in machine learning.