Coupled compressed sensing inspired sparse spatial-spectral LSSVM for hyperspectral image classification

doi:10.1016/j.knosys.2015.01.006

Knowledge-Based Systems

Volume 79, May 2015, Pages 80-89

Abstract

Inspired by the recently developed Compressed Sensing (CS) theory, this study advances a sparse Spatial-Spectral Least Square Support Vector Machine (SS-LSSVM) for Hyperspectral Image Classification (HIC). In our work, hyperspectral pixels are redefined in both the spectral domain and the spatial domain by adaptively selecting their spatial neighbors according to the edge-map. The weighted sum of spectral and spatial features is used to construct an SS-LSSVM model. The SS-LSSVM is regarded as a topology comprising a large number of support vectors, and a sparse SS-LSSVM is derived from a Coupled Compressed Sensing (CCS) of this topology. The sparsity of our proposed CCS inspired Sparse SS-LSSVM (CCS4-LSSVM) improves the classification accuracy of SS-LSSVM for HIC. Furthermore, by combining spectral information with adaptively extracted spatial information, CCS4-LSSVM can not only avoid the speckle-like misclassification of the original LS-SVM but also reduce the influence of noisy pixels. The performance of our proposed method is evaluated on several hyperspectral image datasets, and the results show that it can achieve higher classification accuracy than the Spatial-Spectral SVM (SS-SVM) and the Spatial-Spectral LSSVM (SS-LSSVM).

Introduction

The rich information contained in hyperspectral data can be used to discriminate between classes of the same species, which has made Hyperspectral Image Classification (HIC) very attractive in recent years. Among the supervised HIC approaches, the Support Vector Machine (SVM) obtains excellent classification results, because it makes a tradeoff between bias and variance by using a compact topology established by a small number of carefully selected support vectors [1]. SVM can handle large input spaces efficiently, work with a relatively small number of labeled training samples and deal with noisy samples in a robust way [2], [3]. Various studies have been presented to further improve the performance of SVM in recent years. First, the multiple-output support vector regression (MSVR) model was designed to address multivariate application problems [4], [5]. A Fuzzy SVM model has also been proposed to deal efficiently with outliers or noise in classification [6]. Furthermore, because of the importance of the parameters in SVM, many effective parameter optimization methods for SVM have been proposed. The most popular among them is grid search, which exhaustively searches the parameter space to minimize the validation error [7], [8]. Besides grid search, hybrid algorithms such as the hybrid comprehensive learning particle swarm optimizer with Broyden–Fletcher–Goldfarb–Shanno (CLPSO–BFGS) algorithm [9], memetic algorithms such as the particle swarm optimization and pattern search (PSO–PS) based memetic algorithm [10], and firefly algorithm based methods [4], [11] have been proposed to tune the parameters of SVM [9], [10], support vector regression (SVR) [11] and MSVR [4], respectively. These methods greatly improve the stability of the parameter settings, so SVMs tuned by them can obtain good generalization performance. Therefore, SVM is well suited for HIC and has demonstrated excellent accuracy and robustness in recent years [12], [13]. Furthermore, SVM methods for HIC have been used in many areas, such as prostate cancer detection [14], land cover classification [15] and plant disease detection [16], with good results.

Although the generalization ability of SVM is very good, its computational complexity is high, because it solves a quadratic programming problem with a set of inequality constraints. The Least Square Support Vector Machine (LS-SVM) is a modified version of the standard SVM that replaces the inequality constraints with equality constraints when solving the quadratic programming problem [17]. Therefore, LS-SVM is more computationally attractive than SVM and has been applied to HIC [18], [19]. However, LS-SVM does not perform model selection and loses the sparseness of SVM, so the storage cost, computation cost and generalization error all increase when it is used for HIC. Even though many pruning algorithms have been developed to impose sparseness on the original LS-SVM [20], [21], these approaches need to iteratively omit training samples and retrain the reduced LS-SVM. Several works aim to alleviate this problem by imposing sparsity on LS-SVM [22], [23]. These techniques are capable of iteratively constructing a sparse LS-SVM during training. However, the iterative training cost is prohibitive. Moreover, the convergence of these algorithms depends on the success of the optimization algorithms.
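To make the computational difference concrete, the following is a minimal sketch of LS-SVM training for a binary problem. It assumes the standard LS-SVM formulation, in which the equality constraints reduce training to one linear system in the bias `b` and the multipliers `alpha`. The function names, the RBF kernel and the values of `gamma` and `sigma` are illustrative assumptions, not settings taken from this paper.

```python
import numpy as np

def rbf_kernel(A, B, sigma=1.0):
    """RBF kernel matrix between the rows of A and the rows of B."""
    sq = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * sigma**2))

def lssvm_train(X, y, gamma=10.0, sigma=1.0):
    """Solve the (n+1) x (n+1) LS-SVM linear system for labels y in {-1, +1}."""
    n = X.shape[0]
    omega = np.outer(y, y) * rbf_kernel(X, X, sigma)
    A = np.zeros((n + 1, n + 1))
    A[0, 1:] = y                              # constraint: sum_i alpha_i * y_i = 0
    A[1:, 0] = y
    A[1:, 1:] = omega + np.eye(n) / gamma     # regularized kernel block
    rhs = np.concatenate(([0.0], np.ones(n)))
    sol = np.linalg.solve(A, rhs)             # one linear solve, no QP
    return sol[0], sol[1:]                    # bias b, multipliers alpha

def lssvm_predict(X_train, y_train, b, alpha, X_new, sigma=1.0):
    """Sign of the kernel expansion over all training samples."""
    K = rbf_kernel(X_new, X_train, sigma)
    return np.sign(K @ (alpha * y_train) + b)

if __name__ == "__main__":
    # Hypothetical toy data: two well-separated 2-D clusters.
    rng = np.random.default_rng(0)
    X = np.vstack([rng.normal(-2.0, 0.5, (20, 2)), rng.normal(2.0, 0.5, (20, 2))])
    y = np.concatenate([-np.ones(20), np.ones(20)])
    b, alpha = lssvm_train(X, y)
    acc = np.mean(lssvm_predict(X, y, b, alpha, X) == y)
    print("training accuracy:", acc)
    print("multipliers above 1e-8 in magnitude:", int(np.sum(np.abs(alpha) > 1e-8)))
```

In this formulation every training sample generally receives a nonzero multiplier, so the prediction sums over the whole training set. This is the loss of sparseness described above.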

In this paper, inspired by the recently developed Compressed Sensing (CS) theory [24], [25], a compact LS-SVM with a sparse topology is established to realize accurate classification of hyperspectral images. This sparse model can be considered a low-dimensional measurement of the original LS-SVM. The sparse topology is obtained by learning a compressive measurement matrix from the training data and then removing the useless support vectors by solving a Multiple Measurement Vector (MMV) optimization problem via CS techniques. On the other hand, neighboring hyperspectral pixels are likely to belong to the same class. That is, there is spatial homogeneity in the labels of hyperspectral images, which is beneficial for classifying them. Therefore, incorporating spatial information into the classification can improve the classification accuracy [26], [27], [28]. In [26], [27], [28], the set of spatial neighbors of a given hyperspectral pixel is defined as a small window centered on that pixel. However, for pixels lying on the edges between classes, this spatial homogeneity assumption is invalid. Even a small window contains some noisy pixels that do not belong to the same class as the center pixel. Taking these noisy pixels as spatial neighbors of the center pixel introduces noisy information into the hyperspectral pixels and decreases the classification accuracy. In our work, the LS-SVM is regularized by imposing a locally adaptive spatial homogeneity assumption on hyperspectral images. Each hyperspectral pixel is redefined in both the spectral domain and the spatial domain by adaptively selecting its spatial neighbors according to the edge-map. The weighted sum of spectral and spatial features is used to construct a Spatial-Spectral Least Square Support Vector Machine (SS-LSSVM) model in this study. A Coupled Compressed Sensing inspired Sparse SS-LSSVM (CCS4-LSSVM) for HIC is then advanced. By combining spectral information with adaptively extracted spatial information, CCS4-LSSVM can not only avoid the speckle-like misclassification of the original LS-SVM but also reduce the influence of noisy pixels.
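The idea of redefining each pixel from its spectrum and its edge-aware neighborhood can be sketched as follows. This is only an illustration under stated assumptions: the neighbor selection rule used here (drop window pixels that the edge-map flags as edges and always keep the center pixel), the mean as the spatial feature, and the names `spatial_spectral_features`, `half_window` and `mu` are hypothetical choices. The excerpt does not give the exact rule used by the method.

```python
import numpy as np

def spatial_spectral_features(cube, edge_map, half_window=1, mu=0.5):
    """Blend each pixel's spectrum with the mean spectrum of its kept neighbors.

    cube     : (H, W, B) hyperspectral image
    edge_map : (H, W) boolean array, True where an edge was detected
    mu       : weight of the spectral part; (1 - mu) weights the spatial part
    """
    cube = np.asarray(cube, dtype=float)
    edge_map = np.asarray(edge_map, dtype=bool)
    H, W, B = cube.shape
    out = np.empty((H, W, B), dtype=float)
    for r in range(H):
        for c in range(W):
            # Clip the window at the image border.
            r0, r1 = max(r - half_window, 0), min(r + half_window + 1, H)
            c0, c1 = max(c - half_window, 0), min(c + half_window + 1, W)
            patch = cube[r0:r1, c0:c1].reshape(-1, B)
            # Assumed rule: neighbors flagged as edges are excluded.
            keep = ~edge_map[r0:r1, c0:c1].reshape(-1)
            # The center pixel is always kept, so the mean is never empty.
            keep[(r - r0) * (c1 - c0) + (c - c0)] = True
            spatial = patch[keep].mean(axis=0)
            out[r, c] = mu * cube[r, c] + (1.0 - mu) * spatial
    return out

if __name__ == "__main__":
    # Hypothetical 1 x 3 single-band image whose last pixel lies on an edge.
    cube = np.array([[[0.0], [10.0], [100.0]]])
    edges = np.array([[False, False, True]])
    print(spatial_spectral_features(cube, edges)[0, :, 0])
```

With `mu=1.0` the function returns the original spectra unchanged, and smaller values of `mu` give more weight to the neighborhood. In this sketch the edge pixel is kept out of its neighbors' averages, which is the effect the adaptive selection is meant to achieve.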

Compared with the available HIC approaches, our proposed CCS4-LSSVM has the following characteristics: (1) CCS4-LSSVM is more computationally attractive than SVM, because it solves a linear system instead of a quadratic programming problem. (2) The performance of CCS4-LSSVM for HIC is comparable with that of SVM due to its sparse topology. (3) CCS4-LSSVM is constructed via a one-step strategy, and the designed compressive measurement matrix, coupled with the dictionary matrix, guarantees high incoherence with the dictionary, which avoids the iterative selection of important support vectors and makes rapid and high-accuracy HIC possible. (4) By combining spectral information with the adaptively extracted spatial neighbors, CCS4-LSSVM can avoid the influence of noisy pixels and the speckle-like misclassification of the original LS-SVM. Experiments are conducted on several hyperspectral datasets to compare the proposed method with its counterparts, and the results show that it can achieve higher classification accuracy than the Spatial-Spectral SVM (SS-SVM) and the Spatial-Spectral LSSVM (SS-LSSVM).

The remainder of the paper is organized as follows: Section 2 describes the proposed CCS4-LSSVM. In Section 3, experiments are conducted to investigate the performance of our proposed method. A conclusion is presented in Section 4.

Section snippets

Coupled Compressed Sensing inspired Sparse Spatial-Spectral Least Square Support Vector Machine (CCS4-LSSVM) for HIC

CS provides a new information acquisition and processing framework that allows us to reconstruct sparse or compressible signals from a small set of measurements. Assume that a set of signals $X = [x_{1}, x_{2}, \dots, x_{d}] \in \mathbb{R}^{N \times d}$ is compressible under a dictionary $\Psi \in \mathbb{R}^{N \times N}$, that is, $X = \Psi \Theta$, where $\|\Theta\|_{\mathrm{row},0} = K$ is the number of rows of $\Theta$ that contain nonzero elements. Many applications matching the properties of CS involve distributed acquisition of multiple correlated signals. The multiple signal case where all l involved signals are
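The row-sparse model above can be illustrated with a small recovery experiment. The sketch below uses Simultaneous Orthogonal Matching Pursuit (SOMP), a standard greedy MMV solver chosen here only for illustration; the excerpt does not state which MMV algorithm the method uses. The random Gaussian measurement matrix `Phi` is likewise an assumption and does not reproduce the learned, coupled measurement matrix of CCS4-LSSVM.

```python
import numpy as np

def somp(A, Y, K):
    """Greedy MMV recovery: find a K-row-sparse Theta with A @ Theta close to Y."""
    n_atoms = A.shape[1]
    residual = Y.copy()
    support = []
    coef = np.zeros((0, Y.shape[1]))
    for _ in range(K):
        # Score each atom by its joint correlation with all residual columns.
        scores = np.linalg.norm(A.T @ residual, axis=1)
        if support:
            scores[support] = -np.inf        # never pick the same atom twice
        support.append(int(np.argmax(scores)))
        # Least-squares refit on the current support, shared by all columns.
        coef, *_ = np.linalg.lstsq(A[:, support], Y, rcond=None)
        residual = Y - A[:, support] @ coef
    Theta = np.zeros((n_atoms, Y.shape[1]))
    Theta[support] = coef
    return Theta, sorted(support)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    N, M, d, K = 50, 40, 8, 3                             # hypothetical sizes
    Psi, _ = np.linalg.qr(rng.standard_normal((N, N)))    # orthonormal dictionary
    Phi = rng.standard_normal((M, N)) / np.sqrt(M)        # assumed random measurements
    rows = sorted(rng.choice(N, size=K, replace=False).tolist())
    Theta_true = np.zeros((N, d))
    Theta_true[rows] = rng.choice([-1.0, 1.0], size=(K, d)) * (1.0 + rng.random((K, d)))
    Y = Phi @ Psi @ Theta_true                            # M < N measurements of X = Psi @ Theta
    Theta_hat, found = somp(Phi @ Psi, Y, K)
    print("true rows:", rows, "recovered rows:", found)
```

All d columns share one support, so each greedy step pools evidence across the signals. That pooling is what distinguishes the MMV setting from solving d independent single-vector problems.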

Experimental results and discussions

In this section, the performance of our proposed CCS4-LSSVM is evaluated on three hyperspectral image datasets downloaded from http://www.ehu.es/ccwintco/index.php/Hyperspectral_Remote_Sensing_Scenes.

Conclusions

In order to achieve accurate and rapid classification of hyperspectral images with a small number of training samples, a Coupled Compressed Sensing inspired Sparse Spatial-Spectral Least Squares Support Vector Machine (CCS4-LSSVM) is proposed based on the recently developed CS theory. By imposing a sparsity assumption on the support vectors, CCS4-LSSVM can derive a sparse topology from a coupled compressive matrix stemming from the dictionary and obtained by an MMV optimization algorithm. Experimental

Acknowledgements

This work was supported by the National Basic Research Program of China (973 Program) under Grant No. 2013CB329402, the Fundamental Research Funds for the Central Universities BDY021429, the Huawei Innovation Research Program, the Kunshan innovation institute of Xidian University, National Science Foundation of China under Grant Nos. 91438103, 91438201, 61072108, 61173090, 51207002, NCET-10-668, the Foreign Scholars in University Research and Teaching Programs (No. B07048), the fundamental

References (40)

T. Xiong et al., Multiple-output support vector regression with a firefly algorithm for interval-valued stock price index forecasting, Knowl.-Based Syst. (2014)
Y. Bao et al., Multi-step-ahead time series prediction using multiple-output support vector regression, Neurocomputing (2014)
S. Li et al., Tuning SVM parameters by using a hybrid CLPSO–BFGS algorithm, Neurocomputing (2010)
Y. Bao et al., A PSO and pattern search based memetic algorithm for SVMs parameters optimization, Neurocomputing (2013)
T. Kavzoglu et al., A kernel functions analysis for support vector machines for land cover classification, Int. J. Appl. Earth Obs. (2009)
T. Rumpf et al., Early detection and classification of plant diseases with support vector machines based on hyperspectral reflectance, Comput. Electron. Agric. (2010)
Y.C. Li et al., Improved sparse least-squares support vector machine, Neurocomputing (2006)
T. Xiong et al., Beyond one-step-ahead forecasting: evaluation of alternative multi-step-ahead forecasting models for crude oil price, Energy Econ. (2013)
T. Xiong et al., Does restraining end effect matter in EMD-based modeling framework for time series prediction? Some experimental evidences, Neurocomputing (2014)
T. Xiong et al., Interval forecasting of electricity demand: a novel bivariate EMD-based support vector regression modeling framework, Int. J. Electr. Power Energy Syst. (2014)
F. Melgani et al., Classification of hyperspectral remote sensing images with support vector machines, IEEE Trans. Geosci. Remote Sens. (2004)
N. Cristianini et al., An Introduction to Support Vector Machines (2000)
G. Camps-Valls et al., Robust support vector method for hyperspectral data classification and knowledge discovery, IEEE Trans. Geosci. Remote Sens. (2004)
X. Yang et al., A kernel fuzzy c-means clustering-based fuzzy support vector machine algorithm for classification problems with outliers or noises, IEEE Trans. Fuzzy Syst. (2011)
C.W. Hsu, C.C. Chang, C.J. Lin, A Practical Guide to Support Vector Classification....
G. Moore et al., Model selection for primal SVM, Mach. Learn. (2011)
Z. Hu et al., Electricity load forecasting using support vector regression with memetic algorithms, Sci. World J. (2013)
J. Plaza et al., Multi-channel morphological profiles for classification of hyperspectral images using support vector machines, Sensors (2009)
C.H. Li et al., A spatial-contextual support vector machine for remotely sensed image classification, IEEE Trans. Geosci. Remote Sens. (2012)
H. Akbari et al., Hyperspectral imaging and quantitative analysis for prostate cancer detection, J. Biomed. Opt. (2012)
