Hyperspectral remote sensing image classification based on tighter random projection with minimal intra-class variance algorithm

doi:10.1016/j.patcog.2020.107635

Pattern Recognition

Volume 111, March 2021, 107635

https://doi.org/10.1016/j.patcog.2020.107635 Get rights and content

Highlights

•
A significant improvement on Gaussian dimensional bounds for RP is proposed with detailed proved.
•
The number of spectral vectors of the proposed algorithm is larger than that of the traditional RP.
•
Considering the class separability, the TRP-MIV matrix with sample assistance provides a promising avenue for dimensionality reduction of hyperspectral remote sensing image.
•
It is the first application of the TRP-MIV algorithm for hyperspectral remote sensing image classification.

Abstract

Aiming at solving the problem of image size limiting in the traditional Random Projection (RP) algorithm, a novel Tighter Random Projection (TRP), which combines the scheme with Minimal Intra-class Variance (TRP-MIV) for hyperspectral remote sensing image classification is proposed. First, a new tighter dimensional boundary for expanding image size with the TRP-MIV matrix selected by multiple sampling for improving the class separability is defined to reduce dimension. Then the proposed algorithm is implemented, which integrates TRP-MIV for dimensionality reduction and Minimum Distance (MD) classifier for image classification. Finally, the image size and dimensionality reduction are evaluated by the number of spectral pixels under different theorems, and the spectral difference before and after dimensionality reduction, respectively. Classification performance is evaluated by kappa coefficient, Overall Accuracy (OA), Average Accuracy (AA), Average Precision Rate (APR) and running time. Classification results are obtained from the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) scanner and the Reflective Optics System Imaging Spectrometer (ROSIS) scanner, which indicate that the proposed algorithm is efficient and promising.

Introduction

Hyperspectral remote sensing image classification is the process of dividing hyperspectral remote sensing image into a set of adjacent homogeneous regions and determining their specific classes [1]. Because of the consecutive and extensive spectral bands, hyperspectral remote sensing image classification needs to face many problems, such as the curse of dimensionality [2], serious time-consuming [3] and computational cost [4]. At the same time, the variance within the same class is large in hyperspectral remote sensing image, which easily results in the small class separability [5], [6]. To solve these problems, dimensionality reduction that guarantees the large class separability is the basic and key technology for hyperspectral remote sensing image classification. Therefore, it is very necessary to reduce the dimension of the hyperspectral remote sensing image first, and then the classification operation is performed on the low dimensional image [7].

It is difficult to find out a few dimensions that are actually necessary for hyperspectral remote sensing image classification. The main algorithms currently used in dimensionality reduction include Principal Component Analysis (PCA) [8], Independent Component Analysis (ICA) [9], Linear Discriminant Analysis (LDA) [10], Locally Linear Embedding (LLE) [11], and Random Projection (RP) [12]. Specifically, PCA can reduce the image to any dimension, while it takes a lot of time to calculate the covariance matrix [13]. ICA can be used for parallel computing, which can greatly reduce the time of calculation. However, when the number of features is larger than the dimension of original data, this algorithm will be difficult in optimize. That is, it will face the problem of long training time [14]. For LDA, although prior knowledge of classes can be used, it still has the phenomenon of over fitting [15]. LLE has relatively small computational complexity, but the algorithm is sensitive to the selection of nearest neighbor samples. Namely, the number of nearest neighbor has great influence on the final result of dimensionality reductions [16]. RP, an emergent dimensionality reduction technology, has been used in many fields, such as biology [17], environmental monitoring [18], pattern recognition [19], and disaster monitoring [20]. Due to its computational tractability compared to other algorithms, RP is a valuable algorithm for dimensionality reduction of hyperspectral remote sensing image, which provides a feasible mapping of the Johnson-Lindenstrauss (JL) lemma [21]. However, because the relationship between the original dimension and the number of spectral pixels is exponential [22], RP can only be applied to a small size hyperspectral remote sensing image. Moreover, due to the highly randomness of the traditional RP, different low dimensional images will be produced by different RP matrices during the projection process, which may result in different classification results [23], [24].

Due to the importance and difficulty of classification problems, many researchers have been attracted to do research in this area, and many classification algorithms for dimensionality reduction have been proposed. It can be divided into supervised classification algorithm [25] and unsupervised classification algorithm [26] according to the condition of prior information. The representative supervised classification algorithms include Minimum Distance (MD) [27], Convolution Neural Network (CNN) [28], and Support Vector Machine (SVM) [29]. Su et al. [30] proposed a land parcel extraction algorithm based on training data and MD classifier, which combines PCA for extracting the first principal component of the original satellite image, MD for pre-classification, and watershed algorithm for subdividing. Be careful that the result of each step determines whether the algorithm can ultimately achieve the ideal extraction result. Li et al. [31] proposed a new Hyperspectral image Reconstruction with deep CNN (HRCNN) algorithm based on feature enhancement.The algorithm combines the CNN algorithm for image reconstruction and the Extreme Learning Machine (ELM) for classification. When there is a big difference between the test set and the training set, even if the parameters are adjusted, it is difficult to improve the adaptability of the CNN model. Porta et al. [32] studied a hyperspectral image classification algorithm based on Compressed Sensing (CS) [33], which utilizes CS with the Restricted Isometry Property (RIP) for compressing the image and SVM algorithm for classification. The difference between CS and RP is that CS limits the length of data points before and after compressing, while RP controls the distance between data points before and after projecting. So, RP is more conducive to the identification of similarity. The typical unsupervised classification algorithms include K-means clustering algorithm [34] and Fuzzy C-Means (FCM) clustering algorithm [35]. Among them, cluster ensemble algorithms based on RP and fuzzy or probabilistic clustering algorithms are the most interesting. Numerous models have been proposed to this end [36], [37]. The idea behind these algorithms is that RP is proceeded to generate multiple projection results in a low dimensional space. Then, each projection result is clustered by the fuzzy or probabilistic clustering algorithm to create the membership matrix. Finally, all membership matrices are integrated to generate the final clustering result by various ways. Popescu et al. [38] proposed a Random Projection Fuzzy C-Means algorithm (RPFCM) for big data clustering, which concatenates all membership matrices to generate the concatenated matrix. Then the similarity matrix is defined by calculating the product of the concatenated matrix. When applied to large-scale image, the algorithm is time-consuming and needs more storage places for placing the similarity matrix. Therefore, the image size will be greatly limited for ordinary software. To avoid the product operation, Ye et al. [39] studied a FCM clustering algorithm with RP to extract the feature, which is the spectral clustering on all membership matrices. It is more effective, robust and suitable for a wider range of geometric data sets. When the original dimension is not large enough, the number of spectral pixels will be greatly constrained to achieve the purpose of dimensionality reduction. To reduce time, Rathore et al. [40] proposed a novel Cumulative Agreement Fuzzy C-Means algorithm (CAFCM), which uses the cluster validity indices to sort all membership matrices and accumulates aggregates to obtain the final clustering result. However, this algorithm cannot get an ideal result for bad projection results. Besides, because the relationship between the number of spectral pixels and the original dimension in the traditional RP algorithm is positively correlated, the image size is greatly restricted for hyperspectral remote sensing image with a low original dimension.

To solve the problem of image size, this paper proposes a novel Tighter Random Projection (TRP) with Minimal Intra-class Variance (TRP-MIV) algorithm for dimensionality reduction of hyperspectral remote sensing image. That is a different version of RP with tighter boundary and wider image size. Meanwhile, TRP-MIV algorithm is proposed to select the TRP-MIV matrix with the help of samples based on the idea of maximizing the class separability. After reducing the dimensions of samples and testing images, MD classifier is devised to classify by measuring similarity between low dimensional testing images and each class feature center of low dimensional samples. Experimental results show that the proposed dimensionality reduction algorithm can be applied to a larger image size and maintain greater class separability, which can effectively improve the subsequent accuracy of hyperspectral remote sensing image classification. This paper is organized in following. The traditional RP algorithm and the proposed algorithm are provided in Sections 2 and 3, respectively. In Section 4, the experimental results and discussion are outlined. Section 5 introduces conclusions.

Section snippets

Traditional random projection

Hyperspectral remote sensing images have the capacity of detecting more detailed and accurate earth surface information. However, due to its high spectral resolution, it is necessary for hyperspectral remote sensing image classification to reduce the original dimension. As a dimensionality reduction algorithm, RP will effectively maintain pair wise distances based on JL lemma, which has attracted a growing number of attentions of researchers. What surprised us is that RP is easier to work and

The proposed algorithm

The traditional RP algorithm has main problem in the application of hyperspectral remote sensing image dimensionality reduction. For dimensionality reduction of hyperspectral remote sensing image, the maximum amount of data, that is, the maximum number of vectors is determined by the original dimension according to Eq. (4). This means that the image size is limited by the number of the original dimension. Take into account this problem, a new TRP-MIV scheme is proposed in this paper.

Experimental results and discussion

To validate the feasibility and effectiveness of the proposed algorithm, classification experiments in real hyperspectral remote sensing images were performed on a PC with Intel (R) Core (TM) i5-4460, 3.20GHz and 8GB memory using MATLAB R2016a. The experimental images are captured from the Indian Pines scene in North-western Indiana of the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) sensor with the wavelength range from 0.4  ×  10⁶ to 2.5  ×  10⁶ meters, the Pavia Centre scene in

Conclusions

RP, which aims to reduce the number of bands by linear projection of the original data using random projection matrix, has been a disciplinary field attracting a lot of researchers over the recent years. In this stage, a systematic review and summary on existing RP algorithms can promote the further development of the research area. Motivated by such, our work focuses on the following points. (1) A significant improvement on Gaussian dimensional bounds for RP is proposed with detailed proved.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work was supported by Department of Science and Technology of Liaoning Province of China (LJ2019JL001). The authors would like to thank Prof. P. Gamba for providing the ROSIS Pavia data, Prof. D. Landgrebe for making the AVIRIS Indian Pines hyperspectral data set, and Dr. L. Johnson and Dr. J. A. Gualtieri for providing the AVIRIS Salinas data set used in our experiments.

Zhao Quanhua received her M.Sc. degree in 2004 and Ph.D. degree in 2009 both from Liaoning Technical University. Now she is a professor and doctoral supervisor in Liaoning Technical University. Her main research interests include modeling and analysis of remote sensing image, and application of random geometry.

References (41)

Y. Tarabalka et al.
Segmentation and classification of hyperspectral images using watershed transformation
Pattern Recognit.
(2010)
G. Lixin et al.
Segmented minimum noise fraction transformation for efficient feature extraction of hyperspectral images
Pattern Recognit.
(2015)
Q. Du et al.
Implementation of real-time constrained linear discriminant analysis to remote sensing image classification
Pattern Recognit.
(2005)
E. Barshan et al.
Supervised principal component analysis: visualization, classification and regression on subspaces and submanifolds
Pattern Recognit.
(2011)
A. Salazar et al.
A general procedure for learning mixtures of independent component analyzers
Pattern Recognit.
(2010)
S. Wang et al.
Semi-supervised linear discriminant analysis for dimension reduction and classification
Pattern Recognition
(2016)
R. Avogadri et al.
Fuzzy ensemble clustering based on random projections for dna micro-array data analysis
Artif. Intell. Med.
(2009)
D. Achlioptas
Database-friendly random projections: johnson-lindenstrauss with binary coins
J. Comput. Syst. Sci.
(2003)
L. Liu et al.
Sorted random projections for robust rotation-invariant texture classification
Pattern Recognit.
(2012)
X.Y. Wang et al.
Color image segmentation using pixel wise support vector machine classification
Pattern Recognit.
(2011)

B. Su et al.

Discrimination of land use patterns in remote sensing image data using minimum distance algorithm and watershed algorithm

Eng. Agric. Envir. Food

(2013)

Y. Li et al.

Hyperspectral image reconstruction by deep convolutional neural network for classification

Pattern Recognit.

(2017)

J. Gao et al.

Tibrio, Dimensionality reduction via compressive sensing,

Pattern Recognit. Lett.

(2012)

L. Wang et al.

Robust level set image segmentation via a local correntropy-based k-means clustering

Pattern Recognit.

(2014)

F. Yang et al.

Exploring the diversity in cluster ensemble generation: random sampling and random projection

Expert Syst. Appl.

(2014)

P.J. Du et al.

Review of hyperspectral remote sensing image classification

J. Remote Sens.

(2016)

X. Zhe et al.

Directional statistics-based deep metric learning for image classification and retrieval

Pattern Recognit.

(2018)

S. Sun et al.

Active learning with gaussian process classifier for hyperspectral image classification

IEEE Trans. Geoence Remote Sens.

(2014)

W. Li et al.

Kernel collaborative representation with tikhonov regularization for hyperspectral image classification

IEEE Geoence Remote Sens. Lett.

(2015)

D. Li et al.

Spatial-spectral neighbour graph for dimensionality reduction of hyperspectral image classification

Int. J. Remote Sens.

(2019)

Cited by (26)

Hyperspectral image classification using Second-Order Pooling with Graph Residual Unit Network
2024, Expert Systems with Applications
Convolutional Neural Networks (CNNs) have become increasingly popular for hyperspectral image (HSI) classification due to their ability to capture spatial and spectral information using fixed square filters. However, CNNs are less effective than Graph Convolutional Networks (GCNs) in capturing intricate relationships between elements in non-Euclidean domains. Nevertheless, applying GCNs to HSI classification involves significant computational demands, and the presence of data redundancy and noisy spectral bands in HSIs can adversely affect performance. To address this issue, we propose Second-Order Pooling with a Graph Residual Unit Network (SOPGRU) that exploits multilevel graphs to explore the spatial topologies of HSIs. The HSI is unfolded onto these multilevel graphs using residual connections, which enable direct access to earlier layer information by exploiting the spatial topology of the HSI hierarchically by generating the multiscale features for each pixel. Within the SOPGRU network, pooling and up-pooling functions are incorporated to transfer features of varying scales. To overcome the limitations mentioned earlier, we introduce a channel-spatial fusion attention (CSFA) mechanism as a preprocessing step. This mechanism effectively reduces information redundancy and eliminates noisy spectral–spatial features that can harm overall performance. Experimental evaluation on four benchmark datasets demonstrates that our proposed model achieves state-of-the-art performance in HSI classification. Furthermore, this research shows how a Graph Convolutional Network (GCN) interacts with spatial topologies and spectral bands for effective classification.
ENGA: Elastic Net-Based Genetic Algorithm for human action recognition
2023, Expert Systems with Applications
Video surveillance and activity monitoring are the practical real-time applications of Human Action Recognition (HAR). A fusion of several Convolutional Neural Network (CNN) architectures has been widely used for effective HAR and achieved impressive results. Feature fusion of multiple pre-trained models also extracts redundant features due to the combinations of identical layers in all CNN architectures. In this study, network-level fusion is proposed, which reduces the possibility of having identical layers throughout the fusion process and helps extract unique features. Three pre-trained models, i.e., NASNetLarge, DenseNet201, and DarkNet53 are selected and analyzed to select the most efficient combinations of layers among these networks. Selected combinations of these networks are fused using five proposed strategies, i.e., sum, max, concatenation, convolutional and bilinear fusion. In the end, a proposed minimized CNN architecture is utilized to extract descriptors, which are optimized using the proposed Elastic Net-based Genetic Algorithm (ENGA) approach. A two-phase hybrid ENGA technique is suggested to pick features using both GA and EN. GA is used in the initial stage to reduce the dimensionality of retrieved features. To eliminate the unnecessary features, EN regularization is put into place in the second phase. The proposed ENGA model is evaluated on four publicly available datasets including UTKinect-Action, MSR-Action3D dataset, Florence3D-Action dataset, and Youtube-8 m, and achieved 99.63%, 99.69%. 98.63% and 91.46% accuracies, respectively.
Optimum supervised classification algorithm identification by investigating PlanetScope and Skysat multispectral satellite data of Covid lockdown
2023, Geosystems and Geoenvironment
This research identifies the optimum supervised classification algorithm based on modeling Covid 19 lockdown situations all around the World. The deadly Covid 19 viruses suddenly stopped the fast-moving world and all the commercial and noncommercial activities were stalled for an uncertain period during 2020-2021. In this work, object-based image classification approaches have been used to compare pre-Covid and post-Covid (at the time lockdown) images of the study area. These study areas are Washington DC, USA, Sao Paulo, Brazil, Cairo, Egypt, Afghanistan/Iran border, and Beijing, China. All the study areas possess different geographical conditions but have a similar situation of Covid 19 lockdowns. Six supervised image classification techniques are known as Parallelepiped classification ( $P P C$ ), Minimum distance classification ( $M D C$ ), Mahalanobis distance classification ( $M a D C$ ), Maximum likelihood classification ( $M L C$ ), Spectral angle mapper classification ( $S A M C$ ) and Spectral information divergence classification ( $S I D C$ ) are used to classify the satellite data of the study area. Thus based on classification results and statistical features, it has been observed that $P P C$ has obtained the least significant results. In contrast, the most reliable results and highest classification accuracies are obtained through $M D C$ , $M a D C$ , and $M L C$ classification algorithms.
Deep Semantic-Visual Alignment for zero-shot remote sensing image scene classification
2023, ISPRS Journal of Photogrammetry and Remote Sensing
Deep neural networks have achieved promising progress in remote sensing (RS) image classification, for which the training process requires abundant samples for each class. However, it is time-consuming and unrealistic to annotate labels for each RS category, given the fact that the RS target database is increasing dynamically. Zero-shot learning (ZSL) allows for identifying novel classes that are not seen during training, which provides a promising solution for the aforementioned problem. However, previous ZSL models mainly depend on manually-labeled attributes or word embeddings extracted from language models to transfer knowledge from seen classes to novel classes. Those class embeddings may not be visually detectable and the annotation process is time-consuming and labor-intensive. Besides, pioneer ZSL models use convolutional neural networks pre-trained on ImageNet, which focus on the main objects appearing in each image, neglecting the background context that also matters in RS scene classification. To address the above problems, we propose to collect visually detectable attributes automatically. We predict attributes for each class by depicting the semantic-visual similarity between attributes and images. In this way, the attribute annotation process is accomplished by machine instead of human as in other methods. Moreover, we propose a Deep Semantic-Visual Alignment (DSVA) that take advantage of the self-attention mechanism in the transformer to associate local image regions together, integrating the background context information for prediction. The DSVA model further utilizes the attribute attention maps to focus on the informative image regions that are essential for knowledge transfer in ZSL, and maps the visual images into attribute space to perform ZSL classification. With extensive experiments, we show that our model outperforms other state-of-the-art models by a large margin on a challenging large-scale RS scene classification benchmark. Moreover, we qualitatively verify that the attributes annotated by our network are both class discriminative and semantic related, which benefits the zero-shot knowledge transfer.
Cross-modality attentive feature fusion for object detection in multispectral remote sensing imagery
2022, Pattern Recognition
Citation Excerpt :
On the contrary, single-stage detectors, such as RetinaNet [17], SSD [18], EfficientDet [19], YOLO [20], etc., take advantage of regression to achieve localization, along with recognition. Recent advances in optical sensor technology and CNN-based algorithms have encouraged detection performance in several remote sensing scenarios [21–29]. However, the objects may appear under varying conditions of illumination, weather, resolution, and occlusions.
Cross-modality fusing complementary information of multispectral remote sensing image pairs can improve the perception ability of detection algorithms, making them more robust and reliable for a wider range of applications, such as nighttime detection. Compared with prior methods, we think different features should be processed specifically, the modality-specific features should be retained and enhanced, while the modality-shared features should be cherry-picked from the RGB and thermal IR modalities. Following this idea, a novel and lightweight multispectral feature fusion approach with joint common-modality and differential-modality attentions are proposed, named Cross-Modality Attentive Feature Fusion (CMAFF). Given the intermediate feature maps of RGB and thermal images, our module parallel infers attention maps from two separate modalities, common- and differential-modality, then the attention maps are multiplied to the input feature map respectively for adaptive feature enhancement or selection. Extensive experiments demonstrate that our proposed approach can achieve the state-of-the-art performance at a low computation cost.
Deep neural networks-based relevant latent representation learning for hyperspectral image classification
2022, Pattern Recognition
The classification of hyperspectral image is a challenging task due to the high dimensional space, with large number of spectral bands, and low number of labeled training samples. To overcome these challenges, we propose a novel methodology for hyperspectral image classification based on multi-view deep neural networks which fuses both spectral and spatial features by using only a small number of labeled samples. Firstly, we process the initial hyperspectral image in order to extract a set of spectral and spatial features. Each spectral vector is the spectral signature of each pixel of the image. The spatial features are extracted using a simple deep autoencoder, which seeks to reduce the high dimensionality of data taking into account the neighborhood region for each pixel. Secondly, we propose a multi-view deep autoencoder model which allows fusing the spectral and spatial features extracted from the hyperspectral image into a joint latent representation space. Finally, a semi-supervised graph convolutional network is trained based on thee fused latent representation space to perform the hyperspectral image classification. The main advantage of the proposed approach is to allow the automatic extraction of relevant information while preserving the spatial and spectral features of data, and improve the classification of hyperspectral images even when the number of labeled samples is low. Experiments are conducted on three real hyperspectral images respectively Indian Pines, Salinas, and Pavia University datasets. Results show that the proposed approach is competitive in classification performances compared to state-of-the-art.

View all citing articles on Scopus

Jia Shuhan is a Ph.D. student in Liaoning Technical University now. Her main research interests are the identification and extraction of remote sensing image information.

Li Yu received his Ph.D. degree in 2010 from University of Waterloo. Now, he is a professor and doctoral supervisor in Liaoning Technical University. His main research interests are remote sensing data processing theory and basic application research, including spatial statistics, random geometry, fuzzy mathematics, object geometry and feature extraction.

View full text

Hyperspectral remote sensing image classification based on tighter random projection with minimal intra-class variance algorithm

Highlights

Abstract

Introduction

Section snippets

Traditional random projection

The proposed algorithm

Experimental results and discussion

Conclusions

Declaration of Competing Interest

Acknowledgments

Pattern Recognit.

Pattern Recognit.

Pattern Recognit.

Pattern Recognit.

Pattern Recognit.

Artif. Intell. Med.

J. Comput. Syst. Sci.

Pattern Recognit.

Pattern Recognit.

Eng. Agric. Envir. Food

Pattern Recognit.

Pattern Recognit. Lett.

Pattern Recognit.

Expert Syst. Appl.

Review of hyperspectral remote sensing image classification

J. Remote Sens.

Directional statistics-based deep metric learning for image classification and retrieval

Pattern Recognit.

Active learning with gaussian process classifier for hyperspectral image classification

IEEE Trans. Geoence Remote Sens.

Kernel collaborative representation with tikhonov regularization for hyperspectral image classification

IEEE Geoence Remote Sens. Lett.

Spatial-spectral neighbour graph for dimensionality reduction of hyperspectral image classification

Int. J. Remote Sens.