Image classification based on sparse-coded features using sparse coding technique for aerial imagery: a hybrid dictionary approach

Qayyum, Abdul; Saeed Malik, Aamir; Saad, Naufal M.; Iqbal, Mahboob; Abdullah, Mohd Faris; Rasheed, Waqas; Abdullah, Tuan A. B. Rashid; Bin Jafaar, Mohd Yaqoob

doi:10.1007/s00521-017-3300-5

Image classification based on sparse-coded features using sparse coding technique for aerial imagery: a hybrid dictionary approach

Original Article
Published: 30 December 2017

Volume 31, pages 3587–3607, (2019)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Abdul Qayyum¹,
Aamir Saeed Malik¹,
Naufal M. Saad¹,
Mahboob Iqbal¹,
Mohd Faris Abdullah¹,
Waqas Rasheed¹,
Tuan A. B. Rashid Abdullah² &
…
Mohd Yaqoob Bin Jafaar³

1105 Accesses
8 Citations
Explore all metrics

Abstract

This work offers an approach to aerial image classification for use in remote sensing object recognition, image processing and computer vision. Sparse coding (SC) is used to classify unmanned-aerial-vehicle (UAV) and satellite images because SC representation can generalize a large dataset and improve the detection of distinctive features by reducing calculation time for feature matching and classification. Features from images are extracted based on the following descriptors: (a) Scale Invariant Feature Transform; (b) Histogram of Oriented Gradients; and (c) Local Binary Patterns. SC representation and local image features are combined to represent global features for classification. Features are deployed in a sparse model to store descriptor features using extant dictionaries such as (a) the Discrete Cosine Transform and (b) the Discrete Wavelet Transform. An additional two dictionaries are proposed as developed for the present work: (c) the Discrete Ridgelet Transform (DRT) and (d) the Discrete Tchebichef Transform. The DRT dictionary is constructed by using the Ricker wavelet function to generate finite Ridgelet transforms as basis elements for a hybrid dictionary. Different pooling methods have also been employed to convert sparse-coded features into a feature matrix. Various machine learning algorithms are then applied to the feature matrix to classify objects contained in UAV and satellite imagery data. Experimental results show that the SC model secured better accuracy rates for extracted discriminative features contained in remote sensing images. The authors concluded that the proposed SC technique and proposed dictionaries provided feasible solutions for image classification and object recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Classification Modeling of Multi-Featured Remote Sensing Images Based on Sparse Representation

A remote sensing image classification method based on sparse representation

Article 20 February 2016

Shulei Wu, Huandong Chen, … Guokang Zhu

Remote Targets Recognition Based on Adaptive Weighting Feature Dictionaries and Joint Sparse Representations

Article 07 September 2018

Wei Wang, Junwu Chen, … Xin Wang

References

Hu F, Xia G-S, Hu J, Zhang L (2015) Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery. Remote Sens 7(11):14680–14707
Article Google Scholar
Zhou G (2009) Near real-time orthorectification and mosaic of small UAV video flow for time-critical event response. IEEE Trans Geosci Remote Sens 47(3):739–747
Article Google Scholar
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Article Google Scholar
Swain MJ, Ballard DH (1991) Color indexing. Int J Comput Vis 7(1):11–32
Article Google Scholar
Puissant A, Hirsch J, Weber C (2005) The utility of texture analysis to improve per-pixel classification for high to very high spatial resolution imagery. Int J Remote Sens 26(4):733–745
Article Google Scholar
Yang Y, Newsam S (2013) Geographic Image retrieval using local invariant features. IEEE Trans Geosci Remote Sens 51(2):818–832
Article Google Scholar
Yang Y, Newsam S (2008) Comparing SIFT descriptors and gabor texture features for classification of remote sensed imagery. In: 2008 15th IEEE international conference on image processing, 2008, pp 1852–1855
Zhou XS, Huang TS (2003) Relevance feedback in image retrieval: a comprehensive review. Multimed Syst 8(6):536–544
Article Google Scholar
van de Sande KEA, Gevers T, Snoek CGM (2010) Evaluating color descriptors for object and scene recognition. IEEE Trans Pattern Anal Mach Intell 32(9):1582–1596
Article Google Scholar
Wu L, Hoi SCH, Yu N (2010) Semantics-preserving bag-of-words models and applications. IEEE Trans Image Process 19(7):1908–1920
Article MathSciNet MATH Google Scholar
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE computer society conference on computer vision and pattern recognition—volume 2 (CVPR’06), vol 2, pp 2169–2178
Olshausen BA, Field DJ (1997) Strategy employed by V1 ? Vis Res 37(23):3311–3325
Article Google Scholar
Mairal J, Elad M, Sapiro G (2008) Sparse representation for color image restoration. IEEE Trans Image Process 17(1):53–69
Article MathSciNet MATH Google Scholar
Yang J, Wright J, Huang T, Ma Y (2008) Image super-resolution as sparse representation of raw image patches. In: 2008 IEEE conference on computer vision and pattern recognition, 2008, pp 1–8
Mairal J, Bach F, Ponce J, Sapiro G, Zisserman A (2008) Discriminative learned dictionaries for local image analysis. In: IEEE Conference on computer vision and pattern recognition, pp 1–8
Zeng S, Gou J, Yang X (2017) Improving sparsity of coefficients for robust sparse and collaborative representation-based image classification. Neural Comput Appl
Ranzato M, Huang FJ, Boureau Y-L, LeCun Y (2007) Unsupervised Learning of Invariant feature hierarchies with applications to object recognition. In: IEEE conference on computer vision and pattern recognition, pp 1–8
Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. In: IEEE Conference on computer vision and pattern recognition, pp 1794–1801
Gao S, Tsang IW-H, Chia L-T, Zhao P (2010) Local features are not lonely & #x2013; Laplacian sparse coding for image classification. In: IEEE computer society conference on computer vision and pattern recognition, pp 3555–3561
Agarwal V, Bhanot S (2017) Radial basis function neural network-based face recognition using firefly algorithm. Neural Comput Appl
Lee JW, Lee JB, Park M, Song SH (2005) An extensive comparison of recent classification tools applied to microarray data. Comput Stat Data Anal 48(4):869–885
Article MathSciNet MATH Google Scholar
Statnikov A, Aliferis CF, Tsamardinos I, Hardin D, Levy S (2005) A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis. Bioinformatics 21(5):631–643
Article Google Scholar
Nex F, Remondino F (2014) UAV for 3D mapping applications: a review. Appl Geomatics 6(1):1–15
Article Google Scholar
Lowe DG (1999) Object recognition from local scale-invariant features. In: Proceedings of the seventh IEEE international conference on computer vision, 1999, vol 2, pp 1150–1157
Mondragon IF, Campoy P, Correa JF, Mejias L (2007) Visual model feature tracking for UAV Control. In IEEE international symposium on intelligent signal processing 2007, pp 1–6
Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630
Article Google Scholar
Ke y, Sukthankar R (2004) PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of the 2004 IEEE computer society conference on computer vision and pattern recognition, 2004. CVPR 2004., vol 2, pp 506–513
Cai D, He X, Han J (2007) Spectral regression: a unified approach for sparse subspace learning. In: Seventh IEEE international conference on data mining (ICDM 2007), 2007, pp 73–82
Raina R, Battle A, Lee H, Packer B, Ng AY (2007) Self-taught learning. In: Proceedings of the 24th international conference on machine learning—ICML’07, 2007, pp 759–766
Pham D-S, Venkatesh S (2008) Joint learning and dictionary construction for pattern recognition. In: IEEE conference on computer vision and pattern recognition, 2008, pp 1–8
Wright J, Yang AY, Ganesh A, Sastry SS, Ma Yi (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227
Article Google Scholar
Siebert S, Teizer J (2014) Mobile 3D mapping for surveying earthwork projects using an Unmanned Aerial Vehicle (UAV) system. Autom Constr 41:1–14
Article Google Scholar
Elad M, Figueiredo MAT, Ma Y (2010) On the role of sparse and redundant representations in image processing. Proc IEEE 98(6):972–982
Article Google Scholar
Mallat SG (1993) Matching pursuits with time-frequency dictionaries. IEEE Trans Signal Process 41(12):3397–3415
Article MATH Google Scholar
Mallat SG (1994) Adaptive time-frequency decompositions. Opt Eng 33(7):2183
Article Google Scholar
Wei Q, Bioucas-Dias J, Dobigeon N, Tourneret J-Y (2015) Hyperspectral and multispectral image fusion based on a sparse representation. IEEE Trans Geosci Remote Sens 53(7):3658–3668
Article Google Scholar
Qayyum A, Malik AS, Nuafal M, Mazher M, Ahmad RF, Abdullah MF (2015) Evaluation of optimization algorithms for sparse and redundant dictionaries. In: IEEE student symposium in biomedical engineering & sciences (ISSBES), 2015, pp 128–133
Do MN, Vetterli M (2003) The finite ridgelet transform for image representation. IEEE Trans Image Process 12(1):16–28
Article MathSciNet MATH Google Scholar
Rahmalan H, Abu NA, Wong SL (2010) Using tchebichef moment for fast and efficient image compression. Pattern Recognit Image Anal 20(4):505–512
Article Google Scholar
Boureau Y-L, Bach F, LeCun Y, Ponce J (2010) Learning mid-level features for recognition. In: IEEE computer society conference on computer vision and pattern recognition 2010, pp 2559–2566
Serre T, Wolf L, Poggio T (2005) Object recognition with features inspired by visual cortex. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05), vol 2, pp 994–1000
Agarwal A, Triggs B (2006) Hyperfeatures—multilevel local coding for visual recognition, 2006, pp 30–43
Melgani F, Bruzzone L (2004) Classification of hyperspectral remote sensing images with support vector machines. IEEE Trans Geosci Remote Sens 42(8):1778–1790
Article Google Scholar
Benediktsson JA, Pesaresi M, Arnason K (2003) Classification and feature extraction for remote sensing images from urban areas based on morphological transformations. IEEE Trans Geosci Remote Sens 41(9):1940–1949
Article Google Scholar
Manikandan J, Venkataramani B (2009) Design of a modified one-against-all SVM classifier. In: 2009 IEEE international conference on systems, man and cybernetics, 2009, pp 1869–1874
Ou G, Murphey YL (2007) Multi-class pattern classification using neural networks. Pattern Recognit 40(1):4–18
Article MATH Google Scholar
Kowalski PA, Kulczycki P (2015) Interval probabilistic neural network. Neural Comput Appl 28:817–834
Article Google Scholar
Orlowska-Kowalska T, Kaminski M (2014) Influence of the optimization methods on neural state estimation quality of the drive system with elasticity. Neural Comput Appl 24(6):1327–1340
Article Google Scholar
Huang G, Huang G-B, Song S, You K (2015) Trends in extreme learning machines: a review. Neural Networks 61:32–48
Article MATH Google Scholar
Huang G, Song S, Gupta JN, Wu C (2014) Semi-supervised and unsupervised extreme learning machines. IEEE Trans Cybern 44(12):2405–2417
Article Google Scholar
Zhang L, Jack LB, Nandi AK (2005) Extending genetic programming for multi-class classification by combining K-nearest neighbor. In: Proceedings. (ICASSP’05). IEEE international conference on acoustics, speech, and signal processing, 2005, vol 5, pp 349–352

Download references

Acknowledgements

This project was supported by The Ministery of Energy, Green Technology and Water, Malaysia (Cost Center:AB0315-G07) collaborated with Universiti Teknologi PETRONAS.

Author information

Authors and Affiliations

Department of Electrical and Electronic Engineering, Centre for Intelligent Signal and Imaging Research (CISIR), Universiti Teknologi PETRONAS, 32610, Bandar Seri Iskandar, Perak, Malaysia
Abdul Qayyum, Aamir Saeed Malik, Naufal M. Saad, Mahboob Iqbal, Mohd Faris Abdullah & Waqas Rasheed
UNITEN, Putrajaya Campus, Jalan IKRAM-UNITED, 43000, Kajang, Selangor, Malaysia
Tuan A. B. Rashid Abdullah
Sabah Electricity Sdn Bhd (SESB), Ground Floor, Wisma Lembaga Letrik Sabah, Jln Tuaran, Karamunsing, 88400, Kota Kinabalu, Sabah, Malaysia
Mohd Yaqoob Bin Jafaar

Authors

Abdul Qayyum
View author publications
You can also search for this author in PubMed Google Scholar
Aamir Saeed Malik
View author publications
You can also search for this author in PubMed Google Scholar
Naufal M. Saad
View author publications
You can also search for this author in PubMed Google Scholar
Mahboob Iqbal
View author publications
You can also search for this author in PubMed Google Scholar
Mohd Faris Abdullah
View author publications
You can also search for this author in PubMed Google Scholar
Waqas Rasheed
View author publications
You can also search for this author in PubMed Google Scholar
Tuan A. B. Rashid Abdullah
View author publications
You can also search for this author in PubMed Google Scholar
Mohd Yaqoob Bin Jafaar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Abdul Qayyum.

Ethics declarations

Conflict of interest

The authors state that there is no conflict of interests relating the publication of this paper.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Qayyum, A., Saeed Malik, A., Saad, N.M. et al. Image classification based on sparse-coded features using sparse coding technique for aerial imagery: a hybrid dictionary approach. Neural Comput & Applic 31, 3587–3607 (2019). https://doi.org/10.1007/s00521-017-3300-5

Download citation

Received: 25 July 2016
Accepted: 21 December 2017
Published: 30 December 2017
Issue Date: August 2019
DOI: https://doi.org/10.1007/s00521-017-3300-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Image classification based on sparse-coded features using sparse coding technique for aerial imagery: a hybrid dictionary approach

Abstract

Access this article

Similar content being viewed by others

Classification Modeling of Multi-Featured Remote Sensing Images Based on Sparse Representation

A remote sensing image classification method based on sparse representation

Remote Targets Recognition Based on Adaptive Weighting Feature Dictionaries and Joint Sparse Representations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Image classification based on sparse-coded features using sparse coding technique for aerial imagery: a hybrid dictionary approach

Abstract

Access this article

Similar content being viewed by others

Classification Modeling of Multi-Featured Remote Sensing Images Based on Sparse Representation

A remote sensing image classification method based on sparse representation

Remote Targets Recognition Based on Adaptive Weighting Feature Dictionaries and Joint Sparse Representations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation