Skip to main content
Log in

Image classification based on sparse-coded features using sparse coding technique for aerial imagery: a hybrid dictionary approach

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

This work offers an approach to aerial image classification for use in remote sensing object recognition, image processing and computer vision. Sparse coding (SC) is used to classify unmanned-aerial-vehicle (UAV) and satellite images because SC representation can generalize a large dataset and improve the detection of distinctive features by reducing calculation time for feature matching and classification. Features from images are extracted based on the following descriptors: (a) Scale Invariant Feature Transform; (b) Histogram of Oriented Gradients; and (c) Local Binary Patterns. SC representation and local image features are combined to represent global features for classification. Features are deployed in a sparse model to store descriptor features using extant dictionaries such as (a) the Discrete Cosine Transform and (b) the Discrete Wavelet Transform. An additional two dictionaries are proposed as developed for the present work: (c) the Discrete Ridgelet Transform (DRT) and (d) the Discrete Tchebichef Transform. The DRT dictionary is constructed by using the Ricker wavelet function to generate finite Ridgelet transforms as basis elements for a hybrid dictionary. Different pooling methods have also been employed to convert sparse-coded features into a feature matrix. Various machine learning algorithms are then applied to the feature matrix to classify objects contained in UAV and satellite imagery data. Experimental results show that the SC model secured better accuracy rates for extracted discriminative features contained in remote sensing images. The authors concluded that the proposed SC technique and proposed dictionaries provided feasible solutions for image classification and object recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Hu F, Xia G-S, Hu J, Zhang L (2015) Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery. Remote Sens 7(11):14680–14707

    Article  Google Scholar 

  2. Zhou G (2009) Near real-time orthorectification and mosaic of small UAV video flow for time-critical event response. IEEE Trans Geosci Remote Sens 47(3):739–747

    Article  Google Scholar 

  3. Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110

    Article  Google Scholar 

  4. Swain MJ, Ballard DH (1991) Color indexing. Int J Comput Vis 7(1):11–32

    Article  Google Scholar 

  5. Puissant A, Hirsch J, Weber C (2005) The utility of texture analysis to improve per-pixel classification for high to very high spatial resolution imagery. Int J Remote Sens 26(4):733–745

    Article  Google Scholar 

  6. Yang Y, Newsam S (2013) Geographic Image retrieval using local invariant features. IEEE Trans Geosci Remote Sens 51(2):818–832

    Article  Google Scholar 

  7. Yang Y, Newsam S (2008) Comparing SIFT descriptors and gabor texture features for classification of remote sensed imagery. In: 2008 15th IEEE international conference on image processing, 2008, pp 1852–1855

  8. Zhou XS, Huang TS (2003) Relevance feedback in image retrieval: a comprehensive review. Multimed Syst 8(6):536–544

    Article  Google Scholar 

  9. van de Sande KEA, Gevers T, Snoek CGM (2010) Evaluating color descriptors for object and scene recognition. IEEE Trans Pattern Anal Mach Intell 32(9):1582–1596

    Article  Google Scholar 

  10. Wu L, Hoi SCH, Yu N (2010) Semantics-preserving bag-of-words models and applications. IEEE Trans Image Process 19(7):1908–1920

    Article  MathSciNet  MATH  Google Scholar 

  11. Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE computer society conference on computer vision and pattern recognition—volume 2 (CVPR’06), vol 2, pp 2169–2178

  12. Olshausen BA, Field DJ (1997) Strategy employed by V1 ? Vis Res 37(23):3311–3325

    Article  Google Scholar 

  13. Mairal J, Elad M, Sapiro G (2008) Sparse representation for color image restoration. IEEE Trans Image Process 17(1):53–69

    Article  MathSciNet  MATH  Google Scholar 

  14. Yang J, Wright J, Huang T, Ma Y (2008) Image super-resolution as sparse representation of raw image patches. In: 2008 IEEE conference on computer vision and pattern recognition, 2008, pp 1–8

  15. Mairal J, Bach F, Ponce J, Sapiro G, Zisserman A (2008) Discriminative learned dictionaries for local image analysis. In: IEEE Conference on computer vision and pattern recognition, pp 1–8

  16. Zeng S, Gou J, Yang X (2017) Improving sparsity of coefficients for robust sparse and collaborative representation-based image classification. Neural Comput Appl

  17. Ranzato M, Huang FJ, Boureau Y-L, LeCun Y (2007) Unsupervised Learning of Invariant feature hierarchies with applications to object recognition. In: IEEE conference on computer vision and pattern recognition, pp 1–8

  18. Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. In: IEEE Conference on computer vision and pattern recognition, pp 1794–1801

  19. Gao S, Tsang IW-H, Chia L-T, Zhao P (2010) Local features are not lonely & #x2013; Laplacian sparse coding for image classification. In: IEEE computer society conference on computer vision and pattern recognition, pp 3555–3561

  20. Agarwal V, Bhanot S (2017) Radial basis function neural network-based face recognition using firefly algorithm. Neural Comput Appl

  21. Lee JW, Lee JB, Park M, Song SH (2005) An extensive comparison of recent classification tools applied to microarray data. Comput Stat Data Anal 48(4):869–885

    Article  MathSciNet  MATH  Google Scholar 

  22. Statnikov A, Aliferis CF, Tsamardinos I, Hardin D, Levy S (2005) A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis. Bioinformatics 21(5):631–643

    Article  Google Scholar 

  23. Nex F, Remondino F (2014) UAV for 3D mapping applications: a review. Appl Geomatics 6(1):1–15

    Article  Google Scholar 

  24. Lowe DG (1999) Object recognition from local scale-invariant features. In: Proceedings of the seventh IEEE international conference on computer vision, 1999, vol 2, pp 1150–1157

  25. Mondragon IF, Campoy P, Correa JF, Mejias L (2007) Visual model feature tracking for UAV Control. In IEEE international symposium on intelligent signal processing 2007, pp 1–6

  26. Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630

    Article  Google Scholar 

  27. Ke y, Sukthankar R (2004) PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of the 2004 IEEE computer society conference on computer vision and pattern recognition, 2004. CVPR 2004., vol 2, pp 506–513

  28. Cai D, He X, Han J (2007) Spectral regression: a unified approach for sparse subspace learning. In: Seventh IEEE international conference on data mining (ICDM 2007), 2007, pp 73–82

  29. Raina R, Battle A, Lee H, Packer B, Ng AY (2007) Self-taught learning. In: Proceedings of the 24th international conference on machine learning—ICML’07, 2007, pp 759–766

  30. Pham D-S, Venkatesh S (2008) Joint learning and dictionary construction for pattern recognition. In: IEEE conference on computer vision and pattern recognition, 2008, pp 1–8

  31. Wright J, Yang AY, Ganesh A, Sastry SS, Ma Yi (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227

    Article  Google Scholar 

  32. Siebert S, Teizer J (2014) Mobile 3D mapping for surveying earthwork projects using an Unmanned Aerial Vehicle (UAV) system. Autom Constr 41:1–14

    Article  Google Scholar 

  33. Elad M, Figueiredo MAT, Ma Y (2010) On the role of sparse and redundant representations in image processing. Proc IEEE 98(6):972–982

    Article  Google Scholar 

  34. Mallat SG (1993) Matching pursuits with time-frequency dictionaries. IEEE Trans Signal Process 41(12):3397–3415

    Article  MATH  Google Scholar 

  35. Mallat SG (1994) Adaptive time-frequency decompositions. Opt Eng 33(7):2183

    Article  Google Scholar 

  36. Wei Q, Bioucas-Dias J, Dobigeon N, Tourneret J-Y (2015) Hyperspectral and multispectral image fusion based on a sparse representation. IEEE Trans Geosci Remote Sens 53(7):3658–3668

    Article  Google Scholar 

  37. Qayyum A, Malik AS, Nuafal M, Mazher M, Ahmad RF, Abdullah MF (2015) Evaluation of optimization algorithms for sparse and redundant dictionaries. In: IEEE student symposium in biomedical engineering & sciences (ISSBES), 2015, pp 128–133

  38. Do MN, Vetterli M (2003) The finite ridgelet transform for image representation. IEEE Trans Image Process 12(1):16–28

    Article  MathSciNet  MATH  Google Scholar 

  39. Rahmalan H, Abu NA, Wong SL (2010) Using tchebichef moment for fast and efficient image compression. Pattern Recognit Image Anal 20(4):505–512

    Article  Google Scholar 

  40. Boureau Y-L, Bach F, LeCun Y, Ponce J (2010) Learning mid-level features for recognition. In: IEEE computer society conference on computer vision and pattern recognition 2010, pp 2559–2566

  41. Serre T, Wolf L, Poggio T (2005) Object recognition with features inspired by visual cortex. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05), vol 2, pp 994–1000

  42. Agarwal A, Triggs B (2006) Hyperfeatures—multilevel local coding for visual recognition, 2006, pp 30–43

  43. Melgani F, Bruzzone L (2004) Classification of hyperspectral remote sensing images with support vector machines. IEEE Trans Geosci Remote Sens 42(8):1778–1790

    Article  Google Scholar 

  44. Benediktsson JA, Pesaresi M, Arnason K (2003) Classification and feature extraction for remote sensing images from urban areas based on morphological transformations. IEEE Trans Geosci Remote Sens 41(9):1940–1949

    Article  Google Scholar 

  45. Manikandan J, Venkataramani B (2009) Design of a modified one-against-all SVM classifier. In: 2009 IEEE international conference on systems, man and cybernetics, 2009, pp 1869–1874

  46. Ou G, Murphey YL (2007) Multi-class pattern classification using neural networks. Pattern Recognit 40(1):4–18

    Article  MATH  Google Scholar 

  47. Kowalski PA, Kulczycki P (2015) Interval probabilistic neural network. Neural Comput Appl 28:817–834

    Article  Google Scholar 

  48. Orlowska-Kowalska T, Kaminski M (2014) Influence of the optimization methods on neural state estimation quality of the drive system with elasticity. Neural Comput Appl 24(6):1327–1340

    Article  Google Scholar 

  49. Huang G, Huang G-B, Song S, You K (2015) Trends in extreme learning machines: a review. Neural Networks 61:32–48

    Article  MATH  Google Scholar 

  50. Huang G, Song S, Gupta JN, Wu C (2014) Semi-supervised and unsupervised extreme learning machines. IEEE Trans Cybern 44(12):2405–2417

    Article  Google Scholar 

  51. Zhang L, Jack LB, Nandi AK (2005) Extending genetic programming for multi-class classification by combining K-nearest neighbor. In: Proceedings. (ICASSP’05). IEEE international conference on acoustics, speech, and signal processing, 2005, vol 5, pp 349–352

Download references

Acknowledgements

This project was supported by The Ministery of Energy, Green Technology and Water, Malaysia (Cost Center:AB0315-G07) collaborated with Universiti Teknologi PETRONAS.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Abdul Qayyum.

Ethics declarations

Conflict of interest

The authors state that there is no conflict of interests relating the publication of this paper.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Qayyum, A., Saeed Malik, A., Saad, N.M. et al. Image classification based on sparse-coded features using sparse coding technique for aerial imagery: a hybrid dictionary approach. Neural Comput & Applic 31, 3587–3607 (2019). https://doi.org/10.1007/s00521-017-3300-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-017-3300-5

Keywords

Navigation