Abstract
This work offers an approach to aerial image classification for use in remote sensing object recognition, image processing and computer vision. Sparse coding (SC) is used to classify unmanned-aerial-vehicle (UAV) and satellite images because SC representation can generalize a large dataset and improve the detection of distinctive features by reducing calculation time for feature matching and classification. Features from images are extracted based on the following descriptors: (a) Scale Invariant Feature Transform; (b) Histogram of Oriented Gradients; and (c) Local Binary Patterns. SC representation and local image features are combined to represent global features for classification. Features are deployed in a sparse model to store descriptor features using extant dictionaries such as (a) the Discrete Cosine Transform and (b) the Discrete Wavelet Transform. An additional two dictionaries are proposed as developed for the present work: (c) the Discrete Ridgelet Transform (DRT) and (d) the Discrete Tchebichef Transform. The DRT dictionary is constructed by using the Ricker wavelet function to generate finite Ridgelet transforms as basis elements for a hybrid dictionary. Different pooling methods have also been employed to convert sparse-coded features into a feature matrix. Various machine learning algorithms are then applied to the feature matrix to classify objects contained in UAV and satellite imagery data. Experimental results show that the SC model secured better accuracy rates for extracted discriminative features contained in remote sensing images. The authors concluded that the proposed SC technique and proposed dictionaries provided feasible solutions for image classification and object recognition.
Similar content being viewed by others
References
Hu F, Xia G-S, Hu J, Zhang L (2015) Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery. Remote Sens 7(11):14680–14707
Zhou G (2009) Near real-time orthorectification and mosaic of small UAV video flow for time-critical event response. IEEE Trans Geosci Remote Sens 47(3):739–747
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Swain MJ, Ballard DH (1991) Color indexing. Int J Comput Vis 7(1):11–32
Puissant A, Hirsch J, Weber C (2005) The utility of texture analysis to improve per-pixel classification for high to very high spatial resolution imagery. Int J Remote Sens 26(4):733–745
Yang Y, Newsam S (2013) Geographic Image retrieval using local invariant features. IEEE Trans Geosci Remote Sens 51(2):818–832
Yang Y, Newsam S (2008) Comparing SIFT descriptors and gabor texture features for classification of remote sensed imagery. In: 2008 15th IEEE international conference on image processing, 2008, pp 1852–1855
Zhou XS, Huang TS (2003) Relevance feedback in image retrieval: a comprehensive review. Multimed Syst 8(6):536–544
van de Sande KEA, Gevers T, Snoek CGM (2010) Evaluating color descriptors for object and scene recognition. IEEE Trans Pattern Anal Mach Intell 32(9):1582–1596
Wu L, Hoi SCH, Yu N (2010) Semantics-preserving bag-of-words models and applications. IEEE Trans Image Process 19(7):1908–1920
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE computer society conference on computer vision and pattern recognition—volume 2 (CVPR’06), vol 2, pp 2169–2178
Olshausen BA, Field DJ (1997) Strategy employed by V1 ? Vis Res 37(23):3311–3325
Mairal J, Elad M, Sapiro G (2008) Sparse representation for color image restoration. IEEE Trans Image Process 17(1):53–69
Yang J, Wright J, Huang T, Ma Y (2008) Image super-resolution as sparse representation of raw image patches. In: 2008 IEEE conference on computer vision and pattern recognition, 2008, pp 1–8
Mairal J, Bach F, Ponce J, Sapiro G, Zisserman A (2008) Discriminative learned dictionaries for local image analysis. In: IEEE Conference on computer vision and pattern recognition, pp 1–8
Zeng S, Gou J, Yang X (2017) Improving sparsity of coefficients for robust sparse and collaborative representation-based image classification. Neural Comput Appl
Ranzato M, Huang FJ, Boureau Y-L, LeCun Y (2007) Unsupervised Learning of Invariant feature hierarchies with applications to object recognition. In: IEEE conference on computer vision and pattern recognition, pp 1–8
Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. In: IEEE Conference on computer vision and pattern recognition, pp 1794–1801
Gao S, Tsang IW-H, Chia L-T, Zhao P (2010) Local features are not lonely & #x2013; Laplacian sparse coding for image classification. In: IEEE computer society conference on computer vision and pattern recognition, pp 3555–3561
Agarwal V, Bhanot S (2017) Radial basis function neural network-based face recognition using firefly algorithm. Neural Comput Appl
Lee JW, Lee JB, Park M, Song SH (2005) An extensive comparison of recent classification tools applied to microarray data. Comput Stat Data Anal 48(4):869–885
Statnikov A, Aliferis CF, Tsamardinos I, Hardin D, Levy S (2005) A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis. Bioinformatics 21(5):631–643
Nex F, Remondino F (2014) UAV for 3D mapping applications: a review. Appl Geomatics 6(1):1–15
Lowe DG (1999) Object recognition from local scale-invariant features. In: Proceedings of the seventh IEEE international conference on computer vision, 1999, vol 2, pp 1150–1157
Mondragon IF, Campoy P, Correa JF, Mejias L (2007) Visual model feature tracking for UAV Control. In IEEE international symposium on intelligent signal processing 2007, pp 1–6
Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630
Ke y, Sukthankar R (2004) PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of the 2004 IEEE computer society conference on computer vision and pattern recognition, 2004. CVPR 2004., vol 2, pp 506–513
Cai D, He X, Han J (2007) Spectral regression: a unified approach for sparse subspace learning. In: Seventh IEEE international conference on data mining (ICDM 2007), 2007, pp 73–82
Raina R, Battle A, Lee H, Packer B, Ng AY (2007) Self-taught learning. In: Proceedings of the 24th international conference on machine learning—ICML’07, 2007, pp 759–766
Pham D-S, Venkatesh S (2008) Joint learning and dictionary construction for pattern recognition. In: IEEE conference on computer vision and pattern recognition, 2008, pp 1–8
Wright J, Yang AY, Ganesh A, Sastry SS, Ma Yi (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227
Siebert S, Teizer J (2014) Mobile 3D mapping for surveying earthwork projects using an Unmanned Aerial Vehicle (UAV) system. Autom Constr 41:1–14
Elad M, Figueiredo MAT, Ma Y (2010) On the role of sparse and redundant representations in image processing. Proc IEEE 98(6):972–982
Mallat SG (1993) Matching pursuits with time-frequency dictionaries. IEEE Trans Signal Process 41(12):3397–3415
Mallat SG (1994) Adaptive time-frequency decompositions. Opt Eng 33(7):2183
Wei Q, Bioucas-Dias J, Dobigeon N, Tourneret J-Y (2015) Hyperspectral and multispectral image fusion based on a sparse representation. IEEE Trans Geosci Remote Sens 53(7):3658–3668
Qayyum A, Malik AS, Nuafal M, Mazher M, Ahmad RF, Abdullah MF (2015) Evaluation of optimization algorithms for sparse and redundant dictionaries. In: IEEE student symposium in biomedical engineering & sciences (ISSBES), 2015, pp 128–133
Do MN, Vetterli M (2003) The finite ridgelet transform for image representation. IEEE Trans Image Process 12(1):16–28
Rahmalan H, Abu NA, Wong SL (2010) Using tchebichef moment for fast and efficient image compression. Pattern Recognit Image Anal 20(4):505–512
Boureau Y-L, Bach F, LeCun Y, Ponce J (2010) Learning mid-level features for recognition. In: IEEE computer society conference on computer vision and pattern recognition 2010, pp 2559–2566
Serre T, Wolf L, Poggio T (2005) Object recognition with features inspired by visual cortex. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05), vol 2, pp 994–1000
Agarwal A, Triggs B (2006) Hyperfeatures—multilevel local coding for visual recognition, 2006, pp 30–43
Melgani F, Bruzzone L (2004) Classification of hyperspectral remote sensing images with support vector machines. IEEE Trans Geosci Remote Sens 42(8):1778–1790
Benediktsson JA, Pesaresi M, Arnason K (2003) Classification and feature extraction for remote sensing images from urban areas based on morphological transformations. IEEE Trans Geosci Remote Sens 41(9):1940–1949
Manikandan J, Venkataramani B (2009) Design of a modified one-against-all SVM classifier. In: 2009 IEEE international conference on systems, man and cybernetics, 2009, pp 1869–1874
Ou G, Murphey YL (2007) Multi-class pattern classification using neural networks. Pattern Recognit 40(1):4–18
Kowalski PA, Kulczycki P (2015) Interval probabilistic neural network. Neural Comput Appl 28:817–834
Orlowska-Kowalska T, Kaminski M (2014) Influence of the optimization methods on neural state estimation quality of the drive system with elasticity. Neural Comput Appl 24(6):1327–1340
Huang G, Huang G-B, Song S, You K (2015) Trends in extreme learning machines: a review. Neural Networks 61:32–48
Huang G, Song S, Gupta JN, Wu C (2014) Semi-supervised and unsupervised extreme learning machines. IEEE Trans Cybern 44(12):2405–2417
Zhang L, Jack LB, Nandi AK (2005) Extending genetic programming for multi-class classification by combining K-nearest neighbor. In: Proceedings. (ICASSP’05). IEEE international conference on acoustics, speech, and signal processing, 2005, vol 5, pp 349–352
Acknowledgements
This project was supported by The Ministery of Energy, Green Technology and Water, Malaysia (Cost Center:AB0315-G07) collaborated with Universiti Teknologi PETRONAS.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors state that there is no conflict of interests relating the publication of this paper.
Rights and permissions
About this article
Cite this article
Qayyum, A., Saeed Malik, A., Saad, N.M. et al. Image classification based on sparse-coded features using sparse coding technique for aerial imagery: a hybrid dictionary approach. Neural Comput & Applic 31, 3587–3607 (2019). https://doi.org/10.1007/s00521-017-3300-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-017-3300-5