Remote sensing scene classification with multi-spatial scale frequency covariance pooling

Chen, Wenjie; Gao, Yuan; Chen, Aibin; Zhou, Guoxiong; Wang, Jianwu; Yang, Xiaobo; Jiang, RunDong

doi:10.1007/s11042-022-12603-x

Remote sensing scene classification with multi-spatial scale frequency covariance pooling

Published: 06 April 2022

Volume 81, pages 30413–30435, (2022)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Wenjie Chen^1,2,3,
Yuan Gao^1,2,3,
Aibin Chen ORCID: orcid.org/0000-0003-4410-412X^1,2,3,4,
Guoxiong Zhou^1,2,3,
Jianwu Wang^1,2,3,
Xiaobo Yang^1,2,3 &
…
RunDong Jiang^1,2,3

254 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

To address the problem of redundant learning in remote sensing scene classification, a method of multi-space-scale frequency covariance pooling (MSFCP) is proposed in this study. Specifically, a Gabor filter is introduced to the network which reduced redundant learning in ordinary convolution filters and enhanced the robustness of the network to external interference. Secondly, reducing redundant information in low-frequency components via dividing the feature map output by the first layer into high and low-frequencies and performing average pooling for low-frequency information. Next, the introduction of the Octave Convolution (OctConv) operation realized self-update and information interaction of high and low-frequency characteristics. Finally, the global covariance pooling is performed on the output feature map to enhance the representation ability of the entire network and boost the classification effect. Our method performed an accuracy value of 99.35 ± 0.28 (%) on the UC Merced Land Use dataset. The experimental results demonstrate that the proposed MSFCP method achieves better classification performance and lower network model complexity than other methods, which significantly reduces the demand for computing power. Hence, a good trade-off is achieved between experimental accuracy and computational resource consumption.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-deep features fusion for high-resolution remote sensing image scene classification

Article 22 June 2020

Baohua Yuan, Lixin Han, … Hong Yan

High-Resolution Remote Sensing Image Scene Classification by Merging Multilevel Features of Convolutional Neural Networks

Article 17 February 2021

Xiaoxia Zhang, Yong Guo & Xia Zhang

Remote Sensing Image Fusion Using Multi-Scale Convolutional Neural Network

Article 23 March 2021

Wei Shi, ChaoBen Du, … JiNing Yan

References

Chaib S, Gu Y, Yao H (2015) An informative feature selection method based on sparse PCA for VHR scene classification[J]. IEEE Geosci Remote Sens Lett 13(2):147–151. https://doi.org/10.1109/lgrs.2015.2501383
Article Google Scholar
Chan TH, Jia K, Gao S (2015) PCANet: a simple deep learning baseline for image classification?[J]. IEEE Trans Image Process 24(12):5017–5032. https://doi.org/10.1109/tip.2015.2475625
Article MathSciNet MATH Google Scholar
Chen Y, Fang H, Xu B (2019) Drop an octave: reducing spatial redundancy in convolutional neural networks with octave convolution[J]. arXiv preprint arXiv:1904.05049. arXiv:1904.05049v3
Cheng G, Guo L, Zhao T (2013) Automatic landslide detection from remote-sensing imagery using a scene classification method based on BoVW and pLSA[J]. Int J Remote Sens 34(1):45–59. https://doi.org/10.1080/01431161.2012.705443
Article Google Scholar
Cheng G, Han J, Zhou P (2014) Multi-class geospatial object detection and geographic image classification based on collection of part detectors[J]. ISPRS J Photogramm Remote Sens 98:119–132. https://doi.org/10.1016/j.isprsjprs.2014.10.002
Article Google Scholar
Cheng G, Zhou P, Han J (2015) Auto-encoder-based shared mid-level visual dictionary learning for scene classification using very high resolution remote sensing images[J]. IET Comput Vis 9(5):639–647. https://doi.org/10.1049/iet-cvi.2014.0270
Article Google Scholar
Cheng G, Han J, Guo L (2015) Effective and efficient midlevel visual elements-oriented land-use classification using VHR remote sensing images[J]. IEEE Trans Geosci Remote Sens 53(8):4238–4249. https://doi.org/10.1109/tgrs.2015.2393857
Article Google Scholar
Cheng G, Ma C, Zhou P (2016) Scene classification of high resolution remote sensing images using convolutional neural networks[C]//2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS). IEEE, 767–770. doi:https://doi.org/10.1109/igarss.2016.7729193
Cheng G, Han J, Lu X (2017) Remote sensing image scene classification: benchmark and state of the art[J]. Proc IEEE 105(10):1865–1883. https://doi.org/10.1109/tgrs.2017.2709802
Article Google Scholar
Cheriyadat AM (2013) Unsupervised feature learning for aerial scene classification[J]. IEEE Trans Geosci Remote Sens 52(1):439–451. https://doi.org/10.1109/tgrs.2013.2241444
Article Google Scholar
Deng J, Dong W, Socher R (2009) Imagenet: a large-scale hierarchical image database[C]//2009 IEEE conference on computer vision and pattern recognition. IEEE:248–255. https://doi.org/10.1109/cvprw.2009.5206848
dos Santos JA, Penatti OAB, da Silva Torres R (2010) Evaluating the potential of texture and color descriptors for remote sensing image retrieval and classification[C]//VISAPP (2),203–208. doi:https://doi.org/10.5220/0002843402030208
Fang Z, Li W, Zou J (2016) Using CNN-based high-level features for remote sensing scene classification[C]//2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS). IEEE, 2610–2613. doi:https://doi.org/10.1109/igarss.2016.7729674
He K, Zhang X, Ren S (2016) Deep residual learning for image recognition[C]//proceedings of the IEEE conference on computer vision and pattern recognition, 770–778. doi:https://doi.org/10.1109/cvpr.2016.90
He N, Fang L, Li S, Plaza J, Plaza A (2020) Skip-connected covariance network for remote sensing scene classification. IEEE Trans Neural Networks Learn Syst 31(5):1461–1474
Article Google Scholar
Jain AK, Ratha NK, Lakshmanan S (1997) Object detection using Gabor filters[J]. Pattern Recogn 30(2):295–309. https://doi.org/10.1016/s0031-3203(96)00068-4
Article Google Scholar
Jiang B., Li X., Sun T. (2018). A decision-level fusion method based on convolutional neural networks for remote sensing scene classification[C]//2018 IEEE 3rd advanced information technology, electronic and automation control conference (IAEAC). IEEE, 128-132. doi:https://doi.org/10.1109/iaeac.2018.8577603.
Kapoor R, Gupta R, Kumar R (2019) Fog removal in images using improved dark channel prior and contrast limited adaptive histogram equalization[J]. Multimed Tools Appl 78(16):23281–23307. https://doi.org/10.1109/cisp.2009.5301485
Article Google Scholar
Li P, Xie J, Wang Q (2017) Is second-order information helpful for large-scale visual recognition?[C]//proceedings of the IEEE international conference on computer vision, 2070–2078. doi:https://doi.org/10.1109/iccv.2017.228
Li P, Xie J, Wang Q (2018) Towards faster training of global covariance pooling networks by iterative matrix square root normalization[C]//proceedings of the IEEE conference on computer vision and pattern recognition, 947–955. doi:https://doi.org/10.1109/cvpr.2018.00105
Li X, Jiang B, Wang S (2019) A review and comparison of optical remote sensing scene classification[J]. Radio. Engineering 49(4):265–271. https://doi.org/10.3969/j.issn.1003-3106.2019.04.01
Article Google Scholar
Lin M, Chen Q, Yan S (2013) Network in network[J]. arXiv preprint arXiv:1312.4400. arXiv:1312.4400v3
Liu Q, Hang R, Song H (2017) Learning multiscale deep features for high-resolution satellite image scene classification[J]. IEEE Trans Geosci Remote Sens 56(1):117–126. https://doi.org/10.1109/tgrs.2017.2743243
Article Google Scholar
Meshgini S, Aghagolzadeh A, Seyedarabi H (2012) Face recognition using gabor filter bank, kernel principle component analysis and support vector machine[J]. Int J Comput Theory Eng 4(5):767–771. https://doi.org/10.7763/ijcte.2012.v4.574
Article Google Scholar
Nanjun H, Leyuan F, Shutao L (2018) Remote Sensing Scene Classification Using Multilayer Stacked Covariance Pooling[J]. IEEE Trans Geosci Remote Sens. 1–12. doi:https://doi.org/10.1109/tgrs.2018.2845668
Nogueira K, Penatti OAB, dos Santos JA (2017) Towards better exploiting convolutional neural networks for remote sensing scene classification[J]. Pattern Recognit 61:539–556. https://doi.org/10.1016/j.patcog.2016.07.001
Article Google Scholar
Ojala T, Pietikäinen M, Mäenpää T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns[J]. IEEE Trans Pattern Analys Machine Intell 7:971–987. https://doi.org/10.1109/tpami.2002.1017623
Article MATH Google Scholar
Othman E, Bazi Y, Alajlan N (2016) Using convolutional features and a sparse autoencoder for land-use scene classification[J]. Int J Remote Sens 37(10):2149–2167. https://doi.org/10.1080/01431161.2016.1171928
Article Google Scholar
Penatti OAB, Nogueira K, Dos Santos JA (2015) Do deep features generalize from everyday objects to remote sensing and aerial scenes domains?[C]//proceedings of the IEEE conference on computer vision and pattern recognition workshops, 4451. doi:https://doi.org/10.1109/cvprw.2015.7301382
Ren W, Ma L, Zhang J, et al. (2018) Gated fusion network for single image dehazing[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3253–3261
Ren W., Pan J., Zhang H., Cao X., Yang M.H. Single Image Dehazing via Multi-scale Convolutional Neural Networks with Holistic Edges[J]. Int J Comput Vis, 2020, 128(8), 128, 259.
Sheng G, Yang W, Xu T (2012) High-resolution satellite scene classification using a sparse coding based multiple feature combination[J]. Int J Remote Sens 33(8):2395–2412. https://doi.org/10.1080/01431161.2011.608740
Article Google Scholar
Swain MJ, Ballard DH (1991) Color indexing[J]. Int J Comput Vis 7(1):11–32. https://doi.org/10.1007/BF00130487
Article Google Scholar
Wang EK, Li Y, Nie Z, Yu J, Liang Z, Zhang X, Yiu SM (2019) Deep fusion feature based object detection method for high resolutionoptical remote sensing images. Appl Sci 9(6):1130
Article Google Scholar
Wang S, Guan Y, Shao L (2020) Multi-granularity canonical appearance pooling for remote sensing scene classification. IEEE Trans Image Process 29:5396–5407
Article MathSciNet Google Scholar
Wu H, Liu B, Su W (2016) Hierarchical coding vectors for scene level land-use classification[J]. Remote Sens 8(5):436. https://doi.org/10.3390/rs8050436
Article Google Scholar
Xu SH, Mu XD, Zhao P (2016) Scene classification of remote sensing image based on multi-scale feature and deep neural network[J]. Acta Geodaetica et Cartographica Sinica. China 45(7):834–840. https://doi.org/10.11947/j.AGCS.2016.20150623
Article Google Scholar
Yang Y, Newsam S (2010) Bag-of-visual-words and spatial extensions for land-use classification[C]//Proceedings of the 18th SIGSPATIAL international conference on advances in geographic information systems. ACM, 270–279. doi:https://doi.org/10.1145/1869790.1869829
Yao W, Loffeld O, Datcu M (2016) Application and evaluation of a hierarchical patch clustering method for remote sensing images[J]. IEEE J Select Topics Appl Earth Observ Remote Sens 9(6):2279–2289. https://doi.org/10.1109/jstars.2016.2536143
Article Google Scholar
Zhang, Hua, et al (2016) Sketchnet: Sketch classification with web images." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
Zhao B, Zhong Y, Zhang L (2016) A spectral–structural bag-of-features scene classifier for very high spatial resolution remote sensing imagery[J]. ISPRS J Photogramm Remote Sens 116:73–85. https://doi.org/10.1016/j.isprsjprs.2016.03.004
Article Google Scholar
Zheng X, Sun X, Fu K (2012) Automatic annotation of satellite images via multifeature joint sparse coding with spatial relation constraint[J]. IEEE Geosci Remote Sens Lett 10(4):652–656. https://doi.org/10.1109/lgrs.2012.2216499
Article Google Scholar
Zhong Y, Zhu Q, Zhang L (2015) Scene classification based on the multifeature fusion probabilistic topic model for high spatial resolution remote sensing imagery[J]. IEEE Trans Geosci Remote Sens 53(11):6207–6222. https://doi.org/10.1109/tgrs.2015.2435801
Article Google Scholar
Zhu Q, Zhong Y, Zhao B (2016) Bag-of-visual-words scene classifier with local and global features for high spatial resolution remote sensing imagery[J]. IEEE Geosci Remote Sens Lett 13(6):747–751. https://doi.org/10.1109/lgrs.2015.2513443
Article Google Scholar
Zhu Q, Hong Y, Zhang L (2017) Scene classification based on the fully sparse semantic topic model[J]. IEEE Trans Geosci Remote Sens 55(10):5525–5538. https://doi.org/10.1109/tgrs.2017.2709802
Article Google Scholar

Download references

Acknowledgements

Firstly, the author would like to thank the provider of UC Merced Land Use dataset and the support of experimental equipment provided by the Institute of Artificial Intelligence Application of Central South University of Forestry and Technology. Meanwhile, the author would also like to thank the editor and anonymous reviewers for their constructive suggestions, which significantly improved this paper.

I hereby express gratitude to my dear partner Yuan Gao, without his effort, this paper cannot be accomplished. In the process of compilation, he made great contribution on data collecting and analyze. Besides, he completed the Section 4 and 5 by himself.

This work was supported by the National 948 Project of China: Forest Fire Prediction and Fire Fighting Resource Dispatching Technology under Grant 2014-4-09, National Natural Science Foundation of China (Grant no. 61602528), the Hunan Provincial Natural Science Foundation of China (Grant no. 2017JJ3527), and Graduate Innovation Fund of Central South University of Forestry and Technology (20183033).

Author information

Authors and Affiliations

Institute of Artificial Intelligence Application, College of Computer and Information Engineering, Central South University of Forestry and Technology, Changsha, 410004, China
Wenjie Chen, Yuan Gao, Aibin Chen, Guoxiong Zhou, Jianwu Wang, Xiaobo Yang & RunDong Jiang
Hunan Provincial Key Laboratory of Urban Forest Ecology, Central South University of Forestry and Technology, Changsha, 410004, China
Wenjie Chen, Yuan Gao, Aibin Chen, Guoxiong Zhou, Jianwu Wang, Xiaobo Yang & RunDong Jiang
Huangfeng Bridge State-Owned Forest Farm, Youxian County, Zhuzhou, 412000, China
Wenjie Chen, Yuan Gao, Aibin Chen, Guoxiong Zhou, Jianwu Wang, Xiaobo Yang & RunDong Jiang
Central South University of Forestry and Technology, Changsha, 410000, Hunan, China
Aibin Chen

Authors

Wenjie Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Gao
View author publications
You can also search for this author in PubMed Google Scholar
Aibin Chen
View author publications
You can also search for this author in PubMed Google Scholar
Guoxiong Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Jianwu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaobo Yang
View author publications
You can also search for this author in PubMed Google Scholar
RunDong Jiang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Aibin Chen.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Yuan G is the Joint first author

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, W., Gao, Y., Chen, A. et al. Remote sensing scene classification with multi-spatial scale frequency covariance pooling. Multimed Tools Appl 81, 30413–30435 (2022). https://doi.org/10.1007/s11042-022-12603-x

Download citation

Received: 10 November 2020
Revised: 09 March 2021
Accepted: 09 February 2022
Published: 06 April 2022
Issue Date: September 2022
DOI: https://doi.org/10.1007/s11042-022-12603-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Remote sensing scene classification with multi-spatial scale frequency covariance pooling

Abstract

Access this article

Similar content being viewed by others

Multi-deep features fusion for high-resolution remote sensing image scene classification

High-Resolution Remote Sensing Image Scene Classification by Merging Multilevel Features of Convolutional Neural Networks

Remote Sensing Image Fusion Using Multi-Scale Convolutional Neural Network

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Remote sensing scene classification with multi-spatial scale frequency covariance pooling

Abstract

Access this article

Similar content being viewed by others

Multi-deep features fusion for high-resolution remote sensing image scene classification

High-Resolution Remote Sensing Image Scene Classification by Merging Multilevel Features of Convolutional Neural Networks

Remote Sensing Image Fusion Using Multi-Scale Convolutional Neural Network

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation