ABSTRACT
Scene classification is one of critical tasks in the interpretation of high-resolution remote sensing (HRRS) imagery. Most of the existing methods focus on learning efficient feature representations. Recently, deep convolutional neural network (CNN) shows its effectiveness in extracting feature of images, which is widely used in remote sensing. In this paper, we propose a novel feature representation method which called deep differential coding (DDC) for remote sensing scene classification. Firstly, we get convolutional feature maps from image using the pre-trained CNN model. Then, the dense features are encoded through our proposed differential coding (DC). Finally, features are obtained by max-pooling and sum-pooling. We extract features from the whole remote sensing image and object region of every image by our proposed DDC and the final global feature is the combination of two kinds of features. We have performed extensive experiments on two public datasets and the results demonstrate that our proposed method has an excellent performance compared with state-of-the-art methods.
- Zhao, L.-J., Tang, P., and Huo, L.-Z., 2014. Land-use scene classification using a concentric circle-structured multiscale bag-of-visual-words model. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 7, 12, 4620--4631.Google ScholarCross Ref
- Yang, Y. and Newsam, S., 2010. Bag-of-visual-words and spatial extensions for land-use classification. In Proceedings of the 18th SIGSPATIAL international conference on advances in geographic information systems ACM, 270--279. Google ScholarDigital Library
- Chen, S. and Tian, Y., 2015. Pyramid of spatial relations for scene-level land use classification. IEEE Transactions on Geoscience and Remote Sensing 53, 4, 1947--1957.Google ScholarCross Ref
- Lazebnik, S., Schmid, C., and Ponce, J., 2006. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In Computer vision and pattern recognition, 2006 IEEE computer society conference on IEEE, 2169--2178. Google ScholarDigital Library
- Lowe, D.G., 2004. Distinctive image features from scale-invariant keypoints. International journal of computer vision 60, 2, 91--110. Google ScholarDigital Library
- Hu, F., Xia, G.-S., Wang, Z., Huang, X., Zhang, L., and SUN, H., 2015. Unsupervised feature learning via spectral clustering of multidimensional patches for remotely sensed scene classification. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 8, 5.Google ScholarCross Ref
- Krizhevsky, A., Sutskever, I., and Hinton, G.E., 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, 1097--1105. Google ScholarDigital Library
- Simonyan, K. and Zisserman, A., 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.Google Scholar
- Zhang, F., Du, B., and Zhang, L., 2015. Saliency-guided unsupervised feature learning for scene classification. IEEE Transactions on Geoscience and Remote Sensing 53, 4, 2175--2184.Google ScholarCross Ref
- Zou, Q., Ni, L., Zhang, T., and Wang, Q., 2015. Deep learning based feature selection for remote sensing scene classification. IEEE Geoscience and Remote Sensing Letters 12, 11, 2321--2325.Google ScholarCross Ref
- Zhang, F., Du, B., and Zhang, L., 2016. Scene classification via a gradient boosting random convolutional network framework. IEEE Transactions on Geoscience and Remote Sensing 54, 3, 1793--1802.Google ScholarCross Ref
- Ioffe, S. and Szegedy, C., 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International Conference on Machine Learning, 448--456. Google ScholarDigital Library
- Long, J., Shelhamer, E., and Darrell, T., 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3431--3440.Google Scholar
- Zhao, F., Huang, Y., Wang, L., and Tan, T., 2015. Deep semantic ranking based hashing for multi-label image retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1556--1564.Google Scholar
- Hu, F., Xia, G.-S., Hu, J., and Zhang, L., 2015. Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery. Remote Sensing 7, 11, 14680--14707.Google ScholarCross Ref
- Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei-fei, L., 2009. Image-net: A large-scale hierarchical image database. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on IEEE, 248--255.Google Scholar
- Marmanis, D., Datcu, M., Esch, T., and Stilla, U., 2016. Deep learning earth observation classification using ImageNet pre-trained networks. IEEE Geoscience and Remote Sensing Letters 13, 1, 105--109.Google ScholarCross Ref
- Zhou, W., Shao, Z., and Cheng, Q., 2016. Deep feature representations for high-resolution remote sensing scene classification. In Earth Observation and Remote Sensing Applications (EORSA), 2016 4th International Workshop on IEEE, 338--342.Google Scholar
- Cheng, G., Ma, C., Zhou, P., Yao, X., and Han, J., 2016. Scene classification of high resolution remote sensing images using convolutional neural networks. In Geoscience and Remote Sensing Symposium (IGARSS), 2016 IEEE International IEEE, 767--770.Google Scholar
- Hu, F., Xia, G.-S., and Zhang, L., 2016. Deep sparse representations for land-use scene classification in remote sensing images. In Signal Processing (ICSP), 2016 IEEE 13th International Conference on IEEE, 192--197.Google Scholar
- Girshick, R., Donahue, J., Darrell, T., and Malik, J., 2014. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, 580--587. Google ScholarDigital Library
- He, K., Zhang, X., Ren, S., and Sun, J., 2014. Spatial pyramid pooling in deep convolutional networks for visual recognition. In European Conference on Computer Vision Springer, 346--361.Google Scholar
- Lecun, Y., Bengio, Y., and Hinton, G., 2015. Deep learning. Nature 521, 7553, 436--444.Google Scholar
- Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., and Darrell, T., 2014. Caffe: Convolutional architecture for fast feature embedding. In Proceedings of the 22nd ACM international conference on Multimedia ACM, 675--678. Google ScholarDigital Library
- Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., and Lecun, Y., 2013. Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint arXiv:1312.6229.Google Scholar
- Zeiler, M.D. and Fergus, R., 2014. Visualizing and understanding convolutional networks. In European conference on computer vision Springer, 818--833.Google Scholar
- Xia, G.-S., Yang, W., Delon, J., Gousseau, Y., Sun, H., and Maître, H., 2010. Structural high-resolution satellite image indexing. In ISPRS TC VII Symposium-100 Years ISPRS, 298--303.Google Scholar
- Vedaldi, A. and Lenc, K., 2015. Matconvnet: Convolutional neural networks for MATLAB. In Proceedings of the 23rd ACM international conference on Multimedia ACM, 689--692. Google ScholarDigital Library
- Zhong, Y., Zhu, Q., and Zhang, L., 2015. Scene classification based on the multifeature fusion probabilistic topic model for high spatial resolution remote sensing imagery. IEEE Transactions on Geoscience and Remote Sensing 53, 11, 6207--6222.Google ScholarCross Ref
- Wu, H., Liu, B., Su, W., Zhang, W., and Sun, J., 2016. Deep filter banks for land-use scene classification. IEEE Geoscience and Remote Sensing Letters 13, 12, 1895--1899.Google ScholarCross Ref
- Zhu, Q., Zhong, Y., Zhao, B., Xia, G.-S., and Zhang, L., 2016. Bag-of-visual-words scene classifier with local and global features for high spatial resolution remote sensing imagery. IEEE Geoscience and Remote Sensing Letters 13, 6, 747--751.Google ScholarCross Ref
- Yuan, L.-X., Li, S.-J., and Jiang, Y.-P., 2017. Remote Sensing Scene Classification Using a Preclassification Strategy and an Improved Structural Feature. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 10, 9, 4094--4103.Google ScholarCross Ref
- Penatti, O.A., Nogueira, K., and Dossantos, J.A., 2015. Do deep features generalize from everyday objects to remote sensing and aerial scenes domains? In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 44--51.Google Scholar
- Chaib, S., Liu, H., Gu, Y., and Yao, H., 2017. Deep Feature Fusion for VHR Remote Sensing Scene Classification. IEEE Transactions on Geoscience and Remote Sensing 55, 8, 4775--4784.Google ScholarCross Ref
- Shao, W., Yang, W., Xia, G.-S., and LIU, G., 2013. A hierarchical scheme of multiple feature fusion for high-resolution satellite scene categorization. In International Conference on Computer Vision Systems Springer, 324--333. Google ScholarDigital Library
- Huang, L., Chen, C., Li, W., and Du, Q., 2016. Remote sensing image scene classification using multi-scale completed local binary patterns and fisher vectors. Remote Sensing 8, 6, 483.Google ScholarCross Ref
- Xia, G.-S., Hu, J., Hu, F., Shi, B., Bai, X., Zhong, Y., Zhang, L., and Lu, X., 2017. AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification. IEEE Transactions on Geoscience and Remote Sensing.Google ScholarCross Ref
- Bian, X., Chen, C., Tian, L., and Du, Q., 2017. Fusing Local and Global Features for High-Resolution Scene Classification. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 10, 6, 2889--2901.Google ScholarCross Ref
Index Terms
- Deep Differential Coding for High-Resolution Remote Sensing Scene Classification
Recommendations
High-Resolution Remote Sensing Scene Classification Using Improved LBP and SDSAE
ICACS '18: Proceedings of the 2nd International Conference on Algorithms, Computing and SystemsClassifying high-resolution remote sensing scene images with high accuracy is the challenging issues. The key of scene classification is to find effective features of the scene image. Low-level visual feature methods, such as local binary pattern (LBP), ...
Exploiting representations from pre-trained convolutional neural networks for high-resolution remote sensing image retrieval
With the increasing amount of high-resolution remote sensing images, it becomes more and more urgent to retrieve remote sensing images from large archives efficiently. The existing methods are mainly based on shallow features to retrieve images, while ...
Analysis of the inter-dataset representation ability of deep features for high spatial resolution remote sensing image scene classification
Recently, scene based classification has become a new trend for very high spatial resolution remote sensing image interpretation. With the advent of deep learning, the pretrained convolutional neural networks (CNNs) have been proved effective as feature ...
Comments