Abstract
Aerial image recognition is an important problem in multimedia information retrieval for social media. In this paper, we propose a new approach that integrates an aerial image's local features into a single discriminative feature reflecting both the geometric properties and the color distribution of the image. First, each aerial image is segmented into several regions according to color intensity, and a region connected graph (RCG), whose links join spatially neighboring regions, is constructed to encode the spatial context of the image. Second, we mine frequent structures in the RCGs of training aerial images collected from social media, and from these we select a set of refined structures that are more discriminative and less redundant. Finally, given a new aerial image, its sub-RCGs corresponding to all the refined structures are extracted and quantized into a discriminative feature for aerial image recognition. Experimental results on datasets from different social media platforms validate the proposed method, which yields more accurate recognition of aerial images.
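As a rough illustration of the first step, the sketch below builds a region connected graph from a segmentation label map. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names `build_rcg` and `edges`, the use of 4-connectivity to decide which regions are neighbors, and the toy label map are all hypothetical choices made for this example, and the label map is assumed to come from some color-based segmentation that is not shown here.

```python
from typing import Dict, List, Set, Tuple


def build_rcg(labels: List[List[int]]) -> Dict[int, Set[int]]:
    """Link every pair of regions that share a pixel border.

    `labels` is a 2-D grid of region ids (assumed output of a
    color-based segmentation). Adjacency uses 4-connectivity,
    which is an assumption of this sketch.
    """
    graph: Dict[int, Set[int]] = {}
    rows = len(labels)
    cols = len(labels[0]) if rows else 0
    for r in range(rows):
        for c in range(cols):
            a = labels[r][c]
            graph.setdefault(a, set())
            # Looking right and down visits each neighboring pixel pair once.
            for dr, dc in ((0, 1), (1, 0)):
                nr, nc = r + dr, c + dc
                if nr < rows and nc < cols:
                    b = labels[nr][nc]
                    if a != b:
                        graph[a].add(b)
                        graph.setdefault(b, set()).add(a)
    return graph


def edges(graph: Dict[int, Set[int]]) -> List[Tuple[int, int]]:
    """Return the undirected links as sorted (small id, large id) pairs."""
    return sorted((a, b) for a, nbrs in graph.items() for b in nbrs if a < b)


# Toy 3x3 label map with three regions (hypothetical data).
toy = [
    [0, 0, 1],
    [0, 2, 1],
    [2, 2, 1],
]
rcg = build_rcg(toy)
print(edges(rcg))
```

In this toy map every region touches the other two, so the resulting graph is a triangle. Frequent-structure mining and the selection of refined structures would operate on graphs of this kind, with each node additionally carrying a color descriptor of its region.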
Acknowledgments
This paper draws on work supported in part by the following funds: the National High Technology Research and Development Program of China (863 Program) under grant number 2011AA010101, the National Natural Science Foundation of China under grant numbers 61002009 and 61304188, the Key Science and Technology Program of Zhejiang Province of China under grant number 2012C01035-1, the Zhejiang Provincial Natural Science Foundation of China under grant numbers LZ13F020004 and LR14F020003, and the College Students' Activity Program in the Innovation of Science and Technology (Program of Xinmiao Talent) of Zhejiang Province under grant number ZX13005002047.
Cite this article
Xia, Y., Chen, J., Li, J. et al. Geometric discriminative features for aerial image retrieval in social media. Multimedia Systems 22, 497–507 (2016). https://doi.org/10.1007/s00530-014-0412-y
DOI: https://doi.org/10.1007/s00530-014-0412-y