Skip to main content
Log in

Geometric discriminative features for aerial image retrieval in social media

  • Special Issue Paper
  • Published:
Multimedia Systems Aims and scope Submit manuscript

    We’re sorry, something doesn't seem to be working properly.

    Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

The aerial image recognition is an important problem in multimedia information retrieval in social media. In this paper, we propose a new approach by integrating aerial image’s local features into a discriminative one which reflects both the geometric property and the color distribution of aerial image. Firstly, each aerial image is segmented into several regions in terms of their color intensities. And region connected graph (RCG), the links between the spatial neighboring regions, is presented to encode the spatial context of aerial images. Secondly, we mine frequent structures in the RCGs corresponding to training aerial images collected from social media. And a set of refined structures are selected among the frequent ones towards being more discriminative and less redundant. Finally, given a new aerial image, its sub-RCGs corresponding to all the refined structures are extracted and quantized into a discriminative feature for aerial image recognition. The experimental results validate the proposed method by providing a more accurate recognition result of the aerial images on different datasets from different social medias.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Blum, H.: Biological shape and visual science (part i). J. Theor. Biol. 38(2), 205–287 (1973)

    Article  Google Scholar 

  2. Duchenne, O., Joulin, A., Ponce, J.: A graph-matching kernel for object categorization. In: proceedings of the 2011 IEEE international conference on computer vision (ICCV), pp. 1792–1799 (2011)

  3. Gao, Y., Wang, M., Ji, R., Wu, X., Dai, Q.: 3d object retrieval with hausdorff distance learning. IEEE Trans. Ind. Electron. (2014)

  4. Gao, Y., Wang, M., Tao, D., Ji, R., Dai, Q.: 3-d object retrieval and recognition with hypergraph analysis. IEEE Trans. Image Process. 21(9), 4290–4303 (2012)

    Article  MathSciNet  Google Scholar 

  5. Gao, Y., Wang, M., Zha, Z.J., Shen, J., Li, X., Wu, X.: Visual-textual joint relevance learning for tag-based social image search. IEEE Trans. Image Process. 22(1), 363–376 (2013)

    Article  MathSciNet  Google Scholar 

  6. Gonzalez, R., Woods, R.E., Steven L.E.: Digital image processing using MATLAB. Prentice Hall (2003)

  7. Harchaoui, Z., Bach, F.: Image classification with segmentation graph kernels. In: proceeding of the IEEE conference on computer vision and pattern recognition pp. 1–8 (2007)

  8. Ji, R., Duan, L.Y., Chen, J., Yao, H., Yuan, J., Rui, Y., Gao, W.: Location discriminative vocabulary coding for mobile landmark search. Int. J. Comput. Vis. 96(3), 290–314 (2012)

    Article  MATH  Google Scholar 

  9. Ji, R., Gao, Y., Zhong, B., Tian, Q.: Mining city landmarks by modeling reconstruction sparsity. ACM Trans. Multimed. Comput. Commun. Appl. (TOMCCAP) 7(1), 31–52 (2011)

    Google Scholar 

  10. Johnson, A.E., Hebert, M.: Using spin images for efficient object recognition in cluttered 3d scenes. IEEE Trans. Pattern Anal. Mach. Intell. 21(5), 433–449 (1999)

    Article  Google Scholar 

  11. Kuramochi, M., Karypis, G.: An efficient algorithm for discovering frequent subgraphs. IEEE Trans. Knowl. Data Eng. 16(9), 1038–1051 (2004)

    Article  Google Scholar 

  12. Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: proceeding of the 2006 IEEE computer society conference on computer vision and pattern recognition, pp. 2169–2178 (2006)

  13. Lee, Y.J., Grauman, K.: Object-graphs for context-aware visual category discovery. IEEE Trans. Pattern Anal. Mach. Intell. 34(2), 346–358 (2012)

    Article  Google Scholar 

  14. Leiserson, C.E., Rivest, R.L., Stein, C., Cormen, T.H.: Introduction to algorithms. The MIT press (2001)

  15. Lin, L., Liu, X., Peng, S., Chao, H., Wang, Y., Jiang, B.: Object categorization with sketch representation and generalized samples. Pattern Recognit. 45(10), 3648–3660 (2012)

    Article  Google Scholar 

  16. Lin, L., Wu, T., Porway, J., Xu, Z.: A stochastic graph grammar for compositional object representation and recognition. Pattern Recognit. 42(7), 1297–1307 (2009)

    Article  MATH  Google Scholar 

  17. Maloof, M.A., Langley, P., Binford, T.O., Nevatia, R., Sage, S.: Improved rooftop detection in aerial images with machine learning. Mach. Learn. 53(1–2), 157–191 (2003)

    Article  Google Scholar 

  18. Porway, J., Wang, K., Yao, B., Zhu, S.: A hierarchical and contextual model for aerial image understanding. In: proceeding of the IEEE conference on computer vision and pattern recognition, pp. 1–8 (2008)

  19. Ullmann, J.R.: An algorithm for subgraph isomorphism. J. ACM 23(1), 31–42 (1976)

    Article  MathSciNet  Google Scholar 

  20. Wang, Q., Jiang, Z., Yang, J., Zhao, D., Shi, Z.: A hierarchical connection graph algorithm for gable-roof detection in aerial image. IEEE Geosci. Remote Sens. Lett. 8(1), 177–181 (2011)

    Article  Google Scholar 

  21. Yao, B., Yang, X., Zhu, S.: Introduction to a large-scale general purpose ground truth database: methodology, annotation tool and benchmarks. Energy minimization methods in computer vision and pattern recognition pp. 169–183 (2007)

  22. Yuan, X., Zhu, H., Yang, S.: A robust framework for eigenspace image reconstruction. In: proceeding of the seventh IEEE workshops on application of computer vision, pp. 54–59 (2005)

  23. Zhang, L., Bian, W., Song, M., Tao, D., Liu, X.: Integrating local features into discriminative graphlets for scene classification. In: proceedings of the 2011 18th international conference on neural information processing, pp. 657–666. Springer (2011)

  24. Zhang, L., Han, Y., Yang, Y., Song, M., Yan, S., Tian, Q.: Discovering discriminative graphlets for aerial image categories recognition. IEEE Trans. Image Process. 22(12), 5071–5084 (2013)

    Article  MathSciNet  Google Scholar 

  25. Zhang, L., Song, M., Li, N., Bu, J., Chen, C.: Feature selection for fast speech emotion recognition. In: proceedings of the 17th ACM international conference on multimedia, pp. 753–756 (2009)

  26. Zhang, L., Song, M., Sun, L., Liu, X., Wang, Y., Tao, D., Bu, J., Chen, C.: Spatial graphlet matching kernel for recognizing aerial image categories. In: proceedings of the 2012 21st international conference on pattern recognition (ICPR), pp. 2813–2816 (2012)

Download references

Acknowledgments

This paper draws on work supported in part by the following funds: National High Technology Research and Development Program of China (863 Program) under grant number 2011AA010101, National Natural Science Foundation of China under grant number 61002009 and 61304188, Key Science and Technology Program of Zhejiang Province of China under grant number 2012C01035-1, and Zhejiang Provincial Natural Science Foundation of China under grant number LZ13F020004 and LR14F020003, and College Students’ Activity Program in The Innovation of Science and Technology (Program of Xinmiao Talent) of Zhejiang Province under grant number ZX13005002047.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yingjie Xia.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xia, Y., Chen, J., Li, J. et al. Geometric discriminative features for aerial image retrieval in social media. Multimedia Systems 22, 497–507 (2016). https://doi.org/10.1007/s00530-014-0412-y

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00530-014-0412-y

Keywords