Skip to main content

Bionic Vision Descriptor for Image Retrieval

  • Conference paper
  • First Online:
  • 2251 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12532))

Abstract

Human visual system gets remarkable performance by processing low-level features. In the last decade, many descriptors have been proposed for feature extraction. However, fewer of them get satisfying performance with low-level features. Compared to high-level ones, low-level features make use of natural underlying elements like texture and they are extracted directly, which makes low-level features more efficient in image retrieval domains. In this paper, a new descriptor named Bionic Vision Descriptor (BVD), which is based on the principle of human visual system, is proposed. The descriptor fuses uniform low-level features extracted from color, texture and gradient elements. Moreover, matrix calculation and feature selection are utilized to accelerate the calculation of BVD. Experimental results show that our method outperforms other state-of-the-art traditional descriptors with less runtime and fewer initial dimensions on benchmark datasets.

This study was funded by National Natural Science Foundation of Peoples Republic of China(61672130, 61972064), The Fundamental Research Funds for the Central Universities(DUT19RC(3)012, DUT20RC(5)010) and LiaoNing Revitalization Talents Program(XLYC1806006).

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Babenko, A., Slesarev, A., Chigorin, A., Lempitsky, V.: Neural codes for image retrieval. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 584–599. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10590-1_38

    Chapter  Google Scholar 

  2. Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006). https://doi.org/10.1007/11744023_32

    Chapter  Google Scholar 

  3. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 886–893. IEEE (2005)

    Google Scholar 

  4. Di Zenzo, S.: A note on the gradient of a multi-image. Comput. Vis. Graph. Image Process. 33(1), 116–125 (1986)

    Article  Google Scholar 

  5. Ferman, A.M., Tekalp, A.M., Mehrotra, R.: Robust color histogram descriptors for video segment retrieval and identification. IEEE Trans. Image Process. 11(5), 497–508 (2002)

    Article  Google Scholar 

  6. Ge, T., Ke, Q., Sun, J.: Sparse-coded features for image retrieval. In: BMVC, pp. 132.1–132.11 (2013)

    Google Scholar 

  7. Gordoa, A., Rodríguez-Serrano, J.A., Perronnin, F., Valveny, E.: Leveraging category-level labels for instance-level image retrieval. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp. 3045–3052. IEEE (2012)

    Google Scholar 

  8. He, X., Cai, D., Niyogi, P.: Laplacian score for feature selection. In: Advances in Neural Information Processing Systems, pp. 507–514 (2006)

    Google Scholar 

  9. Jégou, H., Douze, M., Schmid, C.: Improving bag-of-features for large scale image search. Int. J. Comput. Vis. 87(3), 316–336 (2010)

    Article  Google Scholar 

  10. Jégou, H., Douze, M., Schmid, C., Pérez, P.: Aggregating local descriptors into a compact image representation. In: CVPR 2010–23rd IEEE Conference on Computer Vision and Pattern Recognition, pp. 3304–3311. IEEE Computer Society (2010)

    Google Scholar 

  11. Jolliffe, I.: Principal component analysis. Springer (2011). https://doi.org/10.1007/b98835

  12. Koffka, K.: Principles of Gestalt Psychology. Routledge, London (2013)

    Book  Google Scholar 

  13. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)

    Google Scholar 

  14. Lewis, D.E., Pearson, J., Khuu, S.K.: The color “fruit”: object memories defined by color. PloS One 8(5), e64960 (2013)

    Article  Google Scholar 

  15. Liu, C., Yuen, J., Torralba, A.: Sift flow: Dense correspondence across scenes and its applications. IEEE Trans. Pattern Anal. Mach. Intell. 33(5), 978–994 (2010)

    Article  Google Scholar 

  16. Liu, G.H., Li, Z.Y., Zhang, L., Xu, Y.: Image retrieval based on micro-structure descriptor. Pattern Recogn. 44(9), 2123–2133 (2011)

    Article  Google Scholar 

  17. Liu, G.H., Yang, J.Y.: Content-based image retrieval using color difference histogram. Pattern Recogn. 46(1), 188–198 (2013)

    Article  Google Scholar 

  18. Liu, H., Zhao, Q., Zhang, C., Mbelwa, J.T., Tang, S., Zhang, J.: Boosting vlad with weighted fusion of local descriptors for image retrieval. Multimedia Tools Appl. 78(9), 11835–11855 (2019)

    Article  Google Scholar 

  19. Liu, S., et al.: Color recognition for rubik’s cube robot. arXiv preprint arXiv:1901.03470 (2019)

  20. Liu, S., et al.: Perceptual uniform descriptor and ranking on manifold for image retrieval. Inf. Sci. 424, 235–249 (2018)

    Article  MathSciNet  Google Scholar 

  21. Liu, Y., Zhang, D., Lu, G., Ma, W.Y.: A survey of content-based image retrieval with high-level semantics. Pattern Recogn. 40(1), 262–282 (2007)

    Article  Google Scholar 

  22. Liu, Z., Li, H., Zhou, W., Rui, T., Tian, Q.: Making residual vector distribution uniform for distinctive image representation. IEEE Trans. Circuits Syst. Video Technol. 26(2), 375–384 (2015)

    Article  Google Scholar 

  23. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  24. Maaten, L.V.D., Hinton, G.: Visualizing data using t-sne. J. Mach. Learn. Res. 9, 2579–2605 (2008)

    MATH  Google Scholar 

  25. Ojala, T., Pietikäinen, M., Mäenpää, T.: Gray scale and rotation invariant texture classification with local binary patterns. In: Vernon, D. (ed.) ECCV 2000. LNCS, vol. 1842, pp. 404–420. Springer, Heidelberg (2000). https://doi.org/10.1007/3-540-45054-8_27

    Chapter  Google Scholar 

  26. Rocha, A., Goldenstein, S.K.: Multiclass from binary: expanding one-versus-all, one-versus-one and ecoc-based approaches. IEEE Trans. Neural Networks Learn. Syst. 25(2), 289–302 (2013)

    Article  Google Scholar 

  27. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)

  28. Sivic, J., Zisserman, A.: Video google: A text retrieval approach to object matching in videos. In: IEEE International Conference on Computer Vision, pp. 1470–1477. IEEE (2003)

    Google Scholar 

  29. Ullman, S., Assif, L., Fetaya, E., Harari, D.: Atoms of recognition in human and computer vision. Proc. Nat. Acad. Sci. 113(10), 2744–2749 (2016)

    Article  Google Scholar 

  30. Wengert, C., Douze, M., Jégou, H.: Bag-of-colors for improved image search. In: Proceedings of the 19th ACM international conference on Multimedia, pp. 1437–1440. ACM (2011)

    Google Scholar 

  31. Zheng, L., Wang, S., Liu, Z., Tian, Q.: Packing and padding: Coupled multi-index for accurate image retrieval. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1939–1946 (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shenglan Liu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Li, G., Liu, S., Wang, F., Feng, L. (2020). Bionic Vision Descriptor for Image Retrieval. In: Yang, H., Pasupa, K., Leung, A.CS., Kwok, J.T., Chan, J.H., King, I. (eds) Neural Information Processing. ICONIP 2020. Lecture Notes in Computer Science(), vol 12532. Springer, Cham. https://doi.org/10.1007/978-3-030-63830-6_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-63830-6_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-63829-0

  • Online ISBN: 978-3-030-63830-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics