Skip to main content

CNN-SIFT Consecutive Searching and Matching for Wine Label Retrieval

  • Conference paper
  • First Online:
Book cover Intelligent Computing Theories and Application (ICIC 2019)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11643))

Included in the following conference series:

Abstract

Wine label retrieval is key to automatic wine brand search through the web or mobile phone in our daily life. In comparison with the general image retrieval tasks, it is a rather challenging problem with a huge number of unbalanced wine brand images. In this paper, we propose a CNN-SIFT Consecutive Searching and Matching (CSCSM) framework for wine label retrieval. In particular, a CNN is trained to recognize the main-brand (manufacturer) for narrowing the searching range, while the SIFT descriptor is improved by adopting the RANSAC and TF-IDF mechanisms to match the final sub-brand (item attribute under the manufacture). The experiments are conducted on a dataset containing approximately 548k images of wine labels with 17, 328 main-brands and 260, 579 sub-brands. It is demonstrated by the experimental results that our proposed CSCSM method can solve the wine label retrieval problem effectively and efficiently and outperform the competitive methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Lim, J., Kim, S., Park, J.H., Lee, G.S., Yang, H.J., Lee, C.W.: Recognition of text in wine label images. In: Chinese Conference on Pattern Recognition, pp. 1–5 (2009)

    Google Scholar 

  2. Wu, M.Y., Lee, J.H., Kuo, S.W.: A hierarchical feature search method for wine label image recognition. In: International Conference on Telecommunications and Signal Processing, pp. 568–572 (2015)

    Google Scholar 

  3. Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 137–142. Springer, Heidelberg (1998). https://doi.org/10.1007/BFb0026683

    Chapter  Google Scholar 

  4. Wu, H.C., Luk, R.W.P., Wong, K.F.: Interpreting TF-IDF term weights as making relevance decisions. ACM Trans. Inf. Syst. 26(3), 13 (2008)

    Article  Google Scholar 

  5. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)

    Article  Google Scholar 

  6. Jegou, H., Douze, M., Schmid, C., Perez, P.: Aggregating local descriptors into a compact image representation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3304–3311 (2010)

    Google Scholar 

  7. Peng, K., Chen, X., Zhou, D., Liu, Y.: 3D reconstruction based on SIFT and Harris feature points. In: 2009 IEEE International Conference on Robotics and Biomimetics, pp. 960–964 (2009)

    Google Scholar 

  8. Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: Proceedings Ninth IEEE International Conference on Computer Vision, p. 1470 (2003)

    Google Scholar 

  9. Zhou, W., Li, H., Hong, R., Lu, Y., Tian, Q.: BSIFT: toward data-independent codebook for large scale image search. IEEE Trans. Image Process. 24(3), 967–979 (2015)

    Article  MathSciNet  Google Scholar 

  10. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)

    Google Scholar 

  11. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR abs/1409.1556 (2014). http://arxiv.org/abs/1409.1556

  12. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)

    Google Scholar 

  13. Xie, S., Girshick, R., Dollar, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1492–1500 (2017)

    Google Scholar 

  14. Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818–2826 (2016)

    Google Scholar 

  15. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)

    Google Scholar 

  16. Swets, D.L., Weng, J.J.: Using discriminant eigenfeatures for image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 18(8), 831–836 (1996)

    Article  Google Scholar 

  17. Tieu, K., Viola, P.: Boosting image retrieval: special issue on content-based image retrieval. Int. J. Comput. Vision 56(1–2), 17–36 (2004)

    Article  Google Scholar 

  18. Wei, W., Jun, H., Yiping, T.: Image matching for geomorphic measurement based on SIFT and RANSAC methods. In: 2008 International Conference on Computer Science and Software Engineering, vol. 2, pp. 317–320 (2008)

    Google Scholar 

  19. Azizpour, H., Razavian, A.S., Sullivan, J., Maki, A., Carlsson, S.: Factors of transferability for a generic convnet representation. IEEE Trans. Pattern Anal. Mach. Intell. 38(9), 1790–1802 (2015)

    Article  Google Scholar 

Download references

Acknowledgment

This work is supported by the Natural Science Foundation of China for Grand U1604153.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jinwen Ma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Li, X., Yang, J., Ma, J. (2019). CNN-SIFT Consecutive Searching and Matching for Wine Label Retrieval. In: Huang, DS., Bevilacqua, V., Premaratne, P. (eds) Intelligent Computing Theories and Application. ICIC 2019. Lecture Notes in Computer Science(), vol 11643. Springer, Cham. https://doi.org/10.1007/978-3-030-26763-6_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-26763-6_24

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-26762-9

  • Online ISBN: 978-3-030-26763-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics