CNN-SIFT Consecutive Searching and Matching for Wine Label Retrieval

Li, Xiaoqing; Yang, Jiansheng; Ma, Jinwen

doi:10.1007/978-3-030-26763-6_24

Xiaoqing Li¹¹,
Jiansheng Yang¹¹ &
Jinwen Ma¹¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11643))

Included in the following conference series:

International Conference on Intelligent Computing

1538 Accesses
3 Citations

Abstract

Wine label retrieval is key to automatic wine brand search through the web or mobile phone in our daily life. In comparison with the general image retrieval tasks, it is a rather challenging problem with a huge number of unbalanced wine brand images. In this paper, we propose a CNN-SIFT Consecutive Searching and Matching (CSCSM) framework for wine label retrieval. In particular, a CNN is trained to recognize the main-brand (manufacturer) for narrowing the searching range, while the SIFT descriptor is improved by adopting the RANSAC and TF-IDF mechanisms to match the final sub-brand (item attribute under the manufacture). The experiments are conducted on a dataset containing approximately 548k images of wine labels with 17, 328 main-brands and 260, 579 sub-brands. It is demonstrated by the experimental results that our proposed CSCSM method can solve the wine label retrieval problem effectively and efficiently and outperform the competitive methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Lim, J., Kim, S., Park, J.H., Lee, G.S., Yang, H.J., Lee, C.W.: Recognition of text in wine label images. In: Chinese Conference on Pattern Recognition, pp. 1–5 (2009)
Google Scholar
Wu, M.Y., Lee, J.H., Kuo, S.W.: A hierarchical feature search method for wine label image recognition. In: International Conference on Telecommunications and Signal Processing, pp. 568–572 (2015)
Google Scholar
Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 137–142. Springer, Heidelberg (1998). https://doi.org/10.1007/BFb0026683
Chapter Google Scholar
Wu, H.C., Luk, R.W.P., Wong, K.F.: Interpreting TF-IDF term weights as making relevance decisions. ACM Trans. Inf. Syst. 26(3), 13 (2008)
Article Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)
Article Google Scholar
Jegou, H., Douze, M., Schmid, C., Perez, P.: Aggregating local descriptors into a compact image representation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3304–3311 (2010)
Google Scholar
Peng, K., Chen, X., Zhou, D., Liu, Y.: 3D reconstruction based on SIFT and Harris feature points. In: 2009 IEEE International Conference on Robotics and Biomimetics, pp. 960–964 (2009)
Google Scholar
Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: Proceedings Ninth IEEE International Conference on Computer Vision, p. 1470 (2003)
Google Scholar
Zhou, W., Li, H., Hong, R., Lu, Y., Tian, Q.: BSIFT: toward data-independent codebook for large scale image search. IEEE Trans. Image Process. 24(3), 967–979 (2015)
Article MathSciNet Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR abs/1409.1556 (2014). http://arxiv.org/abs/1409.1556
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Xie, S., Girshick, R., Dollar, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1492–1500 (2017)
Google Scholar
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818–2826 (2016)
Google Scholar
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
Google Scholar
Swets, D.L., Weng, J.J.: Using discriminant eigenfeatures for image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 18(8), 831–836 (1996)
Article Google Scholar
Tieu, K., Viola, P.: Boosting image retrieval: special issue on content-based image retrieval. Int. J. Comput. Vision 56(1–2), 17–36 (2004)
Article Google Scholar
Wei, W., Jun, H., Yiping, T.: Image matching for geomorphic measurement based on SIFT and RANSAC methods. In: 2008 International Conference on Computer Science and Software Engineering, vol. 2, pp. 317–320 (2008)
Google Scholar
Azizpour, H., Razavian, A.S., Sullivan, J., Maki, A., Carlsson, S.: Factors of transferability for a generic convnet representation. IEEE Trans. Pattern Anal. Mach. Intell. 38(9), 1790–1802 (2015)
Article Google Scholar

Download references

Acknowledgment

This work is supported by the Natural Science Foundation of China for Grand U1604153.

Author information

Authors and Affiliations

Department of Information Science, School of Mathematical Sciences and LMAM, Peking University, Beijing, 100871, China
Xiaoqing Li, Jiansheng Yang & Jinwen Ma

Authors

Xiaoqing Li
View author publications
You can also search for this author in PubMed Google Scholar
Jiansheng Yang
View author publications
You can also search for this author in PubMed Google Scholar
Jinwen Ma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jinwen Ma .

Editor information

Editors and Affiliations

Tongji University, Shanghai, China
De-Shuang Huang
Polytechnic University of Bari, Bari, Italy
Vitoantonio Bevilacqua
University of Wollongong, North Wollongong, NSW, Australia
Prashan Premaratne

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, X., Yang, J., Ma, J. (2019). CNN-SIFT Consecutive Searching and Matching for Wine Label Retrieval. In: Huang, DS., Bevilacqua, V., Premaratne, P. (eds) Intelligent Computing Theories and Application. ICIC 2019. Lecture Notes in Computer Science(), vol 11643. Springer, Cham. https://doi.org/10.1007/978-3-030-26763-6_24

Download citation

DOI: https://doi.org/10.1007/978-3-030-26763-6_24
Published: 24 July 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26762-9
Online ISBN: 978-3-030-26763-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics