Abstract
Wine label retrieval is key to automatic wine brand search through the web or mobile phone in our daily life. In comparison with the general image retrieval tasks, it is a rather challenging problem with a huge number of unbalanced wine brand images. In this paper, we propose a CNN-SIFT Consecutive Searching and Matching (CSCSM) framework for wine label retrieval. In particular, a CNN is trained to recognize the main-brand (manufacturer) for narrowing the searching range, while the SIFT descriptor is improved by adopting the RANSAC and TF-IDF mechanisms to match the final sub-brand (item attribute under the manufacture). The experiments are conducted on a dataset containing approximately 548k images of wine labels with 17, 328 main-brands and 260, 579 sub-brands. It is demonstrated by the experimental results that our proposed CSCSM method can solve the wine label retrieval problem effectively and efficiently and outperform the competitive methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Lim, J., Kim, S., Park, J.H., Lee, G.S., Yang, H.J., Lee, C.W.: Recognition of text in wine label images. In: Chinese Conference on Pattern Recognition, pp. 1–5 (2009)
Wu, M.Y., Lee, J.H., Kuo, S.W.: A hierarchical feature search method for wine label image recognition. In: International Conference on Telecommunications and Signal Processing, pp. 568–572 (2015)
Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 137–142. Springer, Heidelberg (1998). https://doi.org/10.1007/BFb0026683
Wu, H.C., Luk, R.W.P., Wong, K.F.: Interpreting TF-IDF term weights as making relevance decisions. ACM Trans. Inf. Syst. 26(3), 13 (2008)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)
Jegou, H., Douze, M., Schmid, C., Perez, P.: Aggregating local descriptors into a compact image representation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3304–3311 (2010)
Peng, K., Chen, X., Zhou, D., Liu, Y.: 3D reconstruction based on SIFT and Harris feature points. In: 2009 IEEE International Conference on Robotics and Biomimetics, pp. 960–964 (2009)
Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: Proceedings Ninth IEEE International Conference on Computer Vision, p. 1470 (2003)
Zhou, W., Li, H., Hong, R., Lu, Y., Tian, Q.: BSIFT: toward data-independent codebook for large scale image search. IEEE Trans. Image Process. 24(3), 967–979 (2015)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR abs/1409.1556 (2014). http://arxiv.org/abs/1409.1556
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Xie, S., Girshick, R., Dollar, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1492–1500 (2017)
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818–2826 (2016)
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
Swets, D.L., Weng, J.J.: Using discriminant eigenfeatures for image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 18(8), 831–836 (1996)
Tieu, K., Viola, P.: Boosting image retrieval: special issue on content-based image retrieval. Int. J. Comput. Vision 56(1–2), 17–36 (2004)
Wei, W., Jun, H., Yiping, T.: Image matching for geomorphic measurement based on SIFT and RANSAC methods. In: 2008 International Conference on Computer Science and Software Engineering, vol. 2, pp. 317–320 (2008)
Azizpour, H., Razavian, A.S., Sullivan, J., Maki, A., Carlsson, S.: Factors of transferability for a generic convnet representation. IEEE Trans. Pattern Anal. Mach. Intell. 38(9), 1790–1802 (2015)
Acknowledgment
This work is supported by the Natural Science Foundation of China for Grand U1604153.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Li, X., Yang, J., Ma, J. (2019). CNN-SIFT Consecutive Searching and Matching for Wine Label Retrieval. In: Huang, DS., Bevilacqua, V., Premaratne, P. (eds) Intelligent Computing Theories and Application. ICIC 2019. Lecture Notes in Computer Science(), vol 11643. Springer, Cham. https://doi.org/10.1007/978-3-030-26763-6_24
Download citation
DOI: https://doi.org/10.1007/978-3-030-26763-6_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26762-9
Online ISBN: 978-3-030-26763-6
eBook Packages: Computer ScienceComputer Science (R0)