Skip to main content
Log in

Efficient logo recognition by local feature groups

  • Regular Paper
  • Published:
Multimedia Systems Aims and scope Submit manuscript

Abstract

This paper presents a method for efficient and scalable logo recognition. Using generalized Hough transform to identify local features that are invariant across images, we can efficiently add spatial information into groups of local features and enhance the discriminative power of local feature. Our method is more flexible and efficient compared with state-of-the-art methods that merge features into groups. To fully exploit the information that different logo images provide, we employ a reference-based image representation scheme to represent training and testing images. Experiments on challenging datasets show that our method is efficient and scalable and achieves state-of-the-art performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Romberg, S., Pueyo, L. G., Lienhart, R., Van Zwol, R.: Scalable logo recognition in real-world images. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval ACM, p. 25 (2011)

  2. Torralba, A., Murphy, K. P., Freeman, W. T., Rubin, M.: Context-based vision system for place and object recognition. In: Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on IEEE, pp. 273–280 (2003)

  3. Dalal, N., & Triggs, B.: Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on IEEE, vol. 1, pp. 886–893 (2005)

  4. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)

    Article  Google Scholar 

  5. Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (SURF). Comput. Vis. Image Underst. 110(3), 346–359 (2008)

    Article  Google Scholar 

  6. Rublee, E., Rabaud, V., Konolige, K., & Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: Computer Vision (ICCV), 2011 IEEE International Conference on IEEE, pp. 2564–2571, (2011)

  7. Sivic, J., & Zisserman, A.: Video Google: A text retrieval approach to object matching in videos. In: Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on IEEE, pp. 1470–1477 (2003)

  8. Tian, Q., Zhang, S., Zhou, W., Ji, R., Ni, B., Sebe, N.: Building descriptive and discriminative visual codebook for large-scale image applications. Multimed. Tools Appl 51(2), 441–477 (2011)

    Article  Google Scholar 

  9. Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: Computer vision and pattern recognition, 2007. CVPR’07. IEEE Conference on, IEEE, pp. 1–8 (2007)

  10. Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In Computer Vision and Pattern Recognition, 2007. CVPR’07. IEEE Conference on IEEE, pp. 1–8 (2007)

  11. Jiang, Y., Meng, J., Yuan, J.: Grid-based local feature bundling for efficient object search and localization. In Image Processing (ICIP), 2011 18th IEEE International Conference on IEEE, pp. 113–116 (2011)

  12. VC, H. P.:. U.S. Patent No. 3,069,654. Washington, DC: U.S. Patent and Trademark Office (1962)

  13. Harris, C., Stephens, M.: A combined corner and edge detector. In: Alvey vision conference, vol. 15, p. 50 (1988)

  14. Mikolajczyk, K., Schmid, C.: Scale & affine invariant interest point detectors. Int. J. Comput. Vision 60(1), 63–86 (2004)

    Article  Google Scholar 

  15. Morel, J.M., Yu, G.: ASIFT: a new framework for fully affine invariant image comparison. SIAM J. Imaging Sci. 2(2), 438–469 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  16. Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman, A., Matas, J., Schaffalitzky, F., Van Gool, L.: A comparison of affine region detectors. Int. J. Comput. Vision 65(1–2), 43–72 (2005)

    Article  Google Scholar 

  17. Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. Pattern Anal. Mach. Intell. IEEE Trans. 27(10), 1615–1630 (2005)

    Article  Google Scholar 

  18. Wu, Z., Ke, Q., Isard, M., Sun, J.: Bundling features for large scale partial-duplicate web image search. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on IEEE, pp. 25–32 (2009)

  19. Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide-baseline stereo from maximally stable extremal regions. Image Vis. Comput. 22(10), 761–767 (2004)

    Article  Google Scholar 

  20. Fu, J., Wang, J., Zhang, Y., Lu, H.: Point-context descriptor based region search for logo recognition. In: Proceedings of the 4th International Conference on Internet Multimedia Computing and Service ACM, pp. 188–191 (2012)

  21. Kalantidis, Y., Pueyo, L. G., Trevisiol, M., van Zwol, R., Avrithis, Y.: Scalable triangulation-based logo recognition. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval ACM, p. 20 (2011)

  22. Romberg, S., & Lienhart, R.: Bundle min-hashing for logo recognition. In: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval ACM, pp. 113–120 (2013)

  23. Zhang, Y., Jia, Z., Chen, T.: Image retrieval with geometry-preserving visual phrases. In: Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on IEEE, pp. 809–816 (2011)

  24. Zheng, L., Wang, S.: Visual phraselet: refining spatial constraints for large scale image search. Signal Process. Lett. IEEE 20(4), 391–394 (2013)

    Article  MathSciNet  Google Scholar 

  25. Liu, D., Hua, G., Viola, P., Chen, T.: Integrated feature selection and higher-order spatial feature extraction for object categorization. In: Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on IEEE pp. 1–8 (2008)

  26. Zhang, S., Huang, Q., Hua, G., Jiang, S., Gao, W., Tian, Q.: Building contextual visual vocabulary for large-scale image applications. In: Proceedings of the international conference on Multimedia ACM, pp. 501–510 (2010)

  27. Kleban, J., Xie, X., Ma, W. Y.: Spatial pyramid mining for logo detection in natural scenes. In: Multimedia and Expo, 2008 IEEE International Conference on IEEE, pp. 1077–1080, (2008)

  28. Zhang, S., Tian, Q., Hua, G., Huang, Q., Li, S.: Descriptive visual words and visual phrases for image applications. In: Proceedings of the 17th ACM international conference on Multimedia ACM, pp. 75–84 (2009)

  29. Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on IEEE, pp. 1794–1801 (2009)

  30. Arandjelović, R., Zisserman, A.: Three things everyone should know to improve object retrieval. In: Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on IEEE, pp. 2911–2918 (2012)

  31. Revaud, J., Douze, M., Schmid, C.: Correlation-based burstiness for logo retrieval. In: Proceedings of the 20th ACM international conference on Multimedia ACM, pp. 965–968 (2012)

  32. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 27 (2011)

    Google Scholar 

Download references

Acknowledgments

This work is partly supported by National Natural Science Foundation of China (Grant no. 61379106), the Shandong Provincial Natural Science Foundation (Grant nos. ZR2013FM036, ZR2015FM011), the Open Project Program of the State Key Lab of CAD&CG (Grant no. A1315), Zhejiang University, the Fundamental Research Funds for the Central Universities (Grant nos.14CX02032A, 14CX02031A).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yujie Liu.

Additional information

Communicated by Q. Tian.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, Y., Wang, J., Li, Z. et al. Efficient logo recognition by local feature groups. Multimedia Systems 23, 395–403 (2017). https://doi.org/10.1007/s00530-016-0508-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00530-016-0508-7

Keywords

Navigation