Skip to main content

A Text Recognition Augmented Deep Learning Approach for Logo Identification

  • Conference paper
  • First Online:
Computer Vision, Graphics, and Image Processing (ICVGIP 2016)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10481))

Abstract

Logo/brand name detection and recognition in unstructured and highly unpredictable natural images has always been a challenging problem. We notice that in most natural images logos are accompanied with associated text. Therefore, we address the problem of logo recognition by first detecting and isolating text of varying color, font size and orientation in the input image using affine invariant maximally stable extremal regions (MSERs). Using an off-the-shelf OCR, we identify the text associated with the logo image. Then an effective grouping technique is employed to combine the remaining stable regions based on spatial proximity of MSERs. Deep learning has the advantage that optimal features can be learned automatically from image pixel data. This motivates us to feed the clustered logo candidate image regions to a pre-trained deep convolutional neural network (DCNN) to generate a set of complex features which are further input to a multiclass support vector machine (SVM) for classification. We tested our proposed logo recognition system on 32 logo classes, and a non-logo class obtained by combining FlickrLogos-32 and MICC logo databases, amounting to a total of 23582 training and testing images. Our method yields robust recognition performance, outperforming state-of-the-art techniques achieving 97.8% precision, 95.7% recall and 95.7% average accuracy on the combined MICC and FlickrLogos-32 datasets and a precision of 98.6%, recall of 97.9% and average accuracy of 99.6% on only the FlickrLogos-32 dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Alaei, A., Delalandre, M., Girard, N.: Logo detection using painting based representation and probability features. In: ICDAR, pp. 1235–1239 (2013)

    Google Scholar 

  2. Boia, R., Florea, C., Florea, L., Dogaru, R.: Logo localization and recognition in natural images using homographic class graphs. Mach. Vis. Appl. 27(2), 287–301 (2016)

    Article  Google Scholar 

  3. Romberg, S., Pueyo, L.G., Lienhart, R., Zwol, R.V.: Scalable logo recognition in real-world images. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval, pp. 965–968 (2011)

    Google Scholar 

  4. Chen, W., Lan, S., Xu, P.: Multiple feature fusion via hierarchical matching for TV logo recognition. In: Proceedings of the 8th International Congress on Image and Signal Processing, IEEE (2015)

    Google Scholar 

  5. Sahbi, H., Ballan, L., Serra, G., Bimbo, A.: Context-dependent logo matching and recognition. IEEE Trans. Image Process. 22(3), 1018–1031 (2013). IEEE

    Article  MATH  MathSciNet  Google Scholar 

  6. Zhang, Y., Zhang, S., Liang, W., Guo, Q.: Individualized matching based on logo density for scalable logo recognition. In: ICASSP, pp. 4324–4328 (2014)

    Google Scholar 

  7. Hassanzadeh, S., Pourghassem, H.: Fast logo detection based on morphological features in document images. In: Proceedings of the 7th International Colloquium on Signal Processing and its Applications, pp. 283–286 (2011)

    Google Scholar 

  8. Hoi, S.C.H., Wu, X., Liu, H., Wu, Y., Wang, H., Xue, H., Wu, Q.: LOGO-net: largescale deep logo detection and brand recognition with deep region-based convolutional networks. arXiv:1511.02462 (2015)

  9. Iandola, F.N., Shen, A., Gao, P., Keutzer, K.: DeepLogo: hitting logo recognition with the deep neural network hammer. arXiv:1510.02131 (2015)

  10. Krizhevsky, A., Sutskever, I., Hinton, J.E.: Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 1097–1105 (2012)

    Google Scholar 

  11. Oliveira, G., Frazão, X., Pimentel, A., Ribeiro. B.: Automatic graphic logo detection via fast region-based convolutional networks. arXiv:1604.06083 (2016)

  12. Uijlings, J.R.R., Van De Sande, K.E.A., Gevers, T., Smeulders, A.W.M.: Selective search for object recognition. Intl. J. Comput. Vis. 104(2), 154–171 (2013). Springer

    Article  Google Scholar 

  13. Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. Image Vis. Comput. 22(10), 761–767 (2002)

    Article  Google Scholar 

  14. Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). doi:10.1007/978-3-319-10590-1_53

    Google Scholar 

  15. Hancock, J.M.: Jaccard distance (Jaccard Index, Jaccard Similarity Coefficient). Dictionary Bioinform. Comput. Biol (2004)

    Google Scholar 

  16. Huang, W., Qiao, Y., Tang, X.: Robust scene text detection with convolution neural network induced MSER trees. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8692, pp. 497–511. Springer, Cham (2014). doi:10.1007/978-3-319-10593-2_33

    Google Scholar 

  17. de Campos, T.E., Babu, B.R., Varma, M.: Character recognition in natural images. In: Proceedings of the 4th International Conference on Computer Vision Theory and Applications, pp. 273–280 (2009)

    Google Scholar 

  18. Revaud, J., Douze, M., Schmid, C.: Correlation-based burstiness for logo retrieval. In: Proceedings of the 20th ACM International Conference on Multimedia, pp. 965–968 (2012)

    Google Scholar 

  19. Romberg, S., Lienhart, R.: Bundle min-hashing for logo recognition. In: Proceedings of the 3rd ACM Conf. on International Conference on Multimedia Accessed, pp. 113–120 (2013)

    Google Scholar 

  20. Farajzadeh, N.: Exemplar-based logo and trademark recognition. Mach. Vis. Appl. 26(6), 791–805 (2015)

    Article  Google Scholar 

  21. Liu, Y., Wang, J., Li, Z., Li, H.: Efficient logo recognition by local feature groups. Multimedia Syst. 1–9 (2016)

    Google Scholar 

  22. Nair, V., Hinton, G.: Rectified linear units improve restricted boltzmann machines. In: ICML (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Moushumi Medhi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Medhi, M., Sinha, S., Sahay, R.R. (2017). A Text Recognition Augmented Deep Learning Approach for Logo Identification. In: Mukherjee, S., et al. Computer Vision, Graphics, and Image Processing. ICVGIP 2016. Lecture Notes in Computer Science(), vol 10481. Springer, Cham. https://doi.org/10.1007/978-3-319-68124-5_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-68124-5_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-68123-8

  • Online ISBN: 978-3-319-68124-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics