A Text Recognition Augmented Deep Learning Approach for Logo Identification

Medhi, Moushumi; Sinha, Shubham; Sahay, Rajiv Ranjan

doi:10.1007/978-3-319-68124-5_13

Moushumi Medhi²¹,
Shubham Sinha²² &
Rajiv Ranjan Sahay²³

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10481))

Included in the following conference series:

International Conference on Computer Vision, Graphics, and Image processing

1490 Accesses
1 Citations

Abstract

Logo/brand name detection and recognition in unstructured and highly unpredictable natural images has always been a challenging problem. We notice that in most natural images logos are accompanied with associated text. Therefore, we address the problem of logo recognition by first detecting and isolating text of varying color, font size and orientation in the input image using affine invariant maximally stable extremal regions (MSERs). Using an off-the-shelf OCR, we identify the text associated with the logo image. Then an effective grouping technique is employed to combine the remaining stable regions based on spatial proximity of MSERs. Deep learning has the advantage that optimal features can be learned automatically from image pixel data. This motivates us to feed the clustered logo candidate image regions to a pre-trained deep convolutional neural network (DCNN) to generate a set of complex features which are further input to a multiclass support vector machine (SVM) for classification. We tested our proposed logo recognition system on 32 logo classes, and a non-logo class obtained by combining FlickrLogos-32 and MICC logo databases, amounting to a total of 23582 training and testing images. Our method yields robust recognition performance, outperforming state-of-the-art techniques achieving 97.8% precision, 95.7% recall and 95.7% average accuracy on the combined MICC and FlickrLogos-32 datasets and a precision of 98.6%, recall of 97.9% and average accuracy of 99.6% on only the FlickrLogos-32 dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Alaei, A., Delalandre, M., Girard, N.: Logo detection using painting based representation and probability features. In: ICDAR, pp. 1235–1239 (2013)
Google Scholar
Boia, R., Florea, C., Florea, L., Dogaru, R.: Logo localization and recognition in natural images using homographic class graphs. Mach. Vis. Appl. 27(2), 287–301 (2016)
Article Google Scholar
Romberg, S., Pueyo, L.G., Lienhart, R., Zwol, R.V.: Scalable logo recognition in real-world images. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval, pp. 965–968 (2011)
Google Scholar
Chen, W., Lan, S., Xu, P.: Multiple feature fusion via hierarchical matching for TV logo recognition. In: Proceedings of the 8th International Congress on Image and Signal Processing, IEEE (2015)
Google Scholar
Sahbi, H., Ballan, L., Serra, G., Bimbo, A.: Context-dependent logo matching and recognition. IEEE Trans. Image Process. 22(3), 1018–1031 (2013). IEEE
Article MATH MathSciNet Google Scholar
Zhang, Y., Zhang, S., Liang, W., Guo, Q.: Individualized matching based on logo density for scalable logo recognition. In: ICASSP, pp. 4324–4328 (2014)
Google Scholar
Hassanzadeh, S., Pourghassem, H.: Fast logo detection based on morphological features in document images. In: Proceedings of the 7th International Colloquium on Signal Processing and its Applications, pp. 283–286 (2011)
Google Scholar
Hoi, S.C.H., Wu, X., Liu, H., Wu, Y., Wang, H., Xue, H., Wu, Q.: LOGO-net: largescale deep logo detection and brand recognition with deep region-based convolutional networks. arXiv:1511.02462 (2015)
Iandola, F.N., Shen, A., Gao, P., Keutzer, K.: DeepLogo: hitting logo recognition with the deep neural network hammer. arXiv:1510.02131 (2015)
Krizhevsky, A., Sutskever, I., Hinton, J.E.: Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 1097–1105 (2012)
Google Scholar
Oliveira, G., Frazão, X., Pimentel, A., Ribeiro. B.: Automatic graphic logo detection via fast region-based convolutional networks. arXiv:1604.06083 (2016)
Uijlings, J.R.R., Van De Sande, K.E.A., Gevers, T., Smeulders, A.W.M.: Selective search for object recognition. Intl. J. Comput. Vis. 104(2), 154–171 (2013). Springer
Article Google Scholar
Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. Image Vis. Comput. 22(10), 761–767 (2002)
Article Google Scholar
Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). doi:10.1007/978-3-319-10590-1_53
Google Scholar
Hancock, J.M.: Jaccard distance (Jaccard Index, Jaccard Similarity Coefficient). Dictionary Bioinform. Comput. Biol (2004)
Google Scholar
Huang, W., Qiao, Y., Tang, X.: Robust scene text detection with convolution neural network induced MSER trees. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8692, pp. 497–511. Springer, Cham (2014). doi:10.1007/978-3-319-10593-2_33
Google Scholar
de Campos, T.E., Babu, B.R., Varma, M.: Character recognition in natural images. In: Proceedings of the 4th International Conference on Computer Vision Theory and Applications, pp. 273–280 (2009)
Google Scholar
Revaud, J., Douze, M., Schmid, C.: Correlation-based burstiness for logo retrieval. In: Proceedings of the 20th ACM International Conference on Multimedia, pp. 965–968 (2012)
Google Scholar
Romberg, S., Lienhart, R.: Bundle min-hashing for logo recognition. In: Proceedings of the 3rd ACM Conf. on International Conference on Multimedia Accessed, pp. 113–120 (2013)
Google Scholar
Farajzadeh, N.: Exemplar-based logo and trademark recognition. Mach. Vis. Appl. 26(6), 791–805 (2015)
Article Google Scholar
Liu, Y., Wang, J., Li, Z., Li, H.: Efficient logo recognition by local feature groups. Multimedia Syst. 1–9 (2016)
Google Scholar
Nair, V., Hinton, G.: Rectified linear units improve restricted boltzmann machines. In: ICML (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

Computational Vision Lab, Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, 721302, West Bengal, India
Moushumi Medhi
Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology, Shibpur, 711103, West Bengal, India
Shubham Sinha
Department of Electrical Engineering, Indian Institute of Technology Kharagpur, Kharagpur, 721302, West Bengal, India
Rajiv Ranjan Sahay

Authors

Moushumi Medhi
View author publications
You can also search for this author in PubMed Google Scholar
Shubham Sinha
View author publications
You can also search for this author in PubMed Google Scholar
Rajiv Ranjan Sahay
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Moushumi Medhi .

Editor information

Editors and Affiliations

Indian Institute of Information Technology, Chittoor, India
Snehasis Mukherjee
GE Global Research, Bangalore, India
Suvadip Mukherjee
Indian Statistical Institute, Kolkata, India
Dipti Prasad Mukherjee
International Institute of Information Technology, Hyderabad, India
Jayanthi Sivaswamy
Indian Institute of Technology Bombay, Mumbai, India
Suyash Awate
CEDAR, Buffalo, New York, USA
Srirangaraj Setlur
International Institute of Information Technology, Hyderabad, India
Anoop M. Namboodiri
CSIR-CEERI, Pilani, Rajasthan, India
Santanu Chaudhury

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Medhi, M., Sinha, S., Sahay, R.R. (2017). A Text Recognition Augmented Deep Learning Approach for Logo Identification. In: Mukherjee, S., et al. Computer Vision, Graphics, and Image Processing. ICVGIP 2016. Lecture Notes in Computer Science(), vol 10481. Springer, Cham. https://doi.org/10.1007/978-3-319-68124-5_13

Download citation

DOI: https://doi.org/10.1007/978-3-319-68124-5_13
Published: 21 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-68123-8
Online ISBN: 978-3-319-68124-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics