Abstract
Finding logos in the real-world images is a challenging task due to their small size, simple shape, less texture and clutter background. In this paper, through visual logo analysis with different types of features, we propose a novel framework for finding visual logos in the real-world images. First, we exploit the contextual shape and patch information around feature points, merge them into a combined feature representation (point-context). Considering the characteristics of logos, this kind of fusion is an effective enhancement for the discriminability of single point features. Second, to eliminate the interference of the complex and noisy background, we transfer the logo recognition to a region-to-image search problem by segmenting real-world images into region trees. A weak geometric constraint based on regions is encoded into an inverted file structure to accelerate the search process. Third, we apply global features to refine initial results in the re-ranking stage. Finally, we combine each region score both in max-response and accumulate-response mode to obtain the final results. Performances of the proposed approach are evaluated on both our CASIA-LOGO dataset and the standard Flickr logos 27 dataset. Experiments and comparisons show that our approach is superior to the state-of-the-art approaches.
Similar content being viewed by others
Notes
References
Arbelaez, P., Maire, M., Foelks, C., Malik, J.: From contours to regions: An empirical evaluation. In: Proceedings of CVPR’09, pp. 2294–2301 (2009)
Bay, H., Tuytelaars, T., Gool, L.: Surf speeded up robust features. In: Proceedings of ECCV’06 (2006)
Belongie, S., Malik, J., Shape, J.: Matching and object recognition using shape contexts. IEEE Trans. PMAI 9, 471–474 (2002)
Bosch, A., Zisserman, A., Munoz, X.: Representing shape with a spatial pyramid kernel. In: Proceedings of CIVR’07, pp. 401–408 (2007)
Canny, J.: A computational approach to edge detection. IEEE Trans. PAMI 8(6), 679–698 (1986)
Casia logo. http://www.nlpr.ia.ac.cn/iva/homepage/jqwang/files/CASIA-LOGO.rar
Comaniciu D., Meer, P.: Mean shift: a robust approach toward feature space analysis. IEEE Trans. PAMI 24, 603–619 (2002)
Deng, Y., Manjunath, B., Shin, H.: Color image segmentation. In: Proceedings of CVPR’99, pp. 446–451 (1999)
Fu, J., Wang, J., Li, Z., Xu, M., Lu, H.: Efficient clothing retrieval with semantic-preserving visual phrases. In: ACCV12, pp. II:420–431 (2012)
Fu, J., Wang, J., Lu, H.: Effective logo retrieval with adaptive local feature selection. In: Proceedings of ACM Multimedia, pp. 971–974 (2010)
Galleguillos, C., McFee, B., Belongie, S., Lanckriet, G.: From region similarity to category discovery. In: Proceedings of CVPR’11, pp. 2665–2672 (2011)
Gu, C., Lim, J., Arbelaez, P., Malik, J.: Recognition using regions. In: Proceedings of CVPR’09, pp. 1030–1037 (2009)
Hollander, R.J., Hanjalic, A.: Logo recognition in video stills by string matching. In: Proceedings of ICIP’03, pp. 517–520 (2003)
Kalantidis, Y., Pueyo, L., Trevisiol, M.: Scalable triangulation-based logo recognition. In: Proceedings of ACM ICMR’11, pp. 20–20 (2011)
Kim, J., Grauman, K.: Asymmetric region-to-image matching for comparing images with generic object categories. In: Proceedings of CVPR’10, pp. 2344–2351 (2010)
Kleban, J., Xie, X., Ma, W.: Spatial pyramid mining for logo detection in natural scenes. In: Proceedings of ICME’08, pp. 1077–1080 (2008)
Kumar, M., Koller, D.: Efficient selecting regions for scene understanding. In: Proceedings of CVPR’10, pp. 3217–3224 (2010)
Liu, D., Hua, G., Viola, P., Chen, T.: Integrated feature selection and higher-order spatial feature extraction for object categorization. In: Proceedings of CVPR’08, pp. 1–8 (2008)
Llinkigt, M., Kise, K.: Local configuration of sift-like features by a shape context. In: Proceedings of CJKPR’10, pp. 11–15 (2010)
Lowe, D.: Distinctive image features form scale-invariant keypoints. IJCV’04, 60(2), 90–110 (2004)
Matas, J., Chum, O., Martin, U., Pajdla, T.: Logoseeker: a system for detecting and matching logos in natural images. In: Proceedings of BMVC’02, pp. 384–393 (2002)
Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. J. Comput. Vision 42, 145–175 (2001)
Romberg, S., Pueyo, L.G., Lienhart, R., van Zwol, R.: Scalable logo recognition in real-world images. In: Proceedings of ICMR’11 (2011)
Rother, C., Kolmogorov, V., Blake, A.: Grabcut: interactive foreground extraction using iterated graph cuts. In: Proceedings of ACM SIGRAPH’04, pp. 309–314 (2004)
Rusinol, M., Lladós, J.: Efficient logo retrieval through hashing shape context descriptors. In: Proceedings of the Ninth IAPR Workshop on Document Analysis Systems (DAS’10), pp. 215–222 (2010)
Sanyal, S., Sengamedu, S.H.: Logoseeker: a system for detecting and matching logos in natural images. In: Proceedings of ACM Multimedia’07, pp. 166–167 (2007)
Sivic, J., Zisserman, A.: Video google: a text retrieval approach to object matching in videos. In: Proceedings of ICCV’03, pp. 1470–1477 (2003)
Vijayanarasimhan, S., Grauman, K.: Efficient region search for object detection. In: Proceedings of CVPR’11, pp. 1401–1408 (2011)
Wu, Z., Ke, Q., Isard, M., Sun, J.: Bundling features for large scale partial-duplicate web image search. In: Proceedings of CVPR’09, pp. 215–222 (2009)
Zha, Z.J., Hua, X.S., Mei, T., Wang, J., Qi, G.J., Wang, Z.: Joint multi-label multi-instance learning for image classification. In: CVPR’08, pp. 1–8 (2008)
Zha, Z.J., Mei, T., Wang, J., Wang, Z., Hua, X.S.: Graph-based semi-supervised learning with multiple labels. J. Visual Commun. Image Rep. 20(2), 97–103 (2009)
Zha, Z.J., Wang, M., Zheng, Y.T., Yang, Y., Hong, R., Chua, T.S.: Interactive video indexing with statistical active learning. IEEE Trans. Multimedia 14, 17–27 (2012)
Zhang, S., Huang, Q., Hua, G., Jiang, S., Gao, W., Tian, Q.: Building contextual visual vocabulary for large-scale image application. In: Proceedings of ACM Multimedia’10, pp. 501–510 (2010)
Acknowledgments
This work was supported by 973 Program (2010CB327905) and National Natural Science Foundation of China (61273034, 61070104, 61005027 and 61272329).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Wang, J., Fu, J. & Lu, H. Finding logos in real-world images with point-context representation-based region search. Multimedia Systems 21, 301–311 (2015). https://doi.org/10.1007/s00530-013-0349-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00530-013-0349-6