Skip to main content
Log in

Image classification without segmentation using a hybrid pyramid kernel

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Image classification usually requires complicated segmentation to separate foreground objects from the background scene. However, the statistical content of a background scene can actually provide very useful information for classification. In this paper, we propose a new hybrid pyramid kernel which incorporates local features extracted from both dense regular grids and interest points for image classification, without requiring segmentation. Features extracted from dense regular grids can better capture information about the background scene, while interest points detected at corners and edges can better capture information about the salient objects. In our algorithm, these two local features are combined in both the spatial and the feature-space domains, and are organized into pyramid representations. In order to obtain better classification accuracy, we fine-tune the parameters involved in the similarity measure, and we determine discriminative regions by means of relevance feedback. From the experimental results, we observe that our algorithm can achieve a 6.37 % increase in performance as compared to other pyramid-representation-based methods. To evaluate the applicability of the proposed hybrid kernel to large-scale databases, we have performed a cross-dataset experiment and investigated the effect of foreground/background features on each of the kernels. In particular, the proposed hybrid kernel has been proven to satisfy Mercer’s condition and is efficient in measuring the similarity between image features. For instance, the computational complexity of the proposed hybrid kernel is proportional to the number of features.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17

Similar content being viewed by others

References

  1. Ardizzoni S, Bartolini I, Patella M, (1999) “Windsurf: Region-based image retrieval using wavelets.” In IWOSS’99, pp. 167

  2. Bartolini I, Ciaccia P, Patella M (2010) Query processing issues in region-based image databases. In Knowledge and Information Systems 25(2):389–420

    Article  Google Scholar 

  3. Boughhorbel S, Tarel J-P, and Fleuret F (2004) “Non-mercer kernels for SVM object recognition.” In British Machine Vision Conference, Sept

  4. Cho WS, Lam KM (2013) “An efficient and effective hybrid pyramid kernel for un-segmented image classification.” ICSAI 2012, pp. 2153–2158

  5. Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) “Visual categorization with bags of keypoints”. In ECCV’04 workshop on Statistical Learning in Computer Vision. pp.59

  6. Datta R, Joshi D, Li J, Wang JZ (2008) “Image retrieval: Ideas, influences, and trends of the new age.” ACM Comput. Surv. 40(2): Article 5, April

  7. Fei-Fei L, Perona P (2005) “A Bayesian hierarchical model for learning natural scene categories”. In Proc. CVPR

  8. Fergus R, Perona P, Zisserman A (2003) “Object class recognition by unsupervised scale-invariant learning”. In Proc. CVPR

  9. Flitton G, Breckon T (2010) “Object Recognition using 3D SIFT in Complex CT Volumes.” In Proc. of the British Machine Vision Conference. pp. 11.1–12

  10. Gorkani M, Picard R (1994) Texture orientation for sorting photos ‘at a glance’. In IAPR International Conference on Pattern Recognition 1:459–464

    Google Scholar 

  11. Grauman K, Darrell T (2005) “Pyramid match kernels: Discriminative classification with sets of image features.” In Proc. ICCV

  12. Grauman K, Darrell T (2006) “Unsupervised learning of categories from sets of partially matching image features”. In CVPR

  13. Kondor R, Jebara T (2003) “A kernel between sets of vectors,” In Proccedings of International Conference on Machine Learning, Washington, D.C., Aug

  14. Kovashka A, Parikh D, Grauman K (2013) “WhittleSearch: Image Search with Relative Attribute Feedback.” In Proc. CVPR, June

  15. Lazebnik S, Schmid C, Ponce J, (2003) “Affine-invariant local descriptors and neighborhood statistics for texture recognition”. In Proc. of the IEEE International Conf. on Computer Vision, pp. 649–655

  16. Lazebnik S, Schmid C, Ponce J (2006) “Beyond bags of features: spatial pyramid matching for recognizing natural scene categories.” In Proc. of CVPR

  17. Li F, Carreira J, Sminchisescu C (2010) “Object Recognition as Ranking Holistic Figure-Ground Hypotheses”. In Proc.of CVPR 2010, pp.1712–1719

  18. Lin Y, Lv F, Zhu S, Yang M, Cour T, Yu Kai, Cao Liangliang, Huang T (2011) “Large-scale image classification: Fast feature extraction and SVM training.” In Proc. CVPR, pp.1689–1696

  19. Lowe D (1999) “Object recognition from local scale-invariant features.” In Proc. of the International Conference on Computer Vision. pp. 1150–1157

  20. Lowe D (2000) “Towards a computational model for object recognition in IT cortex.” In Biologically Motivated Computer Vision pp. 20–31

  21. Lowe D (2004) Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision 60(2):91–110

    Article  Google Scholar 

  22. Lyu S (2005) “Mercer kernels for object recognition with local features”. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition, Jun

  23. Malik J, Belongie S, Leung T, Shi J (2001) Contour and Texture Analysis for Image Segmentation. International Journal of Computer Vision 43(1):7–27

    Article  MATH  Google Scholar 

  24. Moreno P, Ho P, Vasconcelos N, (2003) “A Kullback–Leibler divergence based kernel for SVM classification in multimedia applications.” In NIPS, Vancouver, Dec

  25. Niebles JC, Fei-Fei L (2007) “A hierarchical model model of shape and appearance for human action classification.” In Proc. of IEEE Computer Vision and Pattern Recognition

  26. Qu Y, Wu S, Liu H, Xie Y, Wang H (2012) “Evaluation of local features and classifiers in BOW model for image classification”. Journal of Multimedia Tools and Applications. doi:10.1007/s11042-012-1107-z

    Google Scholar 

  27. Rubner Y, Tomasi C, Guibas LJ (2000) “The Earth Mover’s Distance as a metric for image retrieval.” International Journal of Computer Vision 40(2)

  28. Shawe-Taylor J, Cristianini N (2004)“Kernel Methods for Pattern Analysis”. Cambridge University Press

  29. Shi J, Malik J (2000) “Normalized Cuts and Image Segmentation”. IEEE Transactions on Pattern Analysis and Machine Intelligence 22: (8) August

  30. Sivic J, Russell B, Efros A, Zisserman A, Freeman W (2005) “Discovering objects and their localization in images.” In Proc. of ICCV

  31. Squire D, Muller W, Muller H, Raki J (1999) “Content-based query of image databases, inspirations from text retrieval: inverted files, frequency-based weights and relevance feedback”. in Proceedings of the 11th Scandinavian conference on image analysis. pp. 143–149

  32. Szummer M, Picard R (1998) “Indoor-outdoor image classification”. In IEEE International Workshop on Content-Based Access of Image and Video Databases pp. 42–51

  33. Tong S, Chang E (2001) “Support vector machine active leaning for image retrieval”. In Proc. of the 9th ACM Conference on Multimedia. Ottawa Canada

  34. Torralba A, Murphy K, Freeman W, Rubin M (2003) “Context-based vision system for place and object recognition.” In Proc. ICCV

  35. Wallraven C, Caputo B, Graf A (2003) “Recognition with local features: the Kernel Recipe.” In Proc. IEEE International Conf. on Computer Vision, Oct

  36. Wang X-Y, Zhang B-B, Yang H-Y (2012) “Content-based image retrieval by integrating color and texture features”. Journal of Multimedia Tools and Applications. doi:10.1007/s11042-012-1055-7

    Google Scholar 

  37. Wolf L, Shashua A (Dec 2003) Learning over sets using kernel principal angles. Journal of Machine Learning Research 4:913–931

    Google Scholar 

  38. Yu S, Shi J (2004) “Segmentation Given Partial Grouping Constraints.” IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(2):February

    Google Scholar 

  39. Zhang J, Marszalek M, Lazebnik S, Schmid C (2005) “Local features and kernels for classification of texture and object categories: An in-depth study”, Technical Report RR-5737. INRIA, Rhône-Alpes

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kin-Man Lam.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cho, WS., Lam, KM. Image classification without segmentation using a hybrid pyramid kernel. Multimed Tools Appl 73, 1195–1224 (2014). https://doi.org/10.1007/s11042-013-1569-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-013-1569-7

Keywords

Navigation