Image classification without segmentation using a hybrid pyramid kernel

Cho, Wai-Shing; Lam, Kin-Man

doi:10.1007/s11042-013-1569-7

Image classification without segmentation using a hybrid pyramid kernel

Published: 31 July 2013

Volume 73, pages 1195–1224, (2014)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Wai-Shing Cho¹ &
Kin-Man Lam¹

310 Accesses
1 Citation
Explore all metrics

Abstract

Image classification usually requires complicated segmentation to separate foreground objects from the background scene. However, the statistical content of a background scene can actually provide very useful information for classification. In this paper, we propose a new hybrid pyramid kernel which incorporates local features extracted from both dense regular grids and interest points for image classification, without requiring segmentation. Features extracted from dense regular grids can better capture information about the background scene, while interest points detected at corners and edges can better capture information about the salient objects. In our algorithm, these two local features are combined in both the spatial and the feature-space domains, and are organized into pyramid representations. In order to obtain better classification accuracy, we fine-tune the parameters involved in the similarity measure, and we determine discriminative regions by means of relevance feedback. From the experimental results, we observe that our algorithm can achieve a 6.37 % increase in performance as compared to other pyramid-representation-based methods. To evaluate the applicability of the proposed hybrid kernel to large-scale databases, we have performed a cross-dataset experiment and investigated the effect of foreground/background features on each of the kernels. In particular, the proposed hybrid kernel has been proven to satisfy Mercer’s condition and is efficient in measuring the similarity between image features. For instance, the computational complexity of the proposed hybrid kernel is proportional to the number of features.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Ardizzoni S, Bartolini I, Patella M, (1999) “Windsurf: Region-based image retrieval using wavelets.” In IWOSS’99, pp. 167
Bartolini I, Ciaccia P, Patella M (2010) Query processing issues in region-based image databases. In Knowledge and Information Systems 25(2):389–420
Article Google Scholar
Boughhorbel S, Tarel J-P, and Fleuret F (2004) “Non-mercer kernels for SVM object recognition.” In British Machine Vision Conference, Sept
Cho WS, Lam KM (2013) “An efficient and effective hybrid pyramid kernel for un-segmented image classification.” ICSAI 2012, pp. 2153–2158
Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) “Visual categorization with bags of keypoints”. In ECCV’04 workshop on Statistical Learning in Computer Vision. pp.59
Datta R, Joshi D, Li J, Wang JZ (2008) “Image retrieval: Ideas, influences, and trends of the new age.” ACM Comput. Surv. 40(2): Article 5, April
Fei-Fei L, Perona P (2005) “A Bayesian hierarchical model for learning natural scene categories”. In Proc. CVPR
Fergus R, Perona P, Zisserman A (2003) “Object class recognition by unsupervised scale-invariant learning”. In Proc. CVPR
Flitton G, Breckon T (2010) “Object Recognition using 3D SIFT in Complex CT Volumes.” In Proc. of the British Machine Vision Conference. pp. 11.1–12
Gorkani M, Picard R (1994) Texture orientation for sorting photos ‘at a glance’. In IAPR International Conference on Pattern Recognition 1:459–464
Google Scholar
Grauman K, Darrell T (2005) “Pyramid match kernels: Discriminative classification with sets of image features.” In Proc. ICCV
Grauman K, Darrell T (2006) “Unsupervised learning of categories from sets of partially matching image features”. In CVPR
Kondor R, Jebara T (2003) “A kernel between sets of vectors,” In Proccedings of International Conference on Machine Learning, Washington, D.C., Aug
Kovashka A, Parikh D, Grauman K (2013) “WhittleSearch: Image Search with Relative Attribute Feedback.” In Proc. CVPR, June
Lazebnik S, Schmid C, Ponce J, (2003) “Affine-invariant local descriptors and neighborhood statistics for texture recognition”. In Proc. of the IEEE International Conf. on Computer Vision, pp. 649–655
Lazebnik S, Schmid C, Ponce J (2006) “Beyond bags of features: spatial pyramid matching for recognizing natural scene categories.” In Proc. of CVPR
Li F, Carreira J, Sminchisescu C (2010) “Object Recognition as Ranking Holistic Figure-Ground Hypotheses”. In Proc.of CVPR 2010, pp.1712–1719
Lin Y, Lv F, Zhu S, Yang M, Cour T, Yu Kai, Cao Liangliang, Huang T (2011) “Large-scale image classification: Fast feature extraction and SVM training.” In Proc. CVPR, pp.1689–1696
Lowe D (1999) “Object recognition from local scale-invariant features.” In Proc. of the International Conference on Computer Vision. pp. 1150–1157
Lowe D (2000) “Towards a computational model for object recognition in IT cortex.” In Biologically Motivated Computer Vision pp. 20–31
Lowe D (2004) Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision 60(2):91–110
Article Google Scholar
Lyu S (2005) “Mercer kernels for object recognition with local features”. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition, Jun
Malik J, Belongie S, Leung T, Shi J (2001) Contour and Texture Analysis for Image Segmentation. International Journal of Computer Vision 43(1):7–27
Article MATH Google Scholar
Moreno P, Ho P, Vasconcelos N, (2003) “A Kullback–Leibler divergence based kernel for SVM classification in multimedia applications.” In NIPS, Vancouver, Dec
Niebles JC, Fei-Fei L (2007) “A hierarchical model model of shape and appearance for human action classification.” In Proc. of IEEE Computer Vision and Pattern Recognition
Qu Y, Wu S, Liu H, Xie Y, Wang H (2012) “Evaluation of local features and classifiers in BOW model for image classification”. Journal of Multimedia Tools and Applications. doi:10.1007/s11042-012-1107-z
Google Scholar
Rubner Y, Tomasi C, Guibas LJ (2000) “The Earth Mover’s Distance as a metric for image retrieval.” International Journal of Computer Vision 40(2)
Shawe-Taylor J, Cristianini N (2004)“Kernel Methods for Pattern Analysis”. Cambridge University Press
Shi J, Malik J (2000) “Normalized Cuts and Image Segmentation”. IEEE Transactions on Pattern Analysis and Machine Intelligence 22: (8) August
Sivic J, Russell B, Efros A, Zisserman A, Freeman W (2005) “Discovering objects and their localization in images.” In Proc. of ICCV
Squire D, Muller W, Muller H, Raki J (1999) “Content-based query of image databases, inspirations from text retrieval: inverted files, frequency-based weights and relevance feedback”. in Proceedings of the 11th Scandinavian conference on image analysis. pp. 143–149
Szummer M, Picard R (1998) “Indoor-outdoor image classification”. In IEEE International Workshop on Content-Based Access of Image and Video Databases pp. 42–51
Tong S, Chang E (2001) “Support vector machine active leaning for image retrieval”. In Proc. of the 9th ACM Conference on Multimedia. Ottawa Canada
Torralba A, Murphy K, Freeman W, Rubin M (2003) “Context-based vision system for place and object recognition.” In Proc. ICCV
Wallraven C, Caputo B, Graf A (2003) “Recognition with local features: the Kernel Recipe.” In Proc. IEEE International Conf. on Computer Vision, Oct
Wang X-Y, Zhang B-B, Yang H-Y (2012) “Content-based image retrieval by integrating color and texture features”. Journal of Multimedia Tools and Applications. doi:10.1007/s11042-012-1055-7
Google Scholar
Wolf L, Shashua A (Dec 2003) Learning over sets using kernel principal angles. Journal of Machine Learning Research 4:913–931
Google Scholar
Yu S, Shi J (2004) “Segmentation Given Partial Grouping Constraints.” IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(2):February
Google Scholar
Zhang J, Marszalek M, Lazebnik S, Schmid C (2005) “Local features and kernels for classification of texture and object categories: An in-depth study”, Technical Report RR-5737. INRIA, Rhône-Alpes
Google Scholar

Download references

Author information

Authors and Affiliations

Centre for Signal Processing, Department of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong
Wai-Shing Cho & Kin-Man Lam

Authors

Wai-Shing Cho
View author publications
Search author on:PubMed Google Scholar
Kin-Man Lam
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Kin-Man Lam.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cho, WS., Lam, KM. Image classification without segmentation using a hybrid pyramid kernel. Multimed Tools Appl 73, 1195–1224 (2014). https://doi.org/10.1007/s11042-013-1569-7

Download citation

Published: 31 July 2013
Issue Date: December 2014
DOI: https://doi.org/10.1007/s11042-013-1569-7

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Image classification without segmentation using a hybrid pyramid kernel

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Content-based image retrieval via a hierarchical-local-feature extraction scheme

Single image deraining via deep pyramid network with spatial contextual information aggregation

A kernel discriminant analysis for spatially dependent data

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Image classification without segmentation using a hybrid pyramid kernel

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Content-based image retrieval via a hierarchical-local-feature extraction scheme

Single image deraining via deep pyramid network with spatial contextual information aggregation

A kernel discriminant analysis for spatially dependent data

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now