Abstract
In this paper, we present a novel multiple kernel method to learn the optimal classification function for visual concepts. Although many carefully designed kernels have been proposed in the literature to measure visual similarity, little work has examined how these kernels actually affect learning performance. We propose a Per-Sample Based Multiple Kernel Learning method (PS-MKL) to investigate the discriminative power of each training sample in different basic kernel spaces. The optimal, sample-specific kernel is learned as a linear combination of a set of basic kernels, which leads to a convex optimization problem with a unique global optimum. As illustrated in experiments on the Caltech 101 and Wikipedia MM datasets, the proposed PS-MKL outperforms traditional Multiple Kernel Learning (MKL) methods and achieves results comparable to state-of-the-art methods for learning visual concepts.
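The idea of a sample-specific kernel built as a linear combination of basic kernels can be illustrated with a minimal sketch. The abstract does not give the PS-MKL formulation, so everything below is an assumption made for illustration: the decision function form f(z) = sum_i alpha_i y_i sum_m beta[i, m] K_m(x_i, z) + b, the symbols `alpha`, `beta`, and `b`, the function names, and the choice of a linear and an RBF kernel as the basic kernels. The weights are supplied by hand here; learning them through the convex optimization described above is not shown.

```python
import numpy as np


def linear_kernel(X, Z):
    """Linear basic kernel: K[i, j] = <x_i, z_j>."""
    return X @ Z.T


def rbf_kernel(X, Z, gamma):
    """Gaussian RBF basic kernel: K[i, j] = exp(-gamma * ||x_i - z_j||^2)."""
    sq_dist = (
        (X ** 2).sum(axis=1)[:, None]
        + (Z ** 2).sum(axis=1)[None, :]
        - 2.0 * X @ Z.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-gamma * np.maximum(sq_dist, 0.0))


def per_sample_decision(basic_kernels, beta, alpha, y, b=0.0):
    """Evaluate a decision function with per-sample kernel weights.

    basic_kernels: array (M, N_train, N_test), one matrix per basic kernel.
    beta:          array (N_train, M), hypothetical per-sample kernel weights.
    alpha:         array (N_train,), hypothetical sample coefficients.
    y:             array (N_train,), labels in {-1, +1}.
    """
    # combined[i, j] = sum_m beta[i, m] * K_m(x_i, z_j)
    combined = np.einsum("im,mij->ij", beta, basic_kernels)
    return (alpha * y) @ combined + b


# Usage with synthetic data (assumed values, not from the paper).
rng = np.random.default_rng(0)
X_train = rng.normal(size=(6, 4))
X_test = rng.normal(size=(3, 4))
kernels = np.stack([
    linear_kernel(X_train, X_test),
    rbf_kernel(X_train, X_test, gamma=0.5),
])
beta = rng.uniform(size=(6, 2))
beta /= beta.sum(axis=1, keepdims=True)  # each sample's weights sum to 1
alpha = rng.uniform(size=6)
y = np.array([1.0, -1.0, 1.0, -1.0, 1.0, -1.0])
scores = per_sample_decision(kernels, beta, alpha, y)
```

When every row of `beta` is identical, this sketch reduces to a single shared kernel combination, which is the setting of traditional MKL; letting the rows differ is what makes the combination sample-specific.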
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yang, J., Li, Y., Tian, Y., Duan, L., Gao, W. (2009). A New Multiple Kernel Approach for Visual Concept Learning. In: Huet, B., Smeaton, A., Mayer-Patel, K., Avrithis, Y. (eds) Advances in Multimedia Modeling. MMM 2009. Lecture Notes in Computer Science, vol 5371. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92892-8_28
DOI: https://doi.org/10.1007/978-3-540-92892-8_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-92891-1
Online ISBN: 978-3-540-92892-8
eBook Packages: Computer Science, Computer Science (R0)