Abstract
Hypothesis generation and verification technique has recently attracted much attention in the research on multiple object category detection and localization in images. However, the performance of this strategy greatly depends on the accuracy of generated hypotheses. This paper proposes a method of multiple category object detection adopting the hypothesis generation and verification strategy that can solve the accurate hypothesis generation problem by sub-categorization. Our generative learning algorithm automatically sub-categorizes images of each category into one or more different groups depending on the object’s appearance changes. Based on these sub-categories, efficient hypotheses are generated for each object category within an image in the recognition stage. These hypotheses are then verified to determine the appropriate object categories with their locations using the discriminative classifier. We compare our approach with previous related methods on various standards and the authors’ own datasets. The results show that our approach outperforms the state-of-the-art methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Wu, B., Nevatia, R.: Cluster boosted tree classifier for multi-view, multi-pose object detection. In: IEEE International Conference on Computer Vision, Rio de Janeiro, Brazil, pp. 1–8 (2007)
Seemann, E., Leibe, B., Schiele, B.: Multi-aspect detection of articulated objects. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR(2)), pp. 1582–1588. IEEE Computer Society, New York (2006)
Mansur, A., Kuno, Y.: Improving recognition through object sub-categorization. In: Bebis, G., Boyle, R., Parvin, B., Koracin, D., Remagnino, P., Porikli, F., Peters, J., Klosowski, J., Arns, L., Chun, Y.K., Rhyne, T.-M., Monroe, L. (eds.) ISVC 2008, Part II. LNCS, vol. 5359, pp. 851–859. Springer, Heidelberg (2008)
Huang, C., Ai, H., Li, Y., Lao, S.: Vector boosting for rotation invariant multi-view face detection. In: IEEE International Conference on Computer Vision, Beijing, China, pp. 446–453. IEEE Computer Society, Los Alamitos (2005)
Hofmann, T.: Unsupervised learning by probabilistic latent semantic analysis. Machine Learning 42, 177–196 (2001)
Sivic, J., Russell, B.C., Efros, A.A., Zisserman, A., Freeman, W.T.: Discovering objects and their localization in images. In: IEEE International Conference on Computer Vision, Beijing, China, pp. 370–377. IEEE Computer Society, Los Alamitos (2005)
Fergus, R., Li, F.F., Perona, P., Zisserman, A.: Learning object categories from google’s image search. In: IEEE International Conference on Computer Vision, Beijing, China, pp. 1816–1823. IEEE Computer Society, Los Alamitos (2005)
Das, D., Mansur, A., Kobayashi, Y., Kuno, Y.: An integrated method for multiple object detection and localization. In: Bebis, G., Boyle, R., Parvin, B., Koracin, D., Remagnino, P., Porikli, F., Peters, J., Klosowski, J., Arns, L., Chun, Y.K., Rhyne, T.-M., Monroe, L. (eds.) ISVC 2008, Part II. LNCS, vol. 5359, pp. 133–144. Springer, Heidelberg (2008)
Fritz, M., Leibe, B., Caputo, B., Schiele, B.: Integrating representative and discriminative models for object category detection. In: IEEE International Conference on Computer Vision, Beijing, China, pp. 1363–1370. IEEE Computer Society, Los Alamitos (2005)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60, 91–110 (2004)
He, X.C., Yung, N.H.C.: Curvature scale space corner detector with adaptive threshold and dynamic region of support. In: ICPR (2), pp. 791–794 (2004)
Murphy, K.P., Torralba, A.B., Eaton, D., Freeman, W.T.: Object detection and localization using local and global features. In: Ponce, J., Hebert, M., Schmid, C., Zisserman, A. (eds.) Toward Category-Level Object Recognition. LNCS, vol. 4170, pp. 382–400. Springer, Heidelberg (2006)
Bosch, A., Zisserman, A., Muñoz, X.: Representing shape with spatial pyramid kernel. In: ACM Int. Conf. on Image and Video Retrieval (CIVR), Amsterdam, The Netherlands, pp. 401–408 (2007)
Chang, C.C., Lin, C.J.: Libsvm: A library for support vector machines (2008), http://www.csie.ntu.edu.tw/cjlin/libsvm/
Leibe, B., Leonardis, A., Schiele, B.: Combined object categorization and segmentation with an implicit shape model. In: Workshop on Statistical Learning in Computer Vision, Prague, Czech Republic, pp. 17–32 (2004)
Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The pascal visual object classes challenge (voc2007) results (2007), http://www.pascal-network.org/challenges/VOC/voc2007/workshop/index.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Das, D., Kobayashi, Y., Kuno, Y. (2009). Efficient Hypothesis Generation through Sub-categorization for Multiple Object Detection. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2009. Lecture Notes in Computer Science, vol 5876. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10520-3_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-10520-3_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10519-7
Online ISBN: 978-3-642-10520-3
eBook Packages: Computer ScienceComputer Science (R0)