Abstract
This paper addresses the problem of pose invariant Generic Object Recognition by modeling the perceptual capability of human beings. We propose a novel framework using a combination of appearance and shape cues to recognize the object class and viewpoint (axis of rotation) as well as determine its pose (angle of view). The appearance model of the object from multiple viewpoints is captured using Linear Subspace Analysis techniques and is used to reduce the search space to a few rank-ordered candidates. We have used a decision-fusion based combination of 2D PCA and ICA to integrate the complementary information of classifiers and improve recognition accuracy. For matching based on shape features, we propose the use of distance transform based correlation. A decision fusion using ‘Sum Rule’ of 2D PCA and ICA subspace classifiers, and distance transform based correlation is then used to verify the correct object class and determine its viewpoint and pose. Experiments were conducted on COIL-100 and IGOIL (IITM Generic Object Image Library) databases which contain objects with complex appearance and shape characteristics. IGOIL database was captured to analyze the appearance manifolds along two orthogonal axes of rotation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Murase, H., Nayar, S.: Visual Learning and recognition of 3-D Objects from Appearance. International Journal of Computer Vision 14, 5–24 (1995)
Nagabhushan, P., Guru, D., Shekar, B.: Visual Learning and recognition of 3-D Objects from Appearance using two-dimensional principal components analysis: A robust and an efficient approach. Pattern Recognition 39, 721–725 (2006)
Rosin, P.L., Marshall, D.: Object recognition using local affine frames on distinguished regions. In: Proceedings of the British Machine Vision Conference, London, UK, pp. 113–122 (2002)
Rothganger, F., Lazebnik, S., Schmid, C., Ponce, J.: 3D Object Modeling and Recognition Using Local Affine-Invariant Image Descriptors and Multi-View Spatial Constraints. International Journal of Computer Vision 66 (2006)
Biederman, I.: Recognition by Components: A theory of Human Image Understanding. Psychological Review 94, 115–147 (1987)
Leonardis, A., Bischof, H.: Robust recognition using eigenimages. Computer Vision and Image Understanding 78, 99–118 (2000)
Pontil, M., Verri, A.: Support Vector Machines for 3D Object Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 20, 637–646 (1998)
Zhang, H., Berg, A., Mair, M., Malik, J.: SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition. Computer Vision and Pattern Recognition 2, 2126–2136 (2006)
Belongie, P.J., Malik, J.: Shape Matching and Object Recognition using Shape Contexts. IEEE transactions on Pattern Analysis and Machine Intelligence 24, 509–522 (2002)
Bohm, J., Bernner, C., Guhring, J., Fritsch, D.: Automated Extraction of features from CAD models for 3-D object recognition. In: International Society for Photogrammetry and Remote Sensing Congress, Amsterdam, Netherlands, vol. 33 (2000)
Edelman, S., Buelthoff, H.: Orientation dependence in the recognition of familiar and novel views of three dimensional objects. Vision Research 32, 2385–2400 (1992)
Kanwisher, N.: Domain Specificity in face perception. Nature Neuroscience 3, 759–776 (2000)
Hyvarinen, A.: Fast and Robust Fixed-Point Algorithms for Independent Component Analysis. IEEE Trans. on Neural Networks 10, 626–634 (1999)
Zhang, J.Y., Frangi, A.F., Yang, J.Y.: Two-Dimensional PCA: A New Approach to Appearance-Based Face Representation and Recognition. IEEE Tran. on Pattern Analysis and Machine Intelligence 26, 131–137 (2004)
Kittler, J., Duin, P., Matas, J.: On Combining Classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence 20, 226–239 (1998)
Sanjay, M., Das, S., Yegnanarayana, B.: Robust Template Matching for noisy bitmap images invariant to translation and rotation. In: Indian Conference on Computer Vision, Graphics and Image Processing, New Delhi, India, pp. 82–84 (1998)
Nene, S.A., Nayar, S.K., Murase, H.: COIL 100 Database (1996), http://www1.cs.columbia.edu/CAVE/research/softlib/coil-100.html
Kalra, M., Das, S.: IITM Generic Object Image Library (2006), http://vplab.cs.iitm.ernet.in/downloads.html
Rother, C., Kolomogorov, V., Blake, A.: GrabCut- Interactive Foreground extraction using iterated Graph Cuts. ACM transactions on Graphics (SIGGRAPH), 309–314 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kalra, M., Deepti, P., Abhilash, R., Das, S. (2006). Pose Invariant Generic Object Recognition with Orthogonal Axis Manifolds in Linear Subspace. In: Kalra, P.K., Peleg, S. (eds) Computer Vision, Graphics and Image Processing. Lecture Notes in Computer Science, vol 4338. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11949619_55
Download citation
DOI: https://doi.org/10.1007/11949619_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68301-8
Online ISBN: 978-3-540-68302-5
eBook Packages: Computer ScienceComputer Science (R0)