Pose Invariant Generic Object Recognition with Orthogonal Axis Manifolds in Linear Subspace

Kalra, Manisha; Deepti, P.; Abhilash, R.; Das, Sukhendu

doi:10.1007/11949619_55

Manisha Kalra¹⁸,
P. Deepti¹⁸,
R. Abhilash¹⁸ &
…
Sukhendu Das¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4338))

1826 Accesses

Abstract

This paper addresses the problem of pose invariant Generic Object Recognition by modeling the perceptual capability of human beings. We propose a novel framework using a combination of appearance and shape cues to recognize the object class and viewpoint (axis of rotation) as well as determine its pose (angle of view). The appearance model of the object from multiple viewpoints is captured using Linear Subspace Analysis techniques and is used to reduce the search space to a few rank-ordered candidates. We have used a decision-fusion based combination of 2D PCA and ICA to integrate the complementary information of classifiers and improve recognition accuracy. For matching based on shape features, we propose the use of distance transform based correlation. A decision fusion using ‘Sum Rule’ of 2D PCA and ICA subspace classifiers, and distance transform based correlation is then used to verify the correct object class and determine its viewpoint and pose. Experiments were conducted on COIL-100 and IGOIL (IITM Generic Object Image Library) databases which contain objects with complex appearance and shape characteristics. IGOIL database was captured to analyze the appearance manifolds along two orthogonal axes of rotation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Murase, H., Nayar, S.: Visual Learning and recognition of 3-D Objects from Appearance. International Journal of Computer Vision 14, 5–24 (1995)
Article Google Scholar
Nagabhushan, P., Guru, D., Shekar, B.: Visual Learning and recognition of 3-D Objects from Appearance using two-dimensional principal components analysis: A robust and an efficient approach. Pattern Recognition 39, 721–725 (2006)
Article MATH Google Scholar
Rosin, P.L., Marshall, D.: Object recognition using local affine frames on distinguished regions. In: Proceedings of the British Machine Vision Conference, London, UK, pp. 113–122 (2002)
Google Scholar
Rothganger, F., Lazebnik, S., Schmid, C., Ponce, J.: 3D Object Modeling and Recognition Using Local Affine-Invariant Image Descriptors and Multi-View Spatial Constraints. International Journal of Computer Vision 66 (2006)
Google Scholar
Biederman, I.: Recognition by Components: A theory of Human Image Understanding. Psychological Review 94, 115–147 (1987)
Article Google Scholar
Leonardis, A., Bischof, H.: Robust recognition using eigenimages. Computer Vision and Image Understanding 78, 99–118 (2000)
Article Google Scholar
Pontil, M., Verri, A.: Support Vector Machines for 3D Object Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 20, 637–646 (1998)
Article Google Scholar
Zhang, H., Berg, A., Mair, M., Malik, J.: SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition. Computer Vision and Pattern Recognition 2, 2126–2136 (2006)
Google Scholar
Belongie, P.J., Malik, J.: Shape Matching and Object Recognition using Shape Contexts. IEEE transactions on Pattern Analysis and Machine Intelligence 24, 509–522 (2002)
Article Google Scholar
Bohm, J., Bernner, C., Guhring, J., Fritsch, D.: Automated Extraction of features from CAD models for 3-D object recognition. In: International Society for Photogrammetry and Remote Sensing Congress, Amsterdam, Netherlands, vol. 33 (2000)
Google Scholar
Edelman, S., Buelthoff, H.: Orientation dependence in the recognition of familiar and novel views of three dimensional objects. Vision Research 32, 2385–2400 (1992)
Article Google Scholar
Kanwisher, N.: Domain Specificity in face perception. Nature Neuroscience 3, 759–776 (2000)
Article Google Scholar
Hyvarinen, A.: Fast and Robust Fixed-Point Algorithms for Independent Component Analysis. IEEE Trans. on Neural Networks 10, 626–634 (1999)
Article Google Scholar
Zhang, J.Y., Frangi, A.F., Yang, J.Y.: Two-Dimensional PCA: A New Approach to Appearance-Based Face Representation and Recognition. IEEE Tran. on Pattern Analysis and Machine Intelligence 26, 131–137 (2004)
Article Google Scholar
Kittler, J., Duin, P., Matas, J.: On Combining Classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence 20, 226–239 (1998)
Article Google Scholar
Sanjay, M., Das, S., Yegnanarayana, B.: Robust Template Matching for noisy bitmap images invariant to translation and rotation. In: Indian Conference on Computer Vision, Graphics and Image Processing, New Delhi, India, pp. 82–84 (1998)
Google Scholar
Nene, S.A., Nayar, S.K., Murase, H.: COIL 100 Database (1996), http://www1.cs.columbia.edu/CAVE/research/softlib/coil-100.html
Kalra, M., Das, S.: IITM Generic Object Image Library (2006), http://vplab.cs.iitm.ernet.in/downloads.html
Rother, C., Kolomogorov, V., Blake, A.: GrabCut- Interactive Foreground extraction using iterated Graph Cuts. ACM transactions on Graphics (SIGGRAPH), 309–314 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Visualization and Perception Laboratory, Department of Computer Science and Engineering, Indian Institute of Technology – Madras, Chennai, 600 036, India
Manisha Kalra, P. Deepti, R. Abhilash & Sukhendu Das

Authors

Manisha Kalra
View author publications
You can also search for this author in PubMed Google Scholar
P. Deepti
View author publications
You can also search for this author in PubMed Google Scholar
R. Abhilash
View author publications
You can also search for this author in PubMed Google Scholar
Sukhendu Das
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, IIT Delhi, New Delhi, India
Prem K. Kalra
School of Computer Science and Engineering, The Hebrew University of Jerusalem, 91904, Jerusalem, Israel
Shmuel Peleg

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kalra, M., Deepti, P., Abhilash, R., Das, S. (2006). Pose Invariant Generic Object Recognition with Orthogonal Axis Manifolds in Linear Subspace. In: Kalra, P.K., Peleg, S. (eds) Computer Vision, Graphics and Image Processing. Lecture Notes in Computer Science, vol 4338. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11949619_55

Download citation

DOI: https://doi.org/10.1007/11949619_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68301-8
Online ISBN: 978-3-540-68302-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics