Skip to main content

Pose Invariant Generic Object Recognition with Orthogonal Axis Manifolds in Linear Subspace

  • Conference paper
Computer Vision, Graphics and Image Processing

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4338))

  • 1826 Accesses

Abstract

This paper addresses the problem of pose invariant Generic Object Recognition by modeling the perceptual capability of human beings. We propose a novel framework using a combination of appearance and shape cues to recognize the object class and viewpoint (axis of rotation) as well as determine its pose (angle of view). The appearance model of the object from multiple viewpoints is captured using Linear Subspace Analysis techniques and is used to reduce the search space to a few rank-ordered candidates. We have used a decision-fusion based combination of 2D PCA and ICA to integrate the complementary information of classifiers and improve recognition accuracy. For matching based on shape features, we propose the use of distance transform based correlation. A decision fusion using ‘Sum Rule’ of 2D PCA and ICA subspace classifiers, and distance transform based correlation is then used to verify the correct object class and determine its viewpoint and pose. Experiments were conducted on COIL-100 and IGOIL (IITM Generic Object Image Library) databases which contain objects with complex appearance and shape characteristics. IGOIL database was captured to analyze the appearance manifolds along two orthogonal axes of rotation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Murase, H., Nayar, S.: Visual Learning and recognition of 3-D Objects from Appearance. International Journal of Computer Vision 14, 5–24 (1995)

    Article  Google Scholar 

  2. Nagabhushan, P., Guru, D., Shekar, B.: Visual Learning and recognition of 3-D Objects from Appearance using two-dimensional principal components analysis: A robust and an efficient approach. Pattern Recognition 39, 721–725 (2006)

    Article  MATH  Google Scholar 

  3. Rosin, P.L., Marshall, D.: Object recognition using local affine frames on distinguished regions. In: Proceedings of the British Machine Vision Conference, London, UK, pp. 113–122 (2002)

    Google Scholar 

  4. Rothganger, F., Lazebnik, S., Schmid, C., Ponce, J.: 3D Object Modeling and Recognition Using Local Affine-Invariant Image Descriptors and Multi-View Spatial Constraints. International Journal of Computer Vision 66 (2006)

    Google Scholar 

  5. Biederman, I.: Recognition by Components: A theory of Human Image Understanding. Psychological Review 94, 115–147 (1987)

    Article  Google Scholar 

  6. Leonardis, A., Bischof, H.: Robust recognition using eigenimages. Computer Vision and Image Understanding 78, 99–118 (2000)

    Article  Google Scholar 

  7. Pontil, M., Verri, A.: Support Vector Machines for 3D Object Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 20, 637–646 (1998)

    Article  Google Scholar 

  8. Zhang, H., Berg, A., Mair, M., Malik, J.: SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition. Computer Vision and Pattern Recognition 2, 2126–2136 (2006)

    Google Scholar 

  9. Belongie, P.J., Malik, J.: Shape Matching and Object Recognition using Shape Contexts. IEEE transactions on Pattern Analysis and Machine Intelligence 24, 509–522 (2002)

    Article  Google Scholar 

  10. Bohm, J., Bernner, C., Guhring, J., Fritsch, D.: Automated Extraction of features from CAD models for 3-D object recognition. In: International Society for Photogrammetry and Remote Sensing Congress, Amsterdam, Netherlands, vol. 33 (2000)

    Google Scholar 

  11. Edelman, S., Buelthoff, H.: Orientation dependence in the recognition of familiar and novel views of three dimensional objects. Vision Research 32, 2385–2400 (1992)

    Article  Google Scholar 

  12. Kanwisher, N.: Domain Specificity in face perception. Nature Neuroscience 3, 759–776 (2000)

    Article  Google Scholar 

  13. Hyvarinen, A.: Fast and Robust Fixed-Point Algorithms for Independent Component Analysis. IEEE Trans. on Neural Networks 10, 626–634 (1999)

    Article  Google Scholar 

  14. Zhang, J.Y., Frangi, A.F., Yang, J.Y.: Two-Dimensional PCA: A New Approach to Appearance-Based Face Representation and Recognition. IEEE Tran. on Pattern Analysis and Machine Intelligence 26, 131–137 (2004)

    Article  Google Scholar 

  15. Kittler, J., Duin, P., Matas, J.: On Combining Classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence 20, 226–239 (1998)

    Article  Google Scholar 

  16. Sanjay, M., Das, S., Yegnanarayana, B.: Robust Template Matching for noisy bitmap images invariant to translation and rotation. In: Indian Conference on Computer Vision, Graphics and Image Processing, New Delhi, India, pp. 82–84 (1998)

    Google Scholar 

  17. Nene, S.A., Nayar, S.K., Murase, H.: COIL 100 Database (1996), http://www1.cs.columbia.edu/CAVE/research/softlib/coil-100.html

  18. Kalra, M., Das, S.: IITM Generic Object Image Library (2006), http://vplab.cs.iitm.ernet.in/downloads.html

  19. Rother, C., Kolomogorov, V., Blake, A.: GrabCut- Interactive Foreground extraction using iterated Graph Cuts. ACM transactions on Graphics (SIGGRAPH), 309–314 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kalra, M., Deepti, P., Abhilash, R., Das, S. (2006). Pose Invariant Generic Object Recognition with Orthogonal Axis Manifolds in Linear Subspace. In: Kalra, P.K., Peleg, S. (eds) Computer Vision, Graphics and Image Processing. Lecture Notes in Computer Science, vol 4338. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11949619_55

Download citation

  • DOI: https://doi.org/10.1007/11949619_55

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68301-8

  • Online ISBN: 978-3-540-68302-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics