Skip to main content

Viewpoint Selection – Planning Optimal Sequences of Views for Object Recognition

  • Conference paper
Computer Analysis of Images and Patterns (CAIP 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2756))

Included in the following conference series:

Abstract

In the past decades most object recognition systems were based on passive approaches. But in the last few years a lot of research was done in the field of active object recognition. In this context there are several unique problems to be solved, like the fusion of several views and the selection of the best next viewpoint.

In this paper we present an approach to solve the problem of choosing optimal views (viewpoint selection) and the fusion of these for an optimal 3D object recognition (viewpoint fusion). We formally define the selection of additional views as an optimization problem and we show how to use reinforcement learning for viewpoint training and selection in continuous state spaces without user interaction. We also present an approach for the fusion of multiple views based on recursive density propagation.

The experimental results show that our viewpoint selection is able to select a minimal number of views and perform an optimal object recognition with respect to the classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bertsekas, D.P., Tstsiklis, J.N.: Neuro–Dynamic Programming. Athena Scientific, Belmont (1996)

    MATH  Google Scholar 

  2. Borotschnig, H., Paletta, L., Prantl, M., Pinz, A.: A Comparison of Probabilistic, Possibilistic and Evidence Theoretic Fusion Schemes for Active Object Recognition. Computing 62, 293–319 (1999)

    Article  MATH  MathSciNet  Google Scholar 

  3. Deinzer, F., Denzler, J., Niemann, H.: On Fusion of Multiple Views for Active Object Recognition. In: Radig, B., Florczyk, S. (eds.) DAGM 2001. LNCS, vol. 2191, pp. 239–245. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  4. Deinzer, F., Denzler, J., Niemann, H.: Improving Object Recognition By Fusion Of Multiple Views. In: 3rd Indian Conference on Computer Vision Graphics and Image Processing, Ahmedabad, Indien, pp. 161–166. Allied Publishers Pvt. Ltd (2002)

    Google Scholar 

  5. Denzler, J., Brown, C.M.: Information theoretic sensor data selection for active object recognition and state estimation. PAMI 24(2) (2002)

    Google Scholar 

  6. Grässl, C., Deinzer, F., Niemann, H.: Continuous Parametrization of Normal Distributions for Improving the Discrete Statistical Eigenspace Approach for Object Recognition. In: PRIP 2003 (May 2003) (submitted)

    Google Scholar 

  7. Isard, M., Andrew, B.: CONDENSATION – Conditional Density Propagation for Visual Tracking. In: IJCV 1998, vol. 29(1), pp. 5–28 (1998)

    Google Scholar 

  8. Kalman, R.E.: A New Approach to Linear Filtering and Prediction Problems. Journal of Basic Engineering, 35–44 (1960)

    Google Scholar 

  9. Krebs, B., Burkhardt, M., Korn, B.: Handling Uncertainty in 3D Object Recognition using Bayesian Networks. In: Burkhardt, H., Neumann, B. (eds.) ECCV 1998. LNCS, vol. 1407, pp. 782–795. Springer, Heidelberg (1998)

    Chapter  Google Scholar 

  10. Madsen, C.B., Christensen, H.I.: A Viewpoint Planning Strategy for Determining True Angles on Polyhedral Objects by Camera Alignment. PAMI 19(2) (1997)

    Google Scholar 

  11. Lehel, P., Hemayed, E.E., Farag, A.A.: Sensor Planning for a Trinocular Active Vision System. In: CVPR, vol. II, pp. 306–312 (1999)

    Google Scholar 

  12. Roy, S.D., Chaudhury, S., Banerjee, S.: Recognizing Large 3-D Objects through Next View Planning using an Uncalibrated Camera. In: ICCV 2001, Vancouver, Canada, vol. II, pp. 276–281. IEEE Computer Press, Los Alamitos (2001)

    Google Scholar 

  13. Schiele, B., Crowley, J.L.: Transinformation for Active Object Recognition. In: ICCV 1998, Bombay, India, pp. 249–254 (1998)

    Google Scholar 

  14. Sutton, R.S., Barto, A.G.: Reinforcement Learning. A Bradford Book. Cambridge, London (1998)

    Google Scholar 

  15. Viola, P., Wells III, W.M.: Alignment by Maximization of Mutual Information. International Journal of Computer Vision 24(2), 137–154 (1997)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Deinzer, F., Denzler, J., Niemann, H. (2003). Viewpoint Selection – Planning Optimal Sequences of Views for Object Recognition. In: Petkov, N., Westenberg, M.A. (eds) Computer Analysis of Images and Patterns. CAIP 2003. Lecture Notes in Computer Science, vol 2756. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45179-2_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-45179-2_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40730-0

  • Online ISBN: 978-3-540-45179-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics