Skip to main content

Global Context Extraction for Object Recognition Using a Combination of Range and Visual Features

  • Conference paper
Dynamic 3D Imaging (Dyn3D 2009)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5742))

Included in the following conference series:

Abstract

It has been highlighted by many researchers, that the use of context information as an additional cue for high-level object recognition is important to close the gap between human and computer vision. We present an approach to context extraction in the form of global features for place recognition. Based on an uncalibrated combination of range data of a time-of-flight (ToF) camera and images obtained from a visual sensor, our system is able to classify the environment in predefined places (e.g. kitchen, corridor, office) by representing the sensor data with various global features. Besides state-of-the-art feature types, such as power spectrum models and Gabor filters, we introduce histograms of surface normals as a new representation of range images. An evaluation with different classifiers shows the potential of range data from a ToF camera as an additional cue for this task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lange, R.: 3D Time-of-Flight Distance Measurement with Custom Solid-State Image Sensors in CMOS/CCD-Technology. PhD thesis, University of Siegen (2000)

    Google Scholar 

  2. Prusak, A., Melnychuk, O., Roth, H., Schiller, I., Koch, R.: Pose estimation and map building with a time-of-flight camera for robot navigation. Int. J. Intell. Syst. Technol. Appl. 5, 355–364 (2008)

    Google Scholar 

  3. Hegazy, D., Denzler, J.: Generic 3d object recognition from time-of-flight images using boosted combined shape features. In: Proc. of VISAPP, pp. 321–326 (2009)

    Google Scholar 

  4. Torralba, A., Murphy, K.P., Freeman, W.T., Rubin, M.A.: Context-based vision system for place and object recognition. In: Proc. of ICCV, pp. 273–280 (2003)

    Google Scholar 

  5. Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: Proc. of CVPR, pp. 2169–2178 (2006)

    Google Scholar 

  6. Murray, D.R.: Patchlets: a method of interpreting correlation stereo three-dimensional data. PhD thesis, The University of British Columbia, Canada (2004)

    Google Scholar 

  7. Hetzel, G., Leibe, B., Levi, P., Schiele, B.: 3D object recognition from range images using local feature histograms. In: Proc. of CVPR, vol. 2, pp. 394–399 (2001)

    Google Scholar 

  8. Mezrich, J., Carlson, C., Cohen, R.: Image descriptors for displays. Technical Report PRRL-77-CR-7, Office of Naval Research (1977)

    Google Scholar 

  9. Oliva, A., Torralba, A.: Modeling the shape of the scene: A holistic representation of the spatial envelope. IJCV 42, 145–175 (2001)

    Article  MATH  Google Scholar 

  10. Redies, C., Hasenstein, J., Denzler, J.: Fractal-like image statistics in visual art: similarity to natural scenes. Spatial Vision 21, 137–148 (2007)

    Article  Google Scholar 

  11. Torralba, A.: Contextual priming for object detection. International Journal of Computer Vision 53, 169–191 (2003)

    Article  Google Scholar 

  12. Wainwright, M.J., Schwartz, O., Simoncelli, E.P.: Natural image statistics and divisive normalization: Modeling nonlinearity and adaptation in cortical neurons. In: Rao, R., Olshausen, B., Lewicki, M. (eds.) Probabilistic Models of the Brain: Perception and Neural Function, pp. 203–222. MIT Press, Cambridge (2002)

    Google Scholar 

  13. Bishop, C.M.: Pattern Recognition and Machine Learning (Information Science and Statistics), 1st edn., Springer, Heidelberg (2007)

    Google Scholar 

  14. Schimek, M.G.: Smoothing and Regression: Approaches, Computation, and Application. Series in Probability and Statistics. Wiley, Chichester (1996)

    MATH  Google Scholar 

  15. Breiman, L.: Random forests. Machine Learning 45, 5–32 (2001)

    Article  MATH  Google Scholar 

  16. Geurts, P., Ernst, D., Wehenkel, L.: Extremely randomized trees. Maching Learning 63, 3–42 (2006)

    Article  MATH  Google Scholar 

  17. Platt, J.: Probabilistic outputs for support vector machines and comparison to regularize likelihood methods. In: Smola, A., Bartlett, P., Schoelkopf, B., Schuurmans, D. (eds.) Advances in Large Margin Classifiers, pp. 61–74 (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kemmler, M., Rodner, E., Denzler, J. (2009). Global Context Extraction for Object Recognition Using a Combination of Range and Visual Features. In: Kolb, A., Koch, R. (eds) Dynamic 3D Imaging. Dyn3D 2009. Lecture Notes in Computer Science, vol 5742. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03778-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-03778-8_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03777-1

  • Online ISBN: 978-3-642-03778-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics