Skip to main content
Log in

Attention-Based Detection of Unknown Objects in a Situated Vision Framework

  • Research Project
  • Published:
KI - Künstliche Intelligenz Aims and scope Submit manuscript

Abstract

We present an attention-based approach for the detection of unknown objects in a 3D environment. The ability to address individual objects in the environment without having previous knowledge about their properties or their identity is one important requirement of the Situated Vision theory. Based on saliency maps, our attention system determines the regions where objects are likely to be found; these are the proto-objects whose extent is refined by a 2D segmentation step. At the same time a 3D scene model is built from measurements of a depth camera. The detected objects are projected into the 3D scene, resulting in 3D object models which are incrementally updated. We show the validity of our approach in an RGB-D sequence recorded in an office environment.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. This work is part of DFG DACH project FR 2598/5-1 called Situated Vision to Perceive Object Shape and Affordances., in cooperation with TU Wien, RTWH Aachen and IDIAP.

  2. In [7], the TSDF function is raycasted, given a camera pose, to generate a depth map prediction. Using this method in our extended TSDF function means we can generate 2D IOR or object label maps for every new pose of the camera.

References

  1. Frintrop S, Rome E, Christensen HI (2010) Computational visual attention systems and their cognitive foundations: a survey. ACM Trans Appl Percept 7(1)

  2. Givens CR, Shortt RM (1984) A class of Wasserstein metrics for probability distributions. Mich Math J 31:231–240

    Article  MathSciNet  MATH  Google Scholar 

  3. Klein DA, Frintrop S (2012) Salient pattern detection using W2 on multivariate normal distributions. In: Proc of DAGM-OAGM. Springer, Berlin

    Google Scholar 

  4. Kootstra G, Kragic D (2011) Fast and bottom-up object detection, segmentation, and evaluation using Gestalt principles. In: IEEE int’l conf on robotics and automation

    Google Scholar 

  5. Martín-García G, Frintrop S (2013) A computational framework for attentional 3d object detection. In: Proceedings of the annual meeting of the cognitive science society

    Google Scholar 

  6. Meger D, Muja M, Helmer S, Gupta A, Gamroth C, Hoffman T, Baumann M, Southey T, Fazli P, Wohlkinger W, Viswanathan P, Little JJ, Lowe DG, Orwell J (2010) Curious George: an integrated visual search platform. In: Canadian conference on computer and robot vision

    Google Scholar 

  7. Newcombe RA, Izadi S, Hilliges O, Molyneaux D, Kim D, Davison AJ, Kohli P, Shotton J, Hodges S, Fitzgibbon A (2011) KinectFusion: real-time dense surface mapping and tracking. In: Proc of IEEE int’l symposium on mixed and augmented reality (ISMAR ’11)

    Google Scholar 

  8. Pylyshyn ZW (2001) Visual indexes, preconceptual objects, and situated vision. Cognition 80(1–2):127–158

    Article  Google Scholar 

  9. Rensink RA (2000) The dynamic representation of scenes. Vis Cogn 7:17–42

    Article  Google Scholar 

  10. Rensink RA (2000) Seeing, sensing and scrutinizing. Vis Res 40:1469–1487

    Article  Google Scholar 

  11. Rother C, Kolmogorov V, Blake A (2004) GrabCut: interactive foreground extraction using iterated graph cuts. ACM Trans Graph 23:309–314

    Article  Google Scholar 

  12. Rusu RB, Cousins S (2011) 3D is here: point cloud library (PCL). In: IEEE international conference on robotics and automation (ICRA)

    Google Scholar 

  13. Schlemmer M (2009) Getting past passive vision—on the use of an ontology for situated perception in robots. PhD thesis, Faculty of Electrical Engineering and Information Technology, Vienna University of Technology

  14. Tipper SP, Weaver B, Jerreat LM, Burak AL (1994) Object-based and environment-based inhibition of return of visual attention. J Exp Psychol 20(3):478

    Google Scholar 

  15. Walther D, Koch C (2006) Modeling attention to salient proto-objects. Neural Netw 19(9):1395–1407

    Article  MATH  Google Scholar 

  16. Wolfe JM, Horowitz TS (2004) What attributes guide the deployment of visual attention and how do they do it? Nat Rev Neurosci 5:1–7

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Germán Martín García.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Martín García, G., Frintrop, S. & Cremers, A.B. Attention-Based Detection of Unknown Objects in a Situated Vision Framework. Künstl Intell 27, 267–272 (2013). https://doi.org/10.1007/s13218-013-0256-1

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13218-013-0256-1

Keywords

Navigation