Skip to main content
Log in

Attentional Scene-Exploration and Object Discovery in Image and RGB-D Data

  • Research Project
  • Published:
KI - Künstliche Intelligenz Aims and scope Submit manuscript

Abstract

In this paper, we summarize our project work of the last two years, where we addressed the tasks of visually exploring a scene with visual attention mechanisms based on saliency computation, and of locating unknown objects in the environment. The latter is also called object discovery and consists in finding candidate objects without previous knowledge about the objects themselves or the scene. We follow an approach motivated from human perception and combine saliency and segmentation to generate object candidates. We show results on 2D images as well as on 3D sequences obtained from an RGB-D camera.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. Code will soon be available on our webpages.

References

  1. Achanta R, Hemami S, Estrada F, Süsstrunk S (2009) Frequency-tuned salient region detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

  2. Achanta R, Süsstrunk S (2010) Saliency detection using maximum symmetric surround. In: Proceedings of the International Conference on Image Processing (ICIP)

  3. Alexe B, Deselaers T, Ferrari V (2010) What is an object? In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

  4. Björkman M, Eklundh JO (2007) Vision in the real world: finding, attending and recognizing objects. Int J Imaging Syst Technol 16(2):189–208

    Google Scholar 

  5. Borji A, Itti L (2010) State-of-the-art in visual attention modeling. IEEE Trans Pattern Anal Mach Intell 35:185–207

    Article  Google Scholar 

  6. Borji A, Itti L (2012) Exploiting local and global patch rarities for saliency detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

  7. Bruce NDB, Tsotsos JK (2009) Saliency, attention, and visual search: an information theoretic approach. J Vision 9(3):5

    Article  Google Scholar 

  8. Frintrop S (2006) VOCUS: a visual attention system for object detection and goal-directed search. Lecture notes in artificial intelligence (LNAI). Springer, Berlin/Heidelberg

    Book  Google Scholar 

  9. Frintrop S (2014) Cognitive approaches for mobile vision systems. Habilitation thesis at the University of Bonn, Germany

  10. Frintrop S, García GM, Cremers AB (2014) A cognitive approach for object discovery. In: Proceedings of the International Conference on Pattern Recognition (ICPR). Stockholm, Sweden

  11. Frintrop S, Rome E, Christensen HI (2010) Computational visual attention systems and their cognitive foundations: a survey. ACM Trans Appl Percept 7(1):6

    Google Scholar 

  12. Gao D, Han S, Vasconcelos N (2009) Discriminant saliency, the detection of suspicious coincidences, and applications to visual recognition. IEEE Trans Pattern Anal Mach Intell 31(6):985–1005

    Google Scholar 

  13. Gao D, Vasconcelos N (2007) Bottom-up saliency is a discriminant process. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV)

  14. García GM, Frintrop S (2013) A computational framework for attentional 3D object detection. In: Proceedings of the Annual Conference of the Cognitive Science Society. Berlin, Germany

  15. Hou X, Harel J, Koch C (2012) Image signature: highlighting sparse salient regions. IEEE Trans Pattern Anal Mach Intell 34:194–201

    Article  Google Scholar 

  16. Hou X, Zhang L (2008) Dynamic visual attention: searching for coding length increments. In: Advances in Neural Information Processing Systems

  17. Itti L, Baldi P (2009) Bayesian surprise attracts human attention. Vision Res 49(10):1295–1306

    Article  Google Scholar 

  18. Itti L, Koch C, Niebur E (1998) A model of saliency-based visual attention for rapid scene analysis. IEEE Trans Pattern Anal Mach Intell 2011:1254–1259

    Article  Google Scholar 

  19. Klein DA, Frintrop S (2012) Salient pattern detection using \(W_2\) on multivariate normal distributions. In: Proceedings of the Joint Pattern Recognition Symposium of the German Association for Pattern Recognition (DAGM) and the Austrian Association for Pattern Recognition (OAGM) (DAGM-OAGM). Graz

  20. Liu T, Yuan Z, Sun J, Wang J, Zheng N, Tang X, Shum HY (2009) Learning to detect a salient object. IEEE Trans Pattern Anal Mach Intell 33:353–367

    Google Scholar 

  21. Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vision 60(2):91–110

    Article  Google Scholar 

  22. Maki A, Nordlund P, Eklundh JO (2000) Attentional scene segmentation: integrating depth and motion. Comput Vision Image Underst 78(3):351–373

    Article  Google Scholar 

  23. Newcombe RA, Izadi S, Hilliges O, Molyneaux D, Kim D, Davison AJ, Kohli P, Shotton J, Hodges S, Fitzgibbon A (2011) KinectFusion: real-time dense surface mapping and tracking. In: Proceedings of the IEEE International Symposium on Mixed and Augmented Reality (ISMAR)

  24. Pashler H (1997) The psychology of attention. MIT Press, Cambridge

    Google Scholar 

  25. Perazzi F, Krahenbuhl P, Pritch Y, Hornung A (2012) Saliency filters: contrast based filtering for salient region detection. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition

  26. Posner M, Cohen Y (1984) Components of visual orienting. In: Bouma H, Bouwhuis D (eds) Attention and performance X. Erlbaum, London, pp 531–556

    Google Scholar 

  27. Rensink RA (2000) The dynamic representation of scenes. Visual Cogn 7:17–42

    Article  Google Scholar 

  28. Rodieck RW (1965) Quantitative analysis of cat retinal ganglion cell response to visual stimuli. Vision Res 5:583–601

    Article  Google Scholar 

  29. Schauerte B, Stiefelhagen R (2012) Quaternion-based spectral saliency detection for eye fixation prediction. In: Proceedings of the European Conference on Computer Vision (ECCV)

  30. Ungerleider L, Mishkin M (1982) Two cortical visual systems. In: Ingle D, Goodale M, Mansfield R (eds) Analysis of visual behavior, pp. 549–586. MIT Press

  31. Walther D, Koch C (2006) Modeling attention to salient proto-objects. Neural Netw 19:1395–1407

    Article  MATH  Google Scholar 

  32. Sun X, Yao H, Ji R (2012) What are we looking for: towards statistical modeling of saccadic eye movements and visual saliency. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

  33. Yan Q, Xu L, Shi J, Jia J (2013) Hierarchical saliency detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

  34. Yang C, Zhang L, Lu H, Ruan X, Yang MH (2013) Saliency detection via graph-based manifold ranking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

  35. Zhang L, Tong MH, Marks TK, Shan H, Cottrell GW (2008) Sun: a bayesian framework for saliency using natural statistics. J Vision 8(7):32

    Article  Google Scholar 

  36. Zhu L, Klein DA, Frintrop S, Cao Z, Cremers AB (2013) Multi-scale region-based saliency detection using \(W_2\) distance on n-dimensional normal distributions. In: Proceedings of the International Conference on Image Processing (ICIP)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Germán Martín García.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Martín García, G., Werner, T. & Frintrop, S. Attentional Scene-Exploration and Object Discovery in Image and RGB-D Data. Künstl Intell 29, 75–81 (2015). https://doi.org/10.1007/s13218-014-0337-9

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13218-014-0337-9

Keywords

Navigation