Abstract
In this paper, we summarize our project work of the last two years, where we addressed the tasks of visually exploring a scene with visual attention mechanisms based on saliency computation, and of locating unknown objects in the environment. The latter is also called object discovery and consists in finding candidate objects without previous knowledge about the objects themselves or the scene. We follow an approach motivated from human perception and combine saliency and segmentation to generate object candidates. We show results on 2D images as well as on 3D sequences obtained from an RGB-D camera.
Similar content being viewed by others
Notes
Code will soon be available on our webpages.
References
Achanta R, Hemami S, Estrada F, Süsstrunk S (2009) Frequency-tuned salient region detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Achanta R, Süsstrunk S (2010) Saliency detection using maximum symmetric surround. In: Proceedings of the International Conference on Image Processing (ICIP)
Alexe B, Deselaers T, Ferrari V (2010) What is an object? In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Björkman M, Eklundh JO (2007) Vision in the real world: finding, attending and recognizing objects. Int J Imaging Syst Technol 16(2):189–208
Borji A, Itti L (2010) State-of-the-art in visual attention modeling. IEEE Trans Pattern Anal Mach Intell 35:185–207
Borji A, Itti L (2012) Exploiting local and global patch rarities for saliency detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Bruce NDB, Tsotsos JK (2009) Saliency, attention, and visual search: an information theoretic approach. J Vision 9(3):5
Frintrop S (2006) VOCUS: a visual attention system for object detection and goal-directed search. Lecture notes in artificial intelligence (LNAI). Springer, Berlin/Heidelberg
Frintrop S (2014) Cognitive approaches for mobile vision systems. Habilitation thesis at the University of Bonn, Germany
Frintrop S, García GM, Cremers AB (2014) A cognitive approach for object discovery. In: Proceedings of the International Conference on Pattern Recognition (ICPR). Stockholm, Sweden
Frintrop S, Rome E, Christensen HI (2010) Computational visual attention systems and their cognitive foundations: a survey. ACM Trans Appl Percept 7(1):6
Gao D, Han S, Vasconcelos N (2009) Discriminant saliency, the detection of suspicious coincidences, and applications to visual recognition. IEEE Trans Pattern Anal Mach Intell 31(6):985–1005
Gao D, Vasconcelos N (2007) Bottom-up saliency is a discriminant process. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV)
García GM, Frintrop S (2013) A computational framework for attentional 3D object detection. In: Proceedings of the Annual Conference of the Cognitive Science Society. Berlin, Germany
Hou X, Harel J, Koch C (2012) Image signature: highlighting sparse salient regions. IEEE Trans Pattern Anal Mach Intell 34:194–201
Hou X, Zhang L (2008) Dynamic visual attention: searching for coding length increments. In: Advances in Neural Information Processing Systems
Itti L, Baldi P (2009) Bayesian surprise attracts human attention. Vision Res 49(10):1295–1306
Itti L, Koch C, Niebur E (1998) A model of saliency-based visual attention for rapid scene analysis. IEEE Trans Pattern Anal Mach Intell 2011:1254–1259
Klein DA, Frintrop S (2012) Salient pattern detection using \(W_2\) on multivariate normal distributions. In: Proceedings of the Joint Pattern Recognition Symposium of the German Association for Pattern Recognition (DAGM) and the Austrian Association for Pattern Recognition (OAGM) (DAGM-OAGM). Graz
Liu T, Yuan Z, Sun J, Wang J, Zheng N, Tang X, Shum HY (2009) Learning to detect a salient object. IEEE Trans Pattern Anal Mach Intell 33:353–367
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vision 60(2):91–110
Maki A, Nordlund P, Eklundh JO (2000) Attentional scene segmentation: integrating depth and motion. Comput Vision Image Underst 78(3):351–373
Newcombe RA, Izadi S, Hilliges O, Molyneaux D, Kim D, Davison AJ, Kohli P, Shotton J, Hodges S, Fitzgibbon A (2011) KinectFusion: real-time dense surface mapping and tracking. In: Proceedings of the IEEE International Symposium on Mixed and Augmented Reality (ISMAR)
Pashler H (1997) The psychology of attention. MIT Press, Cambridge
Perazzi F, Krahenbuhl P, Pritch Y, Hornung A (2012) Saliency filters: contrast based filtering for salient region detection. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition
Posner M, Cohen Y (1984) Components of visual orienting. In: Bouma H, Bouwhuis D (eds) Attention and performance X. Erlbaum, London, pp 531–556
Rensink RA (2000) The dynamic representation of scenes. Visual Cogn 7:17–42
Rodieck RW (1965) Quantitative analysis of cat retinal ganglion cell response to visual stimuli. Vision Res 5:583–601
Schauerte B, Stiefelhagen R (2012) Quaternion-based spectral saliency detection for eye fixation prediction. In: Proceedings of the European Conference on Computer Vision (ECCV)
Ungerleider L, Mishkin M (1982) Two cortical visual systems. In: Ingle D, Goodale M, Mansfield R (eds) Analysis of visual behavior, pp. 549–586. MIT Press
Walther D, Koch C (2006) Modeling attention to salient proto-objects. Neural Netw 19:1395–1407
Sun X, Yao H, Ji R (2012) What are we looking for: towards statistical modeling of saccadic eye movements and visual saliency. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Yan Q, Xu L, Shi J, Jia J (2013) Hierarchical saliency detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Yang C, Zhang L, Lu H, Ruan X, Yang MH (2013) Saliency detection via graph-based manifold ranking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Zhang L, Tong MH, Marks TK, Shan H, Cottrell GW (2008) Sun: a bayesian framework for saliency using natural statistics. J Vision 8(7):32
Zhu L, Klein DA, Frintrop S, Cao Z, Cremers AB (2013) Multi-scale region-based saliency detection using \(W_2\) distance on n-dimensional normal distributions. In: Proceedings of the International Conference on Image Processing (ICIP)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Martín García, G., Werner, T. & Frintrop, S. Attentional Scene-Exploration and Object Discovery in Image and RGB-D Data. Künstl Intell 29, 75–81 (2015). https://doi.org/10.1007/s13218-014-0337-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13218-014-0337-9