Abstract
Research on information processing in the brain has focused on the interrelationships among cognitive processes. It is now well established that the units of attention in human vision are not merely spatial but are closely related to perceptual objects. This implies a strong relationship between the segmentation and attention processes. The interaction is bi-directional: while the segmentation process constrains attention, the way an image is segmented may in turn depend on the specific question asked of an observer, i.e. on what she 'attends' to. When the focus of attention shifts from one visual unit to another, the rest of the scene is still perceived, but at a lower resolution than the focused object. The result is a multi-resolution visual perception in which the fovea, a dimple in the central retina, provides the highest-resolution vision. While much recent work has focused on computational models of object-based attention, the design and development of multi-resolution structures that can segment the input image according to the focused perceptual unit remains largely unexplored. This paper proposes a novel structure for multi-resolution image segmentation that extends the encoding provided by the Bounded Irregular Pyramid. Bottom-up attention is embedded in the same structure, allowing the fovea to be set over the most salient image region. Preliminary results obtained from the segmentation of natural images suggest that the approach performs well in terms of both speed and accuracy.
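As a rough illustration of the two ideas summarized above (placing the fovea on the most salient region, and keeping full resolution only around it), the following minimal sketch may help. It is not the Bounded Irregular Pyramid and not the structure proposed in the paper: the saliency measure (deviation from the global mean intensity), the square rings, and the names `most_salient_pixel`, `block_mean`, `foveate`, `fovea_radius` and `max_level` are all assumptions made only for this example.

```python
def most_salient_pixel(image):
    """Return (row, col) of the pixel whose intensity deviates most from
    the global mean. This is a crude stand-in for bottom-up saliency."""
    rows, cols = len(image), len(image[0])
    mean = sum(sum(row) for row in image) / (rows * cols)
    best, best_score = (0, 0), -1.0
    for r in range(rows):
        for c in range(cols):
            score = abs(image[r][c] - mean)
            if score > best_score:
                best, best_score = (r, c), score
    return best


def block_mean(image, r, c, size):
    """Mean intensity of the size x size grid-aligned block containing (r, c)."""
    rows, cols = len(image), len(image[0])
    r0, c0 = (r // size) * size, (c // size) * size
    values = [image[i][j]
              for i in range(r0, min(r0 + size, rows))
              for j in range(c0, min(c0 + size, cols))]
    return sum(values) / len(values)


def foveate(image, fovea, fovea_radius=2, max_level=3):
    """Return a copy of the image whose resolution halves with each square
    ring around the fovea. Pixels within fovea_radius keep full resolution."""
    fr, fc = fovea
    out = []
    for r, row in enumerate(image):
        out_row = []
        for c in range(len(row)):
            # Chebyshev distance gives square rings centred on the fovea.
            ring = max(abs(r - fr), abs(c - fc)) // (fovea_radius + 1)
            level = min(ring, max_level)
            out_row.append(block_mean(image, r, c, 2 ** level))
        out.append(out_row)
    return out
```

Calling `foveate(image, most_salient_pixel(image))` on a grayscale image stored as a list of rows leaves the neighbourhood of the most salient pixel untouched and averages the periphery over progressively larger blocks. A segmentation-driven structure such as the one the paper describes would replace these fixed square blocks with regions that follow perceptual units.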
Acknowledgments
This work has been partially funded by the Spanish Government and FEDER funds under project no. TIN2012-38079-C03-03. This article results from the work of the group of the Integrated Action AT2009-0026, composed of Spanish and Austrian researchers.
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Marfil, R., Antúnez, E., Arrebola, F., Bandera, A. (2014). Merging Attention and Segmentation: Active Foveal Image Representation. In: Grandinetti, L., Lippert, T., Petkov, N. (eds) Brain-Inspired Computing. BrainComp 2013. Lecture Notes in Computer Science, vol 8603. Springer, Cham. https://doi.org/10.1007/978-3-319-12084-3_11
DOI: https://doi.org/10.1007/978-3-319-12084-3_11
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12083-6
Online ISBN: 978-3-319-12084-3
eBook Packages: Computer Science (R0)