Abstract
In biological vision systems, attention mechanisms are responsible for selecting the relevant information from the sensed field of view, so that the complete scene can be analyzed using a sequence of rapid eye saccades. In recent years, efforts have been made to imitate such attention behavior in artificial vision systems, because it allows optimizing the computational resources as they can be focused on the processing of a set of selected regions. In the framework of mobile robotics navigation, this work proposes an artificial model where attention is deployed at the level of objects (visual landmarks) and where new processes for estimating bottom-up and top-down (target-based) saliency maps are employed. Bottom-up attention is implemented through a hierarchical process, whose final result is the perceptual grouping of the image content. The hierarchical grouping is applied using a Combinatorial Pyramid that represents each level of the hierarchy by a combinatorial map. The process takes into account both image regions (faces in the map) and edges (arcs in the map). Top-down attention searches for previously detected landmarks, enabling their re-detection when the robot presumes that it is revisiting a known location. Landmarks are described by a combinatorial submap; thus, this search is conducted through an error-tolerant submap isomorphism procedure.
References
Antúnez E, Marfil R, Bandera A (2011) Region correspondence using combinatorial pyramids. In: Bandera A, Dias J, Escolano F (eds) Recognition and action for scene understanding. pp 13--24
Antúnez E, Marfil R, Bandera A (2012) Combining boundary and region features inside the Combinatorial Pyramid for topology-preserving perceptual image segmentation. Pattern Recognit Lett 33(16):2245–2253
Bruce N, Tsotsos J (2006) Saliency based on information maximization. In: Weiss Y, Schölkopf B, Platt J (eds) Adv Neural Inf Process Syst 18:155–162
Brun L, Kropatsch W (2001) Introduction to Combinatorial Pyramids. Lect Notes Comput Sci 2243:108–128
Foo P, Warren WH, Duchon A, Tarr MJ (2005) Do humans integrate routes into a cognitive map? Map- versus landmark-based navigation of novel shortcuts. J Exp Psychol Learn Mem Cogn 31(2):195–215
Frintrop S, Rome E, Christensen H (2010) Computational visual attention systems and their cognitive foundations: a survey. ACM Trans Appl Percept 7(1):6:1–6:39
Gao D, Mahadevan V, Vasconcelos N (2008) On the plausibility of the discriminant center-surround hypothesis for visual saliency. J Vision 8(7):1–18
Hirtle, SC (2008) Landmarks for navigation in human and robots. In: M.E. Jefferies et al. (Eds.), Robot. & Cogn. Approach. to Spatial Mapping, STAR 38, 203–214
Koch C, Ullman S (1985) Shifts in selective visual attention: towards the underlying neural circuitry. Hum Neurobiol 4(4):219–227
Marfil R, Bandera A, Rodríguez JA, Sandoval F (2009) A novel hierarchical framework for object-based visual attention. Lect Notes in Artif Intell 5395:27–40
Navalpakkam V, Itti L (2005) Modeling the influence of task on attention. Vision Res 45:205–231
Pylyshyn Z (2001) Visual indexes, preconceptual objects, and situated vision. Cognition 80:127–158
Rensink R (2000) The dynamic representation of scenes. Visual Cogn 7:17–42
Walther D, Koch C (2006) Modeling attention to salient proto-objects. Neural Netw 19:1395–1407
Wang T, Liu Y (2008) Implements of some new algorithm for combinatorial maps. OR Trans 12:58–66
Acknowledgments
This work has been partially granted by the Spanish Government and FEDER funds project no. TIN2011-27512-C05-01. This article is the result of the work of the group of the Integrated Action AT2009-0026, constituted by Spanish and Austrian researchers.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Antúnez, E., Palomino, A.J., Marfil, R. et al. Perceptual organization and artificial attention for visual landmarks detection. Cogn Process 14, 13–18 (2013). https://doi.org/10.1007/s10339-012-0536-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10339-012-0536-y