Abstract
There has been a growing need to build an object recognition system that can successfully characterize object constancy, irrespective of lighting, shading, occlusions, viewpoint variations and most importantly, deal with the multitude of shapes, colors and sizes in which objects are found. Affordances on the other hand, provide symbolic grounding mechanisms that enable linking features obtained from visual perception with the functionality of the objects, which provides the most consistent and holistic characterization of an object. Recognition by Component Affordances (RBCA) is a recent theory that builds affordance features for recognition. As an extension of the psychophysical theory of Recognition by Components (RBC) to generic visual perception, RBCA is well suited for cognitive visual processing systems which are required to perform implicit cognitive tasks. A common task is to substitute a cup for a mug, bottle, jug, pitcher, pilsner, beaker, chalice, goblet or any other unlabeled object, but with a physical part affording the ability to hold liquid and a part affording grasping by a human hand, given the goal of ’finding an empty cup’ and no cups are available in the work environment of interest. In this paper, we present affordance features for recognition of objects. Using a set of 25 structural and 10 material affordances we define a database of over 250 common household objects. This database called the Affordance Network or AfNet is available as community development framework and is well suited for deployment on domestic robots. Sample object recognition results using AfNet and the associated inference engine that grounds the affordances through visual perception features demonstrate the effectiveness of the approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Grabner, H., Gall, J., van Gool, L.: What Makes a Chair a Chair? In: CVPR, pp. 1529–1536 (2011)
Varadarajan, K.M., Vincze, M.: Holistic Visual Cognitive Recognizer using Part based Local, Global, Semantic and Affordance Features. In: CVPR W (2011)
Varadarajan, K.M., Vincze, M.: Affordance based Part Recognition for Grasping and Manipulation. In: ICRA W (2011)
Varadarajan, K.M., Vincze, M.: Object Part Segmentation and Classification in Range Images for Grasping. In: ICAR (2011)
Varadarajan, K.M., Vincze, M.: Knowledge Representation and Inference for Grasp Affordances. In: Crowley, J.L., Draper, B.A., Thonnat, M. (eds.) ICVS 2011. LNCS, vol. 6962, pp. 173–182. Springer, Heidelberg (2011)
Varadarajan, K.M.: Karmic Tabula Rasa k-TR - A Theory of Visual Perception. In: ISP (2011)
Gibson, J.J.: The Theory of Affordances. In: Shaw, R., Bransford, J. (eds.) (1977) ISBN 0-470-99014-7
Biederman I.: Recognition - by - components: a theory of human image understanding. Psych. Rev. (1994)
MacDorman, K.F.: Responding to affordances: Learning and projecting a sensorimotor mapping. In: ICRA (2000)
Fitzpatrick, P., et. al: Learning about objects through action. In: ICRA (2003)
Stoytchev, A.: Toward learning the binding affordances of objects. In: AAAI Symposium on Dev. Robotics (2005)
Sahin, E., et al.: To afford or not to afford. Adaptive Behavior 15(4), 447–472 (2007)
Varadarajan, K.M., Vincze, M.: Real-Time Depth Diffusion for 3D Surface Reconstruction. In: ICIP (2010)
Varadarajan, K.M., Vincze, M.: Surface Reconstruction for RGB-D Data using Real-Time Depth Propagation. In: ICCV W (2011)
Varadarajan, K.M., Vincze, M.: 4D Space-Time Mereotopogeometry. In: PCC ICRA (2013)
Lampert, C.H., Nickisch, H., Harmeling, S.: Learning to detect unseen object classes by between class attribute transfer. In: CVPR (2009)
Parikh D., Grauman K.: Relative Attributes. In: ICCV (2011)
Gupta, A., Satkin, E., Efros, I., Hebert, M.: From 3D Scene Geometry to Human Workspace. In: CVPR (2011)
Winston, P.H., Binford, T.O., Katz, B., Lowry, M.: Learning physical description from functional definitions, examples, and precedents. MIT Press (1984)
Stark, L., Bowyer, K.: Achieving generalized object recognition through reasoning about association of function to structure. PAMI (1991)
Rivlin, E., Dickinson, S.J., Rosenfeld, A.: Recognition by functional parts. In: CVIU (1995)
Varadarajan, K.M., Vincze, M.: K-TR Theory of Semantic Saliency. In: ICPR (2012)
Varadarajan, K.M., Vincze, M.: AfkTRAANS: The language of Cognitive Robots. In: AAAI Robotics and Multimedia Satellite Event (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Varadarajan, K.M., Vincze, M. (2013). AfNet: The Affordance Network. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds) Computer Vision – ACCV 2012. ACCV 2012. Lecture Notes in Computer Science, vol 7724. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37331-2_39
Download citation
DOI: https://doi.org/10.1007/978-3-642-37331-2_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37330-5
Online ISBN: 978-3-642-37331-2
eBook Packages: Computer ScienceComputer Science (R0)