AfNet: The Affordance Network

Varadarajan, Karthik Mahesh; Vincze, Markus

doi:10.1007/978-3-642-37331-2_39

Karthik Mahesh Varadarajan²⁰ &
Markus Vincze²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7724))

Included in the following conference series:

Asian Conference on Computer Vision

8583 Accesses
9 Citations

Abstract

There has been a growing need to build an object recognition system that can successfully characterize object constancy, irrespective of lighting, shading, occlusions, viewpoint variations and most importantly, deal with the multitude of shapes, colors and sizes in which objects are found. Affordances on the other hand, provide symbolic grounding mechanisms that enable linking features obtained from visual perception with the functionality of the objects, which provides the most consistent and holistic characterization of an object. Recognition by Component Affordances (RBCA) is a recent theory that builds affordance features for recognition. As an extension of the psychophysical theory of Recognition by Components (RBC) to generic visual perception, RBCA is well suited for cognitive visual processing systems which are required to perform implicit cognitive tasks. A common task is to substitute a cup for a mug, bottle, jug, pitcher, pilsner, beaker, chalice, goblet or any other unlabeled object, but with a physical part affording the ability to hold liquid and a part affording grasping by a human hand, given the goal of ’finding an empty cup’ and no cups are available in the work environment of interest. In this paper, we present affordance features for recognition of objects. Using a set of 25 structural and 10 material affordances we define a database of over 250 common household objects. This database called the Affordance Network or AfNet is available as community development framework and is well suited for deployment on domestic robots. Sample object recognition results using AfNet and the associated inference engine that grounds the affordances through visual perception features demonstrate the effectiveness of the approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Grabner, H., Gall, J., van Gool, L.: What Makes a Chair a Chair? In: CVPR, pp. 1529–1536 (2011)
Google Scholar
Varadarajan, K.M., Vincze, M.: Holistic Visual Cognitive Recognizer using Part based Local, Global, Semantic and Affordance Features. In: CVPR W (2011)
Google Scholar
Varadarajan, K.M., Vincze, M.: Affordance based Part Recognition for Grasping and Manipulation. In: ICRA W (2011)
Google Scholar
Varadarajan, K.M., Vincze, M.: Object Part Segmentation and Classification in Range Images for Grasping. In: ICAR (2011)
Google Scholar
Varadarajan, K.M., Vincze, M.: Knowledge Representation and Inference for Grasp Affordances. In: Crowley, J.L., Draper, B.A., Thonnat, M. (eds.) ICVS 2011. LNCS, vol. 6962, pp. 173–182. Springer, Heidelberg (2011)
Chapter Google Scholar
Varadarajan, K.M.: Karmic Tabula Rasa k-TR - A Theory of Visual Perception. In: ISP (2011)
Google Scholar
Gibson, J.J.: The Theory of Affordances. In: Shaw, R., Bransford, J. (eds.) (1977) ISBN 0-470-99014-7
Google Scholar
Biederman I.: Recognition - by - components: a theory of human image understanding. Psych. Rev. (1994)
Google Scholar
MacDorman, K.F.: Responding to affordances: Learning and projecting a sensorimotor mapping. In: ICRA (2000)
Google Scholar
Fitzpatrick, P., et. al: Learning about objects through action. In: ICRA (2003)
Google Scholar
Stoytchev, A.: Toward learning the binding affordances of objects. In: AAAI Symposium on Dev. Robotics (2005)
Google Scholar
Sahin, E., et al.: To afford or not to afford. Adaptive Behavior 15(4), 447–472 (2007)
Article Google Scholar
Varadarajan, K.M., Vincze, M.: Real-Time Depth Diffusion for 3D Surface Reconstruction. In: ICIP (2010)
Google Scholar
Varadarajan, K.M., Vincze, M.: Surface Reconstruction for RGB-D Data using Real-Time Depth Propagation. In: ICCV W (2011)
Google Scholar
Varadarajan, K.M., Vincze, M.: 4D Space-Time Mereotopogeometry. In: PCC ICRA (2013)
Google Scholar
Lampert, C.H., Nickisch, H., Harmeling, S.: Learning to detect unseen object classes by between class attribute transfer. In: CVPR (2009)
Google Scholar
Parikh D., Grauman K.: Relative Attributes. In: ICCV (2011)
Google Scholar
Gupta, A., Satkin, E., Efros, I., Hebert, M.: From 3D Scene Geometry to Human Workspace. In: CVPR (2011)
Google Scholar
Winston, P.H., Binford, T.O., Katz, B., Lowry, M.: Learning physical description from functional definitions, examples, and precedents. MIT Press (1984)
Google Scholar
Stark, L., Bowyer, K.: Achieving generalized object recognition through reasoning about association of function to structure. PAMI (1991)
Google Scholar
Rivlin, E., Dickinson, S.J., Rosenfeld, A.: Recognition by functional parts. In: CVIU (1995)
Google Scholar
Varadarajan, K.M., Vincze, M.: K-TR Theory of Semantic Saliency. In: ICPR (2012)
Google Scholar
Varadarajan, K.M., Vincze, M.: AfkTRAANS: The language of Cognitive Robots. In: AAAI Robotics and Multimedia Satellite Event (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

TU Wien, Vienna, Austria
Karthik Mahesh Varadarajan & Markus Vincze

Authors

Karthik Mahesh Varadarajan
View author publications
You can also search for this author in PubMed Google Scholar
Markus Vincze
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Electrical and Computer Engineering, Seoul National University, 1 Gwanak-ro, 151-744, Gwanak-gu, Seoul, Korea
Kyoung Mu Lee
Microsoft Research Asia, No. 5, Danling st., Haidian district, 100080, Beijing, P.R. China
Yasuyuki Matsushita
School of Interactive Computing, Georgia Institute of Technology, 801 Atlantic Drive, CCB 315, 30332, Atlanta, GA, USA
James M. Rehg
Institute of Automation, National Laboratory of Pattern Recognition, Chinese Academy of Sciences, Zhong Quan Cun East Road 95, Haidian District, 100 190, Beijing, P.R. China
Zhanyi Hu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Varadarajan, K.M., Vincze, M. (2013). AfNet: The Affordance Network. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds) Computer Vision – ACCV 2012. ACCV 2012. Lecture Notes in Computer Science, vol 7724. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37331-2_39

Download citation

DOI: https://doi.org/10.1007/978-3-642-37331-2_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37330-5
Online ISBN: 978-3-642-37331-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics