Hierarchical Transformer for Visual Affordance Understanding using a Large-scale Dataset


Abstract:

Recognition, detection, and segmentation tasks in machine vision have focused on studying the physical and textural attributes of objects. However, robots and intelligent machines require the ability to understand visual cues, such as the visual affordances that objects offer, in order to interact intelligently with novel objects. In this paper, we present a large-scale multi-view RGBD visual affordance learning dataset: a benchmark of 47,210 RGBD images from 37 object categories, annotated with 15 visual affordance categories, and including 35 cluttered/complex scenes. We deploy a Vision Transformer (ViT), called the Visual Affordance Transformer (VAT), for the affordance segmentation task. Due to its hierarchical architecture, VAT can learn multiple affordances at various scales, making it suitable for objects of varying sizes. Our experimental results show the superior performance of VAT compared to state-of-the-art deep learning networks. In addition, the challenging nature of the proposed dataset highlights the potential for new and robust affordance learning algorithms. Our dataset is publicly available at https://sites.google.com/view/afaqshah/dataset.
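The abstract credits VAT's multi-scale capability to its hierarchical architecture: features extracted at progressively coarser resolutions are combined into a dense, per-pixel affordance prediction. The paper's exact architecture is not given on this page, so the following is only an illustrative sketch of the general idea, with hypothetical feature shapes, a nearest-neighbour upsampler, and a 1x1-projection segmentation head standing in for the real model. It assumes 15 affordance classes, matching the dataset's annotation scheme.

```python
import numpy as np

def upsample(feat, factor):
    """Nearest-neighbour upsampling of an (H, W, C) feature map by an integer factor."""
    return feat.repeat(factor, axis=0).repeat(factor, axis=1)

def affordance_head(features, weights):
    """Fuse hierarchical feature maps into per-pixel affordance logits.

    features: list of (H/2^i, W/2^i, C_i) maps, finest scale first
              (hypothetical shapes; the real VAT backbone is not described here).
    weights:  (sum(C_i), num_affordances) projection, i.e. a 1x1 "conv".
    """
    target_h = features[0].shape[0]
    # Bring every scale up to the finest resolution, then concatenate channels.
    upsampled = [upsample(f, target_h // f.shape[0]) for f in features]
    fused = np.concatenate(upsampled, axis=-1)          # (H, W, sum(C_i))
    return fused @ weights                               # (H, W, num_affordances)

# Toy multi-scale pyramid: three scales with growing channel depth.
rng = np.random.default_rng(0)
feats = [rng.standard_normal((32, 32, 64)),
         rng.standard_normal((16, 16, 128)),
         rng.standard_normal((8, 8, 256))]
w = rng.standard_normal((64 + 128 + 256, 15))            # 15 affordance classes

logits = affordance_head(feats, w)
segmentation = logits.argmax(axis=-1)                    # per-pixel affordance label
```

The sketch shows why a hierarchy helps with objects of varying sizes: coarse levels have large receptive fields that capture big objects, while fine levels preserve the spatial detail needed to segment small ones, and the head sees both at once.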
Date of Conference: 01-05 October 2023
Date Added to IEEE Xplore: 13 December 2023
Conference Location: Detroit, MI, USA
