Journals & Magazines >IEEE Transactions on Intellig... >Volume: 23 Issue: 12

CMAN: Leaning Global Structure Correlation for Monocular 3D Object Detection

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

The key to 3D object detection is proper utilization of depth data. Compared with LiDAR based approaches, 3D object detection from a single image remains a challenging ta...Show More

Metadata

Abstract:

The key to 3D object detection is proper utilization of depth data. Compared with LiDAR based approaches, 3D object detection from a single image remains a challenging task due to the lack of structure information. Recent methods leverage monocular depth estimation as a way to produce 2D depth maps, and adopt the depth maps as additional source of input to explore structure information. However, these methods either encode local structure correlations, or encode long range structure correlations by iteratively passing local messages. In this work, we propose a cross modal attention network (CMAN) for monocular 3D object detection. It is built upon the self-attention module which learns attention map from single modal data. Our CMAN is able to encode structure correlations from depth data, and embed the structure correlations with appearance information which is learned from RGB data. Thanks to the attention learning mechanism, our CMAN learns global structure correlations without iteration. In order to reduce the computational burden, our CMAN adopts a novel node sampler to eliminate redundant nodes during the attention map calculation. Experiment results on benchmark KITTI3D dataset show that our proposed CMAN outperforms the state-of-the-art methods.

Published in: IEEE Transactions on Intelligent Transportation Systems ( Volume: 23, Issue: 12, December 2022)

Page(s): 24727 - 24737

Date of Publication: 21 September 2022

ISSN Information:

DOI: 10.1109/TITS.2022.3205446

Funding Agency:

Contents

References is not available for this document.

CMAN: Leaning Global Structure Correlation for Monocular 3D Object Detection

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

CMAN: Leaning Global Structure Correlation for Monocular 3D Object Detection

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

Authors

Figures

References

Citations

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?