skip to main content
10.1145/3460426.3463656acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

DANet: Dimension Apart Network for Radar Object Detection

Published: 01 September 2021 Publication History

Abstract

In this paper, we propose a dimension apart network (DANet) for radar object detection task. A Dimension Apart Module (DAM) is first designed to be lightweight and capable of extracting temporal-spatial information from the RAMap sequences. To fully utilize the hierarchical features from the RAMaps, we propose a multi-scale U-Net style network architecture termed DANet. Extensive experiments demonstrate that our proposed DANet achieves superior performance on the radar detection task at much less computational cost, compared to previous pioneer works. In addition to the proposed novel network, we also utilize a vast amount of data augmentation techniques. To further improve the robustness of our model, we ensemble the predicted results from a bunch of lightweight DANet variants. Finally, we achieve 82.2% on average precision and 90% on average recall of object detection performance and rank at 1st place in the ROD2021 radar detection challenge. Our code is available at: \urlhttps://github.com/jb892/ROD2021_Radar_Detection_Challenge_Baidu.

References

[1]
Aleksandar Angelov, Andrew Robertson, Roderick Murray-Smith, and Francesco Fioranelli. 2018. Practical classification of different moving targets using automotive radar and deep neural networks. IET Radar, Sonar & Navigation, Vol. 12, 10 (2018), 1082--1089.
[2]
Vijay Badrinarayanan, Alex Kendall, and Roberto Cipolla. 2017. Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE transactions on pattern analysis and machine intelligence, Vol. 39, 12 (2017), 2481--2495.
[3]
Christopher Choy, JunYoung Gwak, and Silvio Savarese. 2019. 4d spatio-temporal convnets: Minkowski convolutional neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition . 3075--3084.
[4]
Xiangyu Gao, Guanbin Xing, Sumit Roy, and Hui Liu. 2019. Experiments with mmwave automotive radar test-bed. In 2019 53rd Asilomar Conference on Signals, Systems, and Computers. IEEE, 1--6.
[5]
Ross Girshick. 2015. Fast r-cnn. In Proceedings of the IEEE international conference on computer vision. 1440--1448.
[6]
Ross Girshick, Jeff Donahue, Trevor Darrell, and Jitendra Malik. 2014. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 580--587.
[7]
Benjamin Graham, Martin Engelcke, and Laurens van der Maaten. 2018. 3D Semantic Segmentation with Submanifold Sparse Convolutional Networks. CVPR (2018).
[8]
Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask r-cnn. In Proceedings of the IEEE international conference on computer vision. 2961--2969.
[9]
Shiyi Lan, Zhou Ren, Yi Wu, Larry S Davis, and Gang Hua. 2020. SaccadeNet: A fast and accurate object detector. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 10397--10406.
[10]
Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C Berg. 2016. Ssd: Single shot multibox detector. In European conference on computer vision. Springer, 21--37.
[11]
Yuxuan Liu, Yuan Yixuan, and Ming Liu. 2021. Ground-aware Monocular 3D Object Detection for Autonomous Driving. IEEE Robotics and Automation Letters, Vol. 6, 2 (2021), 919--926.
[12]
Bence Major, Daniel Fontijne, Amin Ansari, Ravi Teja Sukhavasi, Radhika Gowaikar, Michael Hamilton, Sean Lee, Slawomir Grzechnik, and Sundar Subramanian. 2019. Vehicle detection with automotive radar using deep learning on range-azimuth-doppler tensors. In Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops. 0--0.
[13]
Arsalan Mousavian, Dragomir Anguelov, John Flynn, and Jana Kosecka. 2017. 3d bounding box estimation using deep learning and geometry. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 7074--7082.
[14]
Ramin Nabati and Hairong Qi. 2021. CenterFusion: Center-based Radar and Camera Fusion for 3D Object Detection. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision . 1527--1536.
[15]
Alejandro Newell, Kaiyu Yang, and Jia Deng. 2016. Stacked hourglass networks for human pose estimation. In European conference on computer vision. Springer, 483--499.
[16]
Arthur Ouaknine, Alasdair Newson, Julien Rebut, Florence Tupin, and Patrick Pérez. 2020. CARRADA Dataset: Camera and Automotive Radar with Range-Angle-Doppler Annotations. arXiv preprint arXiv:2005.01456 (2020).
[17]
Andras Palffy, Jiaao Dong, Julian FP Kooij, and Dariu M Gavrila. 2020. CNN based road user detection using the 3D radar cube. IEEE Robotics and Automation Letters, Vol. 5, 2 (2020), 1263--1270.
[18]
Charles R Qi, Li Yi, Hao Su, and Leonidas J Guibas. 2017. Pointnet
[19]
: Deep hierarchical feature learning on point sets in a metric space. arXiv preprint arXiv:1706.02413 (2017).
[20]
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. arXiv preprint arXiv:1506.01497 (2015).
[21]
Mark A Richards. 2014. Fundamentals of radar signal processing .McGraw-Hill Education.
[22]
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention. Springer, 234--241.
[23]
Pierre Sermanet, David Eigen, Xiang Zhang, Michaël Mathieu, Rob Fergus, and Yann LeCun. 2013. Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint arXiv:1312.6229 (2013).
[24]
Shaoshuai Shi, Xiaogang Wang, and Hongsheng Li. 2019. Pointrcnn: 3d object proposal generation and detection from point cloud. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition . 770--779.
[25]
Tao Wang, Nanning Zheng, Jingmin Xin, and Zheng Ma. 2011. Integrating millimeter wave radar with a monocular vision sensor for on-road obstacle detection applications. Sensors, Vol. 11, 9 (2011), 8992--9008.
[26]
Yizhou Wang, Yen-Ting Huang, and Jenq-Neng Hwang. 2019. Monocular visual object 3d localization in road scenes. In Proceedings of the 27th ACM International Conference on Multimedia. 917--925.
[27]
Yizhou Wang, Zhongyu Jiang, Yudong Li, Jenq-Neng Hwang, Guanbin Xing, and Hui Liu. 2021. RODNet: A Real-Time Radar Object Detection Network Cross-Supervised by Camera-Radar Fused Object 3D Localization. IEEE Journal of Selected Topics in Signal Processing (2021).
[28]
Zetong Yang, Yanan Sun, Shu Liu, and Jiaya Jia. 2020. 3dssd: Point-based 3d single stage object detector. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition . 11040--11048.
[29]
Tianwei Yin, Xingyi Zhou, and Philipp Kr"ahenbühl. 2020. Center-based 3d object detection and tracking. arXiv preprint arXiv:2006.11275 (2020).
[30]
Xingyi Zhou, Dequan Wang, and Philipp Kr"ahenbühl. 2019. Objects as points. arXiv preprint arXiv:1904.07850 (2019).
[31]
Yin Zhou and Oncel Tuzel. 2018. Voxelnet: End-to-end learning for point cloud based 3d object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition . 4490--4499.

Cited By

View all
  • (2025)TransRAD: Retentive Vision Transformer for Enhanced Radar Object DetectionIEEE Transactions on Radar Systems10.1109/TRS.2025.35376043(303-317)Online publication date: 2025
  • (2024)TC–Radar: Transformer–CNN Hybrid Network for Millimeter-Wave Radar Object DetectionRemote Sensing10.3390/rs1616288116:16(2881)Online publication date: 7-Aug-2024
  • (2024)LQCANet: Learnable-Query-Guided Multi-Scale Fusion Network Based on Cross-Attention for Radar Semantic SegmentationIEEE Transactions on Intelligent Vehicles10.1109/TIV.2023.33422969:2(3330-3344)Online publication date: Feb-2024
  • Show More Cited By

Index Terms

  1. DANet: Dimension Apart Network for Radar Object Detection

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      ICMR '21: Proceedings of the 2021 International Conference on Multimedia Retrieval
      August 2021
      715 pages
      ISBN:9781450384636
      DOI:10.1145/3460426
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 September 2021

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. autonomous driving
      2. convolutional neural networks
      3. dam
      4. radar object detection

      Qualifiers

      • Research-article

      Conference

      ICMR '21
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 254 of 830 submissions, 31%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)65
      • Downloads (Last 6 weeks)2
      Reflects downloads up to 23 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2025)TransRAD: Retentive Vision Transformer for Enhanced Radar Object DetectionIEEE Transactions on Radar Systems10.1109/TRS.2025.35376043(303-317)Online publication date: 2025
      • (2024)TC–Radar: Transformer–CNN Hybrid Network for Millimeter-Wave Radar Object DetectionRemote Sensing10.3390/rs1616288116:16(2881)Online publication date: 7-Aug-2024
      • (2024)LQCANet: Learnable-Query-Guided Multi-Scale Fusion Network Based on Cross-Attention for Radar Semantic SegmentationIEEE Transactions on Intelligent Vehicles10.1109/TIV.2023.33422969:2(3330-3344)Online publication date: Feb-2024
      • (2024)A Recurrent CNN for Online Object Detection on Raw Radar FramesIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2024.340407625:10(13432-13441)Online publication date: Oct-2024
      • (2024)MF-SPNet: Dilated Convolutional Pyramid with Attention Mechanism for Semantic Segmentation of Street Scenes2024 7th International Conference on Pattern Recognition and Artificial Intelligence (PRAI)10.1109/PRAI62207.2024.10827680(156-160)Online publication date: 15-Aug-2024
      • (2024)MonoLSS: Learnable Sample Selection For Monocular 3D Detection2024 International Conference on 3D Vision (3DV)10.1109/3DV62453.2024.00088(1125-1135)Online publication date: 18-Mar-2024
      • (2023)Improving the Performance of RODNet for MMW Radar Target Detection in Dense Pedestrian SceneMathematics10.3390/math1102036111:2(361)Online publication date: 10-Jan-2023
      • (2023)T-RODNet: Transformer for Vehicular Millimeter-Wave Radar Object DetectionIEEE Transactions on Instrumentation and Measurement10.1109/TIM.2022.322970372(1-12)Online publication date: 2023
      • (2023)Histogram-based Deep Learning for Automotive Radar2023 IEEE Radar Conference (RadarConf23)10.1109/RadarConf2351548.2023.10149688(1-6)Online publication date: 1-May-2023
      • (2023)Automotive Radar Sub-Sampling via Object Detection Networks: Leveraging Prior Signal InformationIEEE Open Journal of Intelligent Transportation Systems10.1109/OJITS.2023.33320434(858-869)Online publication date: 2023
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media