Skip to main content

A Deep Reinforcement Learning Framework for Vehicle Detection and Pose Estimation in 3D Point Clouds

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12240))

Abstract

As for autonomous driving in urban environments, it is of significance to accurately capture the position and pose of vehicles. Those information can assist self-driving system in making right decisions to avoid potential risks. Currently, 3D point clouds captured by laser scanners are widely used in self-driving systems to sense the real environment. Therefore, in this paper, we propose a deep reinforcement learning framework for vehicle detection and pose estimation by using 3D point clouds. Specifically, to estimate the pose of vehicles, we propose to design a rotation action in Deep Q Network (DQN). In addition, by considering the whole detection procedure as Markov Decision Process (MDP), our intermediate detected results can further improve the detection performance of our proposed method. The evaluations are carried on outdoor point cloud scenes captured by VMX450 laser scanning system. The experimental results demonstrate the satisfied performance on vehicle detection and pose estimation.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Guo, Y., Li, C., Liu, Q.: R2N: a novel deep learning architecture for rain removal from single image. Comput. Mater. Continua 58(3), 829–843 (2019)

    Article  Google Scholar 

  2. Pan, L., Qin, J., Chen, H., Xiang, X., Li, C., Chen, R.: Image augmentation-based food recognition with convolutional neural networks. Comput. Mater. Continua 59(1), 297–313 (2019)

    Article  Google Scholar 

  3. Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CCNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)

    Google Scholar 

  4. Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)

    Google Scholar 

  5. Zhang, J., Li, Y., Niu, S., Cao, Z., Wang, X.: Improved fully convolutional network for digital image region forgery detection (2019)

    Google Scholar 

  6. Wang, H., et al.: Object detection in terrestrial laser scanning point clouds based on hough forest. IEEE Geosci. Remote Sens. Lett. 11(10), 1807–1811 (2014)

    Article  Google Scholar 

  7. Qi, C.R., Litany, O., He, K., Guibas, L.J.: Deep hough voting for 3D object detection in point clouds. arXiv preprint arXiv:1904.09664 (2019)

  8. Liu, F., et al.: 3DCNN-DQN-RNN: a deep reinforcement learning framework for semantic parsing of large-scale 3d point clouds. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5678–5687 (2017)

    Google Scholar 

  9. Simon, M., Milz, S., Amende, K., Gross, H.-M.: Complex-YOLO: an Euler-region-proposal for real-time 3D object detection on point clouds. In: Leal-Taixé, L., Roth, S. (eds.) ECCV 2018. LNCS, vol. 11129, pp. 197–209. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-11009-3_11

    Chapter  Google Scholar 

  10. Shi, S., Wang, X., Li, H.: PointRCNN: 3D object proposal generation and detection from point cloud. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–779 (2019)

    Google Scholar 

  11. Qi, C.R., Liu, W., Wu, C., Su, H., Guibas, L.J.: Frustum pointnets for 3D object detection from RGB-D data. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 918–927 (2018)

    Google Scholar 

  12. Yang, B., Luo, W., Urtasun, R.: Pixor: real-time 3D object detection from point clouds. In: Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, pp. 7652–7660 (2018)

    Google Scholar 

  13. Lang, A.H., Vora, S., Caesar, H., Zhou, L., Yang, J., Beijbom, O.: Pointpillars: Fast encoders for object detection from point clouds. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 12 697–12 705 (2019)

    Google Scholar 

  14. Behl, A., Paschalidou, D., Donné, S., Geiger, A.: Pointflownet: learning representations for rigid motion estimation from point clouds. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7962–7971 (2019)

    Google Scholar 

  15. Mnih, V., et al.: Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602 (2013)

  16. Mathe, S., Pirinen, A., Sminchisescu, C.: Reinforcement learning for visual object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2894–2902 (2016)

    Google Scholar 

  17. Bueno, M.B., Giró-i Nieto, X., Marqués, F., Torres, J.: Hierarchical object detection with deep reinforcement learning. In: Deep Learning for Image Processing Applications, vol. 31, no. 164, p. 3 (2017)

    Google Scholar 

  18. Li, Y., Fu, K., Sun, H., Sun, X.: An aircraft detection framework based on reinforcement learning and convolutional neural networks in remote sensing images. Remote Sens. 10(2), 243 (2018)

    Article  Google Scholar 

  19. Yang, B., Liang, M., Urtasun, R.: HDNET: exploiting HD maps for 3D object detection. In: Conference on Robot Learning, pp. 146–155 (2018)

    Google Scholar 

  20. Ku, J., Mozifian, M., Lee, J., Harakeh, A., Waslander, S.L.: Joint 3D proposal generation and object detection from view aggregation. In: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1–8. IEEE (2018)

    Google Scholar 

  21. Zeng, Y., et al.: RT3D: Real-time 3-D vehicle detection in lidar point cloud for autonomous driving. IEEE Robot. Autom. Lett. 3(4), 3434–3440 (2018)

    Article  Google Scholar 

  22. Yu, S.-L., Westfechtel, T., Hamada, R., Ohno, K., Tadokoro, S.: Vehicle detection and localization on bird’s eye view elevation images using convolutional neural network. In: 2017 IEEE International Symposium on Safety, Security and Rescue Robotics (SSRR), pp. 102–109. IEEE (2017)

    Google Scholar 

  23. Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529 (2015)

    Article  Google Scholar 

  24. Everingham, M., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vision 88(2), 303–338 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Huan Luo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, W., Luo, H., Zheng, Q., Wang, C., Guo, W. (2020). A Deep Reinforcement Learning Framework for Vehicle Detection and Pose Estimation in 3D Point Clouds. In: Sun, X., Wang, J., Bertino, E. (eds) Artificial Intelligence and Security. ICAIS 2020. Lecture Notes in Computer Science(), vol 12240. Springer, Cham. https://doi.org/10.1007/978-3-030-57881-7_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-57881-7_36

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-57880-0

  • Online ISBN: 978-3-030-57881-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics