Skip to main content

Multi-task Learning with Cartesian Product-Based Multi-objective Combination for Dangerous Object Detection

  • Conference paper
  • First Online:
Advances in Neural Networks - ISNN 2017 (ISNN 2017)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10261))

Included in the following conference series:

Abstract

Autonomous driving has caused extensively attention of academia and industry. Vision-based dangerous object detection is a crucial technology of autonomous driving which detects object and assesses its danger with distance to warn drivers. Previous vision-based dangerous object detections apply two independent models to deal with object detection and distance prediction, respectively. In this paper, we show that object detection and distance prediction have visual relationship, and they can be improved by exploiting the relationship. We jointly optimize object detection and distance prediction with a novel multi-task learning (MTL) model for using the relationship. In contrast to traditional MTL which uses linear multi-task combination strategy, we propose a Cartesian product-based multi-target combination strategy for MTL to consider the dependent among tasks. The proposed novel MTL method outperforms than the traditional MTL and single task methods by a series of experiments.

D. Zhao—This work is supported by National Natural Science Foundation of China (NSFC) under Grants 61573353 and 61533017, and the National Key Research and Development Plan under Grant No. 2016YFB0101000.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bruch, M.: Velodyne HDL-64E lidar for unmanned surface vehicle obstacle detection. In: Proceedings of SPIE - The International Society for Optical Engineering, Florida, 05 April 2010

    Google Scholar 

  2. Chen, X., Kundu, K., Zhang, Z., Ma, H., Fidler, S., Urtasun, R.: Monocular 3D object detection for autonomous driving. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016

    Google Scholar 

  3. Evgeniou, A., Pontil, M.: Multi-task feature learning. Adv. Neural Inf. Process. Syst. 19, 41 (2007)

    Google Scholar 

  4. Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? The KITTI vision benchmark suite. In: Conference on Computer Vision and Pattern Recognition (CVPR) (2012)

    Google Scholar 

  5. Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE Conference on Computer Vision, pp. 1440–1448 (2015)

    Google Scholar 

  6. Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., Berg, A.C.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). doi:10.1007/978-3-319-46448-0_2

    Chapter  Google Scholar 

  7. Lv, L., Zhao, D., Deng, Q.: A semi-supervised predictive sparse decomposition based on the task-driven dictionary learning. Cogn. Comput. (2016). doi:10.1007/s12559-016-9438-0

  8. Neubeck, A., Gool, L.V.: Efficient non-maximum suppression. In: International Conference on Pattern Recognition, pp. 850–855 (2006)

    Google Scholar 

  9. Xia, Y., Wang, C., Shi, X., Zhang, L.: Vehicles overtaking detection using RGB-D data. Sign. Proces. 112, 98–109 (2015)

    Article  Google Scholar 

  10. Yim, J., Jung, H., Yoo, B.I., Choi, C.: Rotating your face using multi-task deep neural network. In: Computer Vision and Pattern Recognition, pp. 676–684 (2015)

    Google Scholar 

  11. Zhang, C., Zhang, Z.: Improving multiview face detection with multi-task deep convolutional neural networks. In: IEEE Winter Conference on Applications of Computer Vision, pp. 1036–1041 (2014)

    Google Scholar 

  12. Zhang, Z., Luo, P., Chen, C.L., Tang, X.: Facial landmark detection by deep multi-task learning. In: European Conference on Computer Vision, pp. 94–108 (2014)

    Google Scholar 

  13. Zhao, D., Chen, Y., Lv, L.: Deep reinforcement learning with visual attention for vehicle classification. IEEE Trans. Cogn. Dev. Syst. (2016). doi:10.1109/TCDS.2016.2614675

  14. Zhou, Q., Wang, G., Jia, K., Zhao, Q.: Learning to share latent tasks for action recognition. In: IEEE International Conference on Computer Vision, pp. 2264–2271 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dongbin Zhao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Chen, Y., Zhao, D. (2017). Multi-task Learning with Cartesian Product-Based Multi-objective Combination for Dangerous Object Detection. In: Cong, F., Leung, A., Wei, Q. (eds) Advances in Neural Networks - ISNN 2017. ISNN 2017. Lecture Notes in Computer Science(), vol 10261. Springer, Cham. https://doi.org/10.1007/978-3-319-59072-1_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-59072-1_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-59071-4

  • Online ISBN: 978-3-319-59072-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics