Abstract
Autonomous driving has caused extensively attention of academia and industry. Vision-based dangerous object detection is a crucial technology of autonomous driving which detects object and assesses its danger with distance to warn drivers. Previous vision-based dangerous object detections apply two independent models to deal with object detection and distance prediction, respectively. In this paper, we show that object detection and distance prediction have visual relationship, and they can be improved by exploiting the relationship. We jointly optimize object detection and distance prediction with a novel multi-task learning (MTL) model for using the relationship. In contrast to traditional MTL which uses linear multi-task combination strategy, we propose a Cartesian product-based multi-target combination strategy for MTL to consider the dependent among tasks. The proposed novel MTL method outperforms than the traditional MTL and single task methods by a series of experiments.
D. Zhao—This work is supported by National Natural Science Foundation of China (NSFC) under Grants 61573353 and 61533017, and the National Key Research and Development Plan under Grant No. 2016YFB0101000.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bruch, M.: Velodyne HDL-64E lidar for unmanned surface vehicle obstacle detection. In: Proceedings of SPIE - The International Society for Optical Engineering, Florida, 05 April 2010
Chen, X., Kundu, K., Zhang, Z., Ma, H., Fidler, S., Urtasun, R.: Monocular 3D object detection for autonomous driving. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016
Evgeniou, A., Pontil, M.: Multi-task feature learning. Adv. Neural Inf. Process. Syst. 19, 41 (2007)
Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? The KITTI vision benchmark suite. In: Conference on Computer Vision and Pattern Recognition (CVPR) (2012)
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE Conference on Computer Vision, pp. 1440–1448 (2015)
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., Berg, A.C.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). doi:10.1007/978-3-319-46448-0_2
Lv, L., Zhao, D., Deng, Q.: A semi-supervised predictive sparse decomposition based on the task-driven dictionary learning. Cogn. Comput. (2016). doi:10.1007/s12559-016-9438-0
Neubeck, A., Gool, L.V.: Efficient non-maximum suppression. In: International Conference on Pattern Recognition, pp. 850–855 (2006)
Xia, Y., Wang, C., Shi, X., Zhang, L.: Vehicles overtaking detection using RGB-D data. Sign. Proces. 112, 98–109 (2015)
Yim, J., Jung, H., Yoo, B.I., Choi, C.: Rotating your face using multi-task deep neural network. In: Computer Vision and Pattern Recognition, pp. 676–684 (2015)
Zhang, C., Zhang, Z.: Improving multiview face detection with multi-task deep convolutional neural networks. In: IEEE Winter Conference on Applications of Computer Vision, pp. 1036–1041 (2014)
Zhang, Z., Luo, P., Chen, C.L., Tang, X.: Facial landmark detection by deep multi-task learning. In: European Conference on Computer Vision, pp. 94–108 (2014)
Zhao, D., Chen, Y., Lv, L.: Deep reinforcement learning with visual attention for vehicle classification. IEEE Trans. Cogn. Dev. Syst. (2016). doi:10.1109/TCDS.2016.2614675
Zhou, Q., Wang, G., Jia, K., Zhao, Q.: Learning to share latent tasks for action recognition. In: IEEE International Conference on Computer Vision, pp. 2264–2271 (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Chen, Y., Zhao, D. (2017). Multi-task Learning with Cartesian Product-Based Multi-objective Combination for Dangerous Object Detection. In: Cong, F., Leung, A., Wei, Q. (eds) Advances in Neural Networks - ISNN 2017. ISNN 2017. Lecture Notes in Computer Science(), vol 10261. Springer, Cham. https://doi.org/10.1007/978-3-319-59072-1_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-59072-1_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59071-4
Online ISBN: 978-3-319-59072-1
eBook Packages: Computer ScienceComputer Science (R0)