Abstract
Autonomous driving research is an emerging area in the machine learning domain. Most existing methods perform single-task learning, while multi-task learning (MTL) is more efficient due to the leverage of shared information between different tasks. However, MTL is challenging because different tasks may have different significance and varying ranges. In this work, we propose an end-to-end deep learning architecture for statistically correlated MTL using a single input image. Statistical correlation of the tasks is handled by including shared layers in the architecture. Later network separates into different branches to handle the difference in the behavior of each task. Training a multi-task model with varying ranges may converge the objective function only with larger values. To this end, we explore different normalization schemes and empirically observe that the inverse validation-loss weighted scheme has best performed. In addition to estimating steering angle, braking, and acceleration, we also estimate the number of lanes on the left and the right side of the vehicle. To the best of our knowledge, we are the first to propose an end-to-end deep learning architecture to estimate this type of lane information. The proposed approach is evaluated on four publicly available datasets including Comma.ai, Udacity, Berkeley Deep Drive, and Sully Chen. We also propose a synthetic dataset GTA-V for autonomous driving research. Our experiments demonstrate the superior performance of the proposed approach compared to the current state-of-the-art methods. The GTA-V dataset and the lane annotations on the four existing datasets will be made publicly available via https://cvlab.lums.edu.pk/scmtl/.















Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.References
Abbas W, Taj M (2019) Adaptively weighted multi-task learning using inverse validation loss. In: IEEE International conference on acoustics, speech and signal processing
Alamri A, Gumaei A, Al-Rakhami M, Hassan MM, Alhussein M, Fortino G (2020) An effective bio-signal-based driver behavior monitoring system using a generalized deep learning approach. IEEE Access 8:135037–135049
Al-Qizwini M, Barjasteh I, Al-Qassab H, Radha H (2017) Deep learning algorithm for autonomous driving using GoogLeNet. In: IEEE intelligent vehicles symposium. pp 89–96
Bojarski M, Del Testa D, Dworakowski D et al (2016) End to end learning for self-driving cars. arXiv:1604.07316
Bojarski M, Yeres P, Choromanska A, Choromanski K, Firner B, Jackel LD, Muller U (2017) Explaining how a deep neural network trained with end-to-end learning steers a car. arXiv:1704.07911
Campbell M, Egerstedt M, How JP, Murray RM (2010) Autonomous driving in urban environments: approaches, lessons and challenges. Philos Trans Ser A Math Phys Eng Sci 368(1928):4649–72
Chen CA, Seff AK, Xiao J (2015) Deepdriving: learning affordance for direct perception in autonomous driving. In: IEEE international conference on computer vision
Chen Y, Zhao D, Lv L, Zhang Q (2018) Multi-task learning for dangerous object detection in autonomous driving. Inf Sci 432:559–571
Cheng G, Yang C, Yao X, Guo L, Han J (2018) When deep learning meets metric learning: remote sensing image scene classification via learning discriminative cnns. In: IEEE transactions on geoscience and remote sensing
Chen S. https://goo.gl/5wq7fe. Last accessed 07 Dec 2018
Chen Z, Huang X (2017) End-to-end learning for lane keeping of self-driving cars. In: IEEE intelligent vehicles symposium. pp 1856–1860
Chi L, Mu Y (2017) Deep steering: learning end-to-end driving model from spatial and temporal visual cues. arXiv:1708.03798
Chi L, Mu Y (2017) Learning end-to-end autonomous steering model from spatial and temporal visual cues. In: Workshop on visual analysis in smart and connected communities. pp 9–16
Daniel NJH, Lee TD, Liu SC (2017) Delta networks for optimized recurrent network computation. In: International conference on machine learning
Du S, Guo H, Simpson A (2017) Self-driving car steering angle prediction based on image recognition. Technical report, Stanford, CA, USA CS231 course project
Geiger A, Lenz P, Urtasun R (2012) Are we ready for autonomous driving? The KITTI vision benchmark suite. In: IEEE conference on computer vision and pattern recognition. pp 3354–3361
Hassan MM, Ullah S, Hossain MS et al (2020) An end-to-end deep learning model for human activity recognition from highly sparse body sensor data in internet of medical things environment. J Supercomput
He K, Wang Z, Fu Y, Feng R, Jiang YG, Xue X (2017) Adaptively weighted multi-task deep network for person attribute classification. In: ACM international conference on multimedia. pp 1636–1644
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: IEEE conference on computer vision and pattern recognition. pp 770–778
Hou Y, Ma Z, Liu C, Loy CC (2019) Learning to steer by mimicking features from heterogeneous auxiliary networks. Proc AAAI Conf Artif Intell 33:8433–8440
Innocenti C, Lindén H, Panahandeh G, Svensson L, Mohammadiha N (2017) Imitation learning for vision-based lane keeping assistance. In: Intelligent transportation systems. pp 425–430
Jiang J, Astolfi A (2017) A lateral control assistant for the dynamic model of vehicles subject to state constraints. In: IEEE conference on decision and control. pp 244–249
John V, Mita S, Tehrani H, Ishimaru K (2017) Automated driving by monocular camera using deep mixture of experts. In: IEEE intelligent vehicles symposium. pp 127–134
Kim J, Canny JF (2017) Interpretable learning for self-driving cars by visualizing causal attention. In: IEEE international conference on computer vision. pp 2961–2969
Kingma DP, Ba JL (2014) Adam: a method for stochastic optimization In: Proceedings of the 3rd international conference on learning representations
LeCun Y, Muller U, Ben J, Cosatto E, Flepp B (2005) Off-road obstacle avoidance through end-to-end learning. In: Advances in neural information processing systems. pp 739–746
Liang J, Liu Z, Zhou J, Jiang X, Zhang C, Wang F (2018) Model-protected multi-task learning. arXiv:1809.06546
Lio MD, Mazzalai A, Gurney K, Saroldi A (2018) Biologically guided driver modeling: the stop behavior of human car drivers. IEEE Trans. Int. Trans. Syst. 19(8):2454–2469
Markatopoulou F, Mezaris V, Patras I (2018) Implicit and explicit concept relations in deep neural networks for multi-label video/image annotation. In: IEEE transactions on circuits and systems for video technology
Menéndez-Romero C, Winkler F, Dornhege C, Burgard W (2017) Maneuver planning for highly automated vehicles. In: IEEE intelligent vehicles symposium. pp 1458–1464
Olabiyi O, Martinson E, Chintalapudi V, Guo R (2017) Driver action prediction using deep (bidirectional) recurrent neural network. arXiv:1706.02257
Roberts B, Kaltwang S, Samangooei S, Pender-Bare M, Tertikas K, Redford J (2018) A dataset for lane instance segmentation in urban environments. In: European conference on computer vision
Ruder S (2017) An overview of multi-task learning in deep neural networks. arXiv:1706.05098
Santana E, Hotz G (2016) Learning a driving simulator. arXiv:1608.01230
Szegedy C, Ioffe S, Vanhoucke V, Alemi AA (2017) Inception-v4, inception-resnet and the impact of residual connections on learning. AAAI Conf Artif Intell 4:12
Taj M, Abbas W (2019) Multi-task learning for autonomous driving, AI for emerging verticals: human-robot computing, sensing and networking, Chapter 13, eds. Shakir, M.Z., Ramza, M.N., IET
Teichmann M, Weber M, Zöllner JM, Cipolla R, Urtasun R (2018) Multinet: real-time joint semantic reasoning for autonomous driving. In: IEEE intelligent vehicles symposium. pp 1013–1020
Udacity: arXiv:1604.073160. Last accessed: 07 Dec 2018
Wang R, Li M, Peng L, Hu Y, Hassan MM, Alelaiwi A (2020) Cognitive multi-agent empowering mobile edge computing for resource caching and collaboration. In: Future generation computer systems. pp 66–74
Wu Y, Chen Z, Liu R, Li F (2018) Lane departure avoidance control for electric vehicle using torque allocation. Math Prob Eng 2018
Xu H, Gao Y, Yu F, Darrell T (2017) End-to-end learning of driving models from large-scale video datasets. In: IEEE conference on computer vision and pattern recognition. pp 3530–3538
Xu H, Gao Y, Yu F, Darrell T (2017) End-to-end learning of driving models from large-scale video datasets. In: IEEE Conference on Computer Vision and Pattern Recognition. pp 3530–3538
Yang J, You X, Wu G, Hassan MM, Almogren A, Guna J (2019) Application of reinforcement learning in UAV cluster task scheduling, In: Future generation computer systems. pp 140–148
Yang Z, Zhang Y, Yu J, Cai J, Luo J (2018) End-to-end multi-modal multi-task vehicle control for self-driving cars with visual perception. arXiv:1604.073161
Yuan C, Hu W, Tian G, Yang S, Wang H (2013) Multi-task sparse learning with beta process prior for action recognition. In: IEEE conference on computer vision and pattern recognition. pp 423–429
Zeng T, Ji S (2015) Deep convolutional neural networks for multi-instance multi-task learning. In: IEEE ICDM. pp 579–588
Zhang Y, Yang Q (2017) A survey on multi-task learning. arXiv:1604.073162
Zhang Y, Yeung DY (2010) A convex formulation for learning task relationships in multi-task learning. In: Proceedings of the conference on uncertainty in artificial intelligence. pp 733–742
Zhou Q, Wang G, Jia K, Zhao Q (2013) Learning to share latent tasks for action recognition. In: IEEE international conference on computer vision. pp 2264–2271
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
W. Abbas has now joined Cloud Application Solutions Division, Mentor, A Siemens Business and M. F. Khan is now working at SlashNext Inc. M. Taj is also an adjunct faculty at Ontario Tech University, Canada.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Abbas, W., Khan, M.F., Taj, M. et al. Statistically correlated multi-task learning for autonomous driving. Neural Comput & Applic 33, 12921–12938 (2021). https://doi.org/10.1007/s00521-021-05941-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-021-05941-8