Abstract
Lane detection is one of the most fundamental tasks for autonomous driving. It plays a crucial role in the lateral control and the precise localization of autonomous vehicles. Monocular 3D lane detection methods provide state-of-the-art results for estimating the position of lanes in 3D world coordinates using only the information obtained from the front-view camera. Recent advances in Neural Architecture Search (NAS) facilitate automated optimization of various computer vision tasks. NAS can automatically optimize monocular 3D lane detection methods to enhance the extraction and combination of visual features, consequently reducing computation loads and increasing accuracy. This paper proposes 3DLaneNAS, a multi-objective method that enhances the accuracy of monocular 3D lane detection for both short- and long-distance scenarios while at the same time providing a fair amount of hardware acceleration. 3DLaneNAS utilizes a new multi-objective energy function to optimize the architecture of feature extraction and feature fusion modules simultaneously. Moreover, a transfer learning mechanism is used to improve the convergence of the search process. Experimental results reveal that 3DLaneNAS yields a minimum of 5.2% higher accuracy and \(\approx \)1.33\(\times \) lower latency over competing methods on the synthetic-3D-lanes dataset. Code is at https://github.com/alizoljodi/3DLaneNAS
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Amine, K.: Multiobjective simulated annealing: principles and algorithm variants. Adv. Oper. Res. 2019 (2019)
Bai, M., Mattyus, G., Homayounfar, N., Wang, S., Lakshmikanth, S.K., Urtasun, R.: Deep multi-sensor lane detection. In: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 3102–3109. IEEE (2018)
Borji, A.: Vanishing point detection with convolutional neural networks. arXiv preprint arXiv:1609.00967 (2016)
Cai, H., Gan, C., Wang, T., Zhang, Z., Han, S.: Once-for-all: train one network and specialize it for efficient deployment. arXiv preprint arXiv:1908.09791 (2019)
Chen, P.Y., Lee, C.M., Yeh, H.Z., Huang, Y.C.: Design and implementation for a vision-guided wheeled mobile robot system. In: 2018 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW), pp. 1–5 (2018). https://doi.org/10.1109/ICCE-China.2018.8448543
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255 (2009). https://doi.org/10.1109/CVPR.2009.5206848
Dong, X., Liu, L., Musial, K., Gabrys, B.: Nats-bench: benchmarking nas algorithms for architecture topology and size. IEEE Trans. Pattern Anal. Mach. Intell. 44(7), 3634–3646 (2021). https://doi.org/10.1109/TPAMI.2021.3054824
Elsken, T., Metzen, J.H., Hutter, F.: Neural architecture search: a survey. J. Mach. Learn. Res. 20(1), 1997–2017 (2019)
Foundation, B.: Home of the blender project - free and open 3D creation software. https://www.blender.org/
Garnett, N., Cohen, R., Pe’er, T., Lahav, R., Levi, D.: 3d-lanenet: end-to-end 3d multiple lane detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2921–2930 (2019)
Guo, Y., et al.: Gen-lanenet: a generalized and scalable approach for 3D lane detection. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12366, pp. 666–681. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58589-1_40
Gurghian, A., Koduri, T., Bailur, S.V., Carey, K.J., Murali, V.N.: Deeplanes: end-to-end lane position estimation using deep neural networks. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 38–45 (2016). https://doi.org/10.1109/CVPRW.2016.12
He, X., Zhao, K., Chu, X.: Automl: a survey of the state-of-the-art. Knowl.-Based Syst. 212, 106622 (2021)
Hsu, C.H., et al.: Monas: multi-objective neural architecture search using reinforcement learning. arXiv preprint arXiv:1806.10332 (2018)
Jaderberg, M., Simonyan, K., Zisserman, A., et al.: Spatial transformer networks. In: Advances in Neural Information Processing Systems 28 (2015)
Jung, J., Bae, S.H.: Real-time road lane detection in urban areas using lidar data. Electronics 7(11), 276 (2018)
Kheyrollahi, A., Breckon, T.P.: Automatic real-time road marking recognition using a feature driven approach. Mach. Vis. Appl. 23(1), 123–133 (2012). https://doi.org/10.1007/s00138-010-0289-5
Lee, S., et al.: Vpgnet: vanishing point guided network for lane and road marking detection and recognition. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1947–1955 (2017)
Lindauer, M., Hutter, F.: Best practices for scientific research on neural architecture search. J. Mach. Learn. Res. 21(243), 1–18 (2020)
Liu, C., et al.: Auto-deeplab: hierarchical neural architecture search for semantic image segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 82–92 (2019)
Liu, H., Simonyan, K., Yang, Y.: Darts: differentiable architecture search. arXiv preprint arXiv:1806.09055 (2018)
Loni, M., Mousavi, H., Riazati, M., Daneshtalab, M., Sjödin, M.: Tas: ternarized neural architecture search for resource-constrained edge devices. In: Design, Automation & Test in Europe Conference & Exhibition DATE 2022, 14 March 2022, Antwerp, Belgium. IEEE (2022). http://www.es.mdh.se/publications/6351-
Loni, M., Sinaei, S., Zoljodi, A., Daneshtalab, M., Sjödin, M.: Deepmaker: a multi-objective optimization framework for deep neural networks in embedded systems. Microprocess. Microsyst. 73, 102989 (2020)
Loni, M., et al.: Densedisp: resource-aware disparity map estimation by compressing siamese neural architecture. In: 2020 IEEE Congress on Evolutionary Computation (CEC), pp. 1–8 (2020). https://doi.org/10.1109/CEC48606.2020.9185611
Loni, M., et al.: Faststereonet: a fast neural architecture search for improving the inference of disparity estimation on resource-limited platforms. IEEE Trans. Syst. Man Cybern. Syst. 52(8), 1–13 (2021). https://doi.org/10.1109/TSMC.2021.3123136
Loni, M., Zoljodi, A., Sinaei, S., Daneshtalab, M., Sjödin, M.: NeuroPower: designing energy efficient convolutional neural network architecture for embedded systems. In: Tetko, I.V., Kürková, V., Karpov, P., Theis, F. (eds.) ICANN 2019. LNCS, vol. 11727, pp. 208–222. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30487-4_17
Mallot, H.A., Bülthoff, H.H., Little, J., Bohrer, S.: Inverse perspective mapping simplifies optical flow computation and obstacle detection. Biol. Cybern. 64(3), 177–185 (1991)
Nedevschi, S., Oniga, F., Danescu, R., Graf, T., Schmidt, R.: Increased accuracy stereo approach for 3D lane detection. In: 2006 IEEE Intelligent Vehicles Symposium, pp. 42–49. IEEE (2006)
Nedevschi, S., et al.: 3D lane detection system based on stereovision. In: Proceedings. The 7th International IEEE Conference on Intelligent Transportation Systems (IEEE Cat. No. 04TH8749), pp. 161–166. IEEE (2004)
Padilla, R., Netto, S., da Silva, E.: A survey on performance metrics for object-detection algorithms (2020). https://doi.org/10.1109/IWSSIP48289.2020
Pizzati, F., García, F.: Enhanced free space detection in multiple lanes based on single cnn with scene identification. In: 2019 IEEE Intelligent Vehicles Symposium (IV), pp. 2536–2541 (2019). https://doi.org/10.1109/IVS.2019.8814181
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Tang, J., Li, S., Liu, P.: A review of lane detection methods based on deep learning. Pattern Recogn. 111, 107623 (2021)
White, C., Nolen, S., Savani, Y.: Exploring the loss landscape in neural architecture search. In: Uncertainty in Artificial Intelligence, pp. 654–664. PMLR (2021)
Xu, H., Wang, S., Cai, X., Zhang, W., Liang, X., Li, Z.: CurveLane-NAS: unifying lane-sensitive architecture search and adaptive point blending. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12360, pp. 689–704. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58555-6_41
Zhang, W., Mahale, T.: End to end video segmentation for driving: lane detection for autonomous car. arXiv preprint arXiv:1812.05914 (2018)
Zoph, B., Le, Q.V.: Neural architecture search with reinforcement learning. arXiv preprint arXiv:1611.01578 (2016)
Zoph, B., Vasudevan, V., Shlens, J., Le, Q.V.: Learning transferable architectures for scalable image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8697–8710 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zoljodi, A., Loni, M., Abadijou, S., Alibeigi, M., Daneshtalab, M. (2022). 3DLaneNAS: Neural Architecture Search for Accurate and Light-Weight 3D Lane Detection. In: Pimenidis, E., Angelov, P., Jayne, C., Papaleonidas, A., Aydin, M. (eds) Artificial Neural Networks and Machine Learning – ICANN 2022. ICANN 2022. Lecture Notes in Computer Science, vol 13529. Springer, Cham. https://doi.org/10.1007/978-3-031-15919-0_34
Download citation
DOI: https://doi.org/10.1007/978-3-031-15919-0_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-15918-3
Online ISBN: 978-3-031-15919-0
eBook Packages: Computer ScienceComputer Science (R0)