
High-order deep infomax-guided deformable transformer network for efficient lane detection

  • Original Paper
  • Published in Signal, Image and Video Processing

Abstract

With the development of deep learning, lane detection models based on deep convolutional neural networks have been widely used in autonomous driving systems and advanced driver assistance systems. However, in harsh and complex environments, the performance of detection models degrades greatly, owing to the difficulty of merging long-range lane points with global context and to the exclusion of important higher-order information. To address these issues, we propose a new learning model, called the Deformable Transformer with high-order Deep Infomax (DTHDI) model, to better capture lane features. Specifically, we propose a Deformable Transformer neural network based on segmentation techniques for high-accuracy detection, in which local and global contextual information is seamlessly fused and more information about the diversity of lane-line shape features is retained, yielding rich lane features. Meanwhile, we introduce a mutual information maximization approach that mines higher-order correlations among the global shape, local shape, and position of lane lines, learning more discriminative lane-line representations. In addition, we employ a row classification approach to further reduce computational complexity for robust lane-line detection. Our model is evaluated on two popular lane detection datasets, and the empirical results show that the proposed DTHDI model outperforms state-of-the-art methods.
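The row-classification idea mentioned in the abstract can be illustrated with a minimal sketch: instead of labeling every pixel, the model predicts, for each image row, which of a small set of column anchors a lane passes through, plus a background class for rows containing no lane, which is what keeps inference cheap. The code below is our own illustration of the decoding step under assumed tensor shapes, not the authors' implementation; `decode_lane_rows` and `col_anchors` are hypothetical names.

```python
import numpy as np

def decode_lane_rows(logits, col_anchors):
    """Decode per-row lane x-coordinates from row-classification logits.

    logits: (num_rows, num_cols + 1) array of raw scores; the extra
            last column is a "no lane in this row" background class.
    col_anchors: (num_cols,) array of candidate x positions.
    Returns a (num_rows,) array with the expected x position per row,
    or NaN where the background class wins.
    """
    num_cols = len(col_anchors)
    # Softmax over the location classes only (background excluded),
    # shifted by the row maximum for numerical stability.
    loc = logits[:, :num_cols]
    loc = loc - loc.max(axis=1, keepdims=True)
    probs = np.exp(loc)
    probs /= probs.sum(axis=1, keepdims=True)
    # Soft-argmax: expected x position under the row's distribution.
    xs = probs @ col_anchors
    # Rows whose highest raw logit is the background class carry no lane.
    background = logits.argmax(axis=1) == num_cols
    xs[background] = np.nan
    return xs
```

The soft-argmax gives sub-anchor precision from a coarse column grid, which is one common way such row-wise formulations trade resolution for speed.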



Data availability

All datasets used in this work are publicly available and can be downloaded from their official websites.


Funding

The work described in this paper was supported by the Open Foundation of the State Key Laboratory for Novel Software Technology at Nanjing University, P. R. China (No. KFKT2021B12). It was also supported in part by the Future Network Scientific Research Fund Project (FNSRFP-2021-YB-54), the Natural Science Foundation of the Higher Education Institutions of Jiangsu Province (17KJB520028), Tongda College of Nanjing University of Posts and Telecommunications (XK203XZ21001), the Major Science and Technology Project of Jilin Province, China (20210301030GX), and the Key Research and Development Program of Hubei Province, China (2021BAA179 and 2022BAA079). The numerical calculations in this paper were performed on the supercomputing system at the Supercomputing Center of Wuhan University.

Author information

Authors and Affiliations

Authors

Contributions

RG: conceptualization, methodology, software. SH: data curation, writing (original draft preparation). LY: supervision, writing. LZ: supervision, writing (review and editing). HR: review, editing. YY: supervision, writing (review and editing). ZY: review, editing.

Corresponding author

Correspondence to Li Zhang.

Ethics declarations

Conflicts of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.


About this article


Cite this article

Gao, R., Hu, S., Yan, L. et al. High-order deep infomax-guided deformable transformer network for efficient lane detection. SIViP 17, 3045–3052 (2023). https://doi.org/10.1007/s11760-023-02525-y

