Skip to main content
Log in

RoadTransNet: advancing remote sensing road extraction through multi-scale features and contextual information

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

Road extraction is a crucial task that requires high-resolution remote sensing images with wide-ranging applications in urban planning, navigation, and autonomous vehicles. However, this task is challenged by complex road structures and the need to capture long-range dependencies. RoadTransNet is a new road extraction architecture that aims to solve these problems that making the power of the Swin Transformer and Feature Pyramid Network (FPN) while introducing Transformer-like attention mechanisms. RoadTransNet combines a robust convolutional backbone, inspired by the Swin Transformer, with an FPN to capture multi-scale features effectively. The Transformer-like attention mechanisms, including multi-head self-attention and cross-attention, enable the network to represent context information on a local and global scale, ensuring accurate road extraction. The skip connections facilitate gradient flow, preserving fine details, and decoding layers transform extracted features into precise road predictions. Our experiments are conducted using the RoadTransNet, which is subject to rigorous assessment on the following datasets: the DeepGlobe road extraction challenge Dataset and the CHN6-cUG roads dataset. The outcomes indicate its superior performance in achieving high-level metrics of precision and recall, as well as achieving high F1 scores and IoU. The comparative evaluations performed against traditional methods showcase RoadTransNet's ability to capture complex road structures and long-range dependencies. The RoadTransNet stands as a comprehensive solution for the extraction of roads in high-resolution remote sensing images, offering promising opportunities for improving urban planning, navigation systems, and autonomous vehicle technologies. Its success lies in the synergy of convolutional and transformer-based architectures, paving the way for advanced remote sensing applications in smart cities and others.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Availability of data and material

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

  1. Yi, F., Te, R., Zhao, Y., Xu, G.: EUNetMTL: multitask joint learning for road extraction from high-resolution RS images. Remote Sensing Letters. 13(3), 258–268 (2022)

    Article  Google Scholar 

  2. Abdollahi, A., Pradhan, B., Alamri, A.: SC-RoadDeepNet: A new shape and connectivity-preserving road extraction deep learning-based network from remote sensing data. IEEE Trans. Geosci. Remote Sens. 60, 1–15 (2022)

    Article  Google Scholar 

  3. Chen, W., Zhou, G., Liu, Z., Li, X., Zheng, X., Wang, L.: NIGAN: A framework for mountain road extraction integrating remote sensing road-scene neighborhood probability enhancements and improved conditional generative adversarial network. IEEE Trans. Geosci. Remote Sens. 60, 1–15 (2022). https://doi.org/10.1109/TGRS.2022.3188908

    Article  CAS  Google Scholar 

  4. Zhang, Z., Sun, X., Liu, Y.: GMR-Net: road-extraction network based on fusion of local and global information. Remote Sensing. 14(21), 5476 (2022)

    Article  Google Scholar 

  5. Jie, Y., He, H., Xing, K., Yue, A., Tan, W., Yue, C., Jiang, C., Chen, X.: MECA-net: a multiscale feature encoding and long-range context-aware network for road extraction from remote sensing images. Remote Sensing. 14(21), 5342 (2022)

    Article  Google Scholar 

  6. Li, S., Liao, C., Ding, Y., Hu, H., Jia, Y., Chen, M., Xu, B., Ge, X., Liu, T., Wu, D.: Cascaded residual attention enhanced road extraction from remote sensing images. ISPRS Int. J. Geo Inf. 11(1), 9 (2022)

    Article  Google Scholar 

  7. Li, Z., Chen, H., Jing, N., Li, J.: RemainNet: explore road extraction from remote sensing image using mask image modeling. Remote Sensing. 15(17), 4215 (2023). https://doi.org/10.3390/rs15174215

    Article  Google Scholar 

  8. Luo, L., Wang, J.X., Chen, S.B., Tang, J., Luo, B.: BDTNet: Road extraction by bi-direction transformer from remote sensing images. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2022)

    Google Scholar 

  9. Yang, Z., Zhou, D., Yang, Y., Zhang, J., Chen, Z.: TransRoadNet: A novel road extraction method for remote sensing images via combining high-level semantic feature and context. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2022)

    CAS  Google Scholar 

  10. Hu, P.C., Chen, S.B., Huang, L.L., Wang, G.Z., Tang, J., Luo, B.: Road extraction by multi-scale deformable transformer from remote sensing images. IEEE Geosci. Remote. Sens. Lett. (2023)

  11. Wang, Y., Peng, Y., Li, W., Alexandropoulos, G.C., Yu, J., Ge, D., Xiang, W.: DDU-Net: Dual-decoder-U-Net for road extraction using high resolution remote sensing images. IEEE Trans. Geosci. Remote Sens. 60, 1–12 (2022)

    Google Scholar 

  12. Yan, J., Ji, S., Wei, Y.: A combination of convolutional and graph neural networks for regularized road surface extraction. IEEE Trans. Geosci. Remote Sens. 60, 1–13 (2022)

    Google Scholar 

  13. Chandra, N., Vaidya, H., Ghosh, J.K.: Human cognition based framework for detecting roads from remote sensing images. Geocarto Int. 37(8), 2365–2384 (2022)

    Article  Google Scholar 

  14. Abdollahi, A., Pradhan, B., Alamri, A.: VNet: An end-to-end fully convolutional neural network for road extraction from high resolution remote sensing data. IEEE Access. 8, 179424–179436 (2020)

    Article  Google Scholar 

  15. Wei, Y., Zhang, K., Ji, S.: Simultaneous road surface and centerline extraction from large-scale remote sensing images using CNN-based segmentation and tracing. IEEE Trans. Geosci. Remote Sens. 58(12), 8919–8931 (2020)

    Article  Google Scholar 

  16. Luo, Z., Zhou, K., Tan, Y., Wang, X., Zhu, R., Zhang, L.: AD-RoadNet: an auxiliary-decoding road extraction network improving connectivity while preserving multiscale road details. IEEE J. Selected Top. Appl. Earth Observ. Remote Sens (2023)

  17. Yin, A., Ren, C., Yan, Z., Xue, X., Zhou, Y., Liu, Y., Lu, J., Ding, C.: C2S-RoadNet: road extraction model with depth-wise separable convolution and self-attention. Remote Sens. 15(18), 4531 (2023)

  18. Yang, Z.X., You, Z.H., Chen, S.B., Tang, J., Luo, B.: Semi-supervised edge-aware road extraction via cross teaching between CNN and transformer. IEEE J. Selected Top. Appl. Earth Observ. Remote Sens. (2023)

  19. Jiang, X., Li, Y., Jiang, T., Xie, J., Wu, Y., Cai, Q., Jiang, J., Xu, J., Zhang, H.: RoadFormer: pyramidal deformable vision transformers for road network extraction with remote sensing images. Int. J. Appl. Earth Observ. Geoinf. 113, 102987 (2022). https://doi.org/10.1016/j.jag.2022.102987

  20. Christophe, E., Inglada, J.: Robust road extraction for high resolution satellite images. In 2007 IEEE International Conference on Image Processing. IEEE. 5, V-437 (2007, September)

  21. https://www.kaggle.com/datasets/balraj98/deepglobe-road-extraction-dataset

  22. Demir, I., Koperski, K., Lindenbaum, D., Pang, G., Huang, J., Basu, S., Hughes, F., Tuia, D., Raskar, R.: Deepglobe 2018: A challenge to parse the earth through satellite images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp .172–181 (2018)

  23. https://www.kaggle.com/datasets/hithere016/chn6-roads-dataset

  24. Dai, L., Zhang, G., Zhang, R.: RADANet: road augmented deformable attention network for road extraction from complex high-resolution remote-sensing images. IEEE Trans. Geosci. Remote Sens. 61, 1–13 (2023)

    Google Scholar 

  25. Jing, Y., Zhang, T., Liu, Z., Hou, Y., Sun, C.: Swin-ResUNet+: An edge enhancement module for road extraction from remote sensing images. Comput. Vis. Image Understand. 103807 (2023)

  26. Li, R., Chen, T., Liu, Y., Jiang, H.: CoupleUNet: Swin Transformer coupling CNNs makes strong contextual encoders for VHR image road extraction. Int. J. Remote Sens. 44(18), 5788–5813 (2023)

    Article  Google Scholar 

  27. Miao, C., Zhang, Z., Tian, Q.: TransLinkNet: LinkNet with transformer for road extraction. In: International Conference on Optics and Machine Vision (ICOMV 2022). SPIE. 12173,138–143 (2022, May)

  28. Tao, J., Chen, Z., Sun, Z., Guo, H., Leng, B., Yu, Z., Wang, Y., He, Z., Lei, X., Yang, J.: Seg-Road: a segmentation network for road extraction based on transformer and CNN with connectivity structures. Remote Sens. 15(6), 1602 (2023)

    Article  Google Scholar 

  29. Lan, M., Zhang, Y., Zhang, L., Du, B.: Global context based automatic road segmentation via dilated convolutional neural network. Inf. Sci. 535, 156–171 (2020)

    Article  MathSciNet  Google Scholar 

Download references

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

KMK agreed on the content of the study. KMK collected all the data for analysis. KMK agreed on the methodology. KMK completed the analysis based on agreed steps. Results and conclusions are discussed and written together. The author read and approved the final manuscript.

Corresponding author

Correspondence to K. Madhan Kumar.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethics approval

This article does not contain any studies with human participants.

Human and animal rights

This article does not contain any studies with human or animal subjects performed by any of the authors.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kumar, K.M. RoadTransNet: advancing remote sensing road extraction through multi-scale features and contextual information. SIViP 18, 2403–2412 (2024). https://doi.org/10.1007/s11760-023-02916-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-023-02916-1

Keywords

Navigation