Abstract
Road extraction is a crucial task that requires high-resolution remote sensing images with wide-ranging applications in urban planning, navigation, and autonomous vehicles. However, this task is challenged by complex road structures and the need to capture long-range dependencies. RoadTransNet is a new road extraction architecture that aims to solve these problems that making the power of the Swin Transformer and Feature Pyramid Network (FPN) while introducing Transformer-like attention mechanisms. RoadTransNet combines a robust convolutional backbone, inspired by the Swin Transformer, with an FPN to capture multi-scale features effectively. The Transformer-like attention mechanisms, including multi-head self-attention and cross-attention, enable the network to represent context information on a local and global scale, ensuring accurate road extraction. The skip connections facilitate gradient flow, preserving fine details, and decoding layers transform extracted features into precise road predictions. Our experiments are conducted using the RoadTransNet, which is subject to rigorous assessment on the following datasets: the DeepGlobe road extraction challenge Dataset and the CHN6-cUG roads dataset. The outcomes indicate its superior performance in achieving high-level metrics of precision and recall, as well as achieving high F1 scores and IoU. The comparative evaluations performed against traditional methods showcase RoadTransNet's ability to capture complex road structures and long-range dependencies. The RoadTransNet stands as a comprehensive solution for the extraction of roads in high-resolution remote sensing images, offering promising opportunities for improving urban planning, navigation systems, and autonomous vehicle technologies. Its success lies in the synergy of convolutional and transformer-based architectures, paving the way for advanced remote sensing applications in smart cities and others.
Similar content being viewed by others
Availability of data and material
The data that support the findings of this study are available from the corresponding author upon reasonable request.
References
Yi, F., Te, R., Zhao, Y., Xu, G.: EUNetMTL: multitask joint learning for road extraction from high-resolution RS images. Remote Sensing Letters. 13(3), 258–268 (2022)
Abdollahi, A., Pradhan, B., Alamri, A.: SC-RoadDeepNet: A new shape and connectivity-preserving road extraction deep learning-based network from remote sensing data. IEEE Trans. Geosci. Remote Sens. 60, 1–15 (2022)
Chen, W., Zhou, G., Liu, Z., Li, X., Zheng, X., Wang, L.: NIGAN: A framework for mountain road extraction integrating remote sensing road-scene neighborhood probability enhancements and improved conditional generative adversarial network. IEEE Trans. Geosci. Remote Sens. 60, 1–15 (2022). https://doi.org/10.1109/TGRS.2022.3188908
Zhang, Z., Sun, X., Liu, Y.: GMR-Net: road-extraction network based on fusion of local and global information. Remote Sensing. 14(21), 5476 (2022)
Jie, Y., He, H., Xing, K., Yue, A., Tan, W., Yue, C., Jiang, C., Chen, X.: MECA-net: a multiscale feature encoding and long-range context-aware network for road extraction from remote sensing images. Remote Sensing. 14(21), 5342 (2022)
Li, S., Liao, C., Ding, Y., Hu, H., Jia, Y., Chen, M., Xu, B., Ge, X., Liu, T., Wu, D.: Cascaded residual attention enhanced road extraction from remote sensing images. ISPRS Int. J. Geo Inf. 11(1), 9 (2022)
Li, Z., Chen, H., Jing, N., Li, J.: RemainNet: explore road extraction from remote sensing image using mask image modeling. Remote Sensing. 15(17), 4215 (2023). https://doi.org/10.3390/rs15174215
Luo, L., Wang, J.X., Chen, S.B., Tang, J., Luo, B.: BDTNet: Road extraction by bi-direction transformer from remote sensing images. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2022)
Yang, Z., Zhou, D., Yang, Y., Zhang, J., Chen, Z.: TransRoadNet: A novel road extraction method for remote sensing images via combining high-level semantic feature and context. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2022)
Hu, P.C., Chen, S.B., Huang, L.L., Wang, G.Z., Tang, J., Luo, B.: Road extraction by multi-scale deformable transformer from remote sensing images. IEEE Geosci. Remote. Sens. Lett. (2023)
Wang, Y., Peng, Y., Li, W., Alexandropoulos, G.C., Yu, J., Ge, D., Xiang, W.: DDU-Net: Dual-decoder-U-Net for road extraction using high resolution remote sensing images. IEEE Trans. Geosci. Remote Sens. 60, 1–12 (2022)
Yan, J., Ji, S., Wei, Y.: A combination of convolutional and graph neural networks for regularized road surface extraction. IEEE Trans. Geosci. Remote Sens. 60, 1–13 (2022)
Chandra, N., Vaidya, H., Ghosh, J.K.: Human cognition based framework for detecting roads from remote sensing images. Geocarto Int. 37(8), 2365–2384 (2022)
Abdollahi, A., Pradhan, B., Alamri, A.: VNet: An end-to-end fully convolutional neural network for road extraction from high resolution remote sensing data. IEEE Access. 8, 179424–179436 (2020)
Wei, Y., Zhang, K., Ji, S.: Simultaneous road surface and centerline extraction from large-scale remote sensing images using CNN-based segmentation and tracing. IEEE Trans. Geosci. Remote Sens. 58(12), 8919–8931 (2020)
Luo, Z., Zhou, K., Tan, Y., Wang, X., Zhu, R., Zhang, L.: AD-RoadNet: an auxiliary-decoding road extraction network improving connectivity while preserving multiscale road details. IEEE J. Selected Top. Appl. Earth Observ. Remote Sens (2023)
Yin, A., Ren, C., Yan, Z., Xue, X., Zhou, Y., Liu, Y., Lu, J., Ding, C.: C2S-RoadNet: road extraction model with depth-wise separable convolution and self-attention. Remote Sens. 15(18), 4531 (2023)
Yang, Z.X., You, Z.H., Chen, S.B., Tang, J., Luo, B.: Semi-supervised edge-aware road extraction via cross teaching between CNN and transformer. IEEE J. Selected Top. Appl. Earth Observ. Remote Sens. (2023)
Jiang, X., Li, Y., Jiang, T., Xie, J., Wu, Y., Cai, Q., Jiang, J., Xu, J., Zhang, H.: RoadFormer: pyramidal deformable vision transformers for road network extraction with remote sensing images. Int. J. Appl. Earth Observ. Geoinf. 113, 102987 (2022). https://doi.org/10.1016/j.jag.2022.102987
Christophe, E., Inglada, J.: Robust road extraction for high resolution satellite images. In 2007 IEEE International Conference on Image Processing. IEEE. 5, V-437 (2007, September)
https://www.kaggle.com/datasets/balraj98/deepglobe-road-extraction-dataset
Demir, I., Koperski, K., Lindenbaum, D., Pang, G., Huang, J., Basu, S., Hughes, F., Tuia, D., Raskar, R.: Deepglobe 2018: A challenge to parse the earth through satellite images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp .172–181 (2018)
https://www.kaggle.com/datasets/hithere016/chn6-roads-dataset
Dai, L., Zhang, G., Zhang, R.: RADANet: road augmented deformable attention network for road extraction from complex high-resolution remote-sensing images. IEEE Trans. Geosci. Remote Sens. 61, 1–13 (2023)
Jing, Y., Zhang, T., Liu, Z., Hou, Y., Sun, C.: Swin-ResUNet+: An edge enhancement module for road extraction from remote sensing images. Comput. Vis. Image Understand. 103807 (2023)
Li, R., Chen, T., Liu, Y., Jiang, H.: CoupleUNet: Swin Transformer coupling CNNs makes strong contextual encoders for VHR image road extraction. Int. J. Remote Sens. 44(18), 5788–5813 (2023)
Miao, C., Zhang, Z., Tian, Q.: TransLinkNet: LinkNet with transformer for road extraction. In: International Conference on Optics and Machine Vision (ICOMV 2022). SPIE. 12173,138–143 (2022, May)
Tao, J., Chen, Z., Sun, Z., Guo, H., Leng, B., Yu, Z., Wang, Y., He, Z., Lei, X., Yang, J.: Seg-Road: a segmentation network for road extraction based on transformer and CNN with connectivity structures. Remote Sens. 15(6), 1602 (2023)
Lan, M., Zhang, Y., Zhang, L., Du, B.: Global context based automatic road segmentation via dilated convolutional neural network. Inf. Sci. 535, 156–171 (2020)
Funding
Not applicable.
Author information
Authors and Affiliations
Contributions
KMK agreed on the content of the study. KMK collected all the data for analysis. KMK agreed on the methodology. KMK completed the analysis based on agreed steps. Results and conclusions are discussed and written together. The author read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethics approval
This article does not contain any studies with human participants.
Human and animal rights
This article does not contain any studies with human or animal subjects performed by any of the authors.
Informed consent
Informed consent was obtained from all individual participants included in the study.
Consent to participate
Not applicable.
Consent for publication
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kumar, K.M. RoadTransNet: advancing remote sensing road extraction through multi-scale features and contextual information. SIViP 18, 2403–2412 (2024). https://doi.org/10.1007/s11760-023-02916-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-023-02916-1