Skip to main content
Log in

Using contour loss constraining residual attention U-net on optical remote sensing interpretation

  • Original article
  • Published:
The Visual Computer Aims and scope Submit manuscript

Abstract

Using deep learning in remote sensing interpretation could reduce a lot of human and material costs. Semantic segmentation is the main method for this task. It can automatically outline the objects and it has recently achieved great success in remote sensing images. However, in the appliance of remote sensing interpretation, the accuracy of contour largely determines the evaluation of remote sensing interpretation. Though the current loss functions reflect the segmentation performance, they could not guide the model to optimize itself toward a more precise contour. This paper proposed an exactly defined contour loss (CL) for remote sensing interpretation with Residual Attention U-Net (RA U-Net) as the main framework. The RA U-Net uses the residual attention module as the skip connection layer. It enhances the judgment of U-Net. In CL, image processing methods are used to extract the contours of the foreground. And elements-sum and elements-subtract operations are used to transfer the contour information to a matrix of the same size as label images. Then, these matrices would be the weights for CE. By assigning different weights for different elements in different regions, this function will guide the model to reach a balance between accurate segmentation results and precise contours. The experiment on open datasets shows a good performance. The proposed model was also trained on the Construction Disturbance Dataset collected from Jiang Xi Province, China. The dataset was labeled manually. The evaluation enhanced a lot on the Construction Disturbance Dataset and the IoU on two datasets increased \(1\%\) to \(2\%\) when using CL as the loss function. This paper also compared the proposed method with other state-of-the-art methods and the results showed extensive effectiveness.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data availability

The Mnih Massachusetts Building Dataset used or analyzed during the current study are available from the corresponding author on reasonable request. Another Construction Disturbance Dataset is not publicly available until permission is granted by the provider.

Code availability

The relevant codes are available.

References

  1. Badrinarayanan, V., Kendall, A., Cipolla, R.: Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017). https://doi.org/10.1109/TPAMI.2016.2644615

    Article  Google Scholar 

  2. Bao, Y., Liu, W., Gao, O., Lin, Z., Hu, Q.: A semantic segmentation method for remote sensing images. In: 2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC), vol. 4, pp. 1858–1862 (2021). https://doi.org/10.1109/IMCEC51613.2021.9482266

  3. Chen, C., Jiange, J., Rufei, F., Lanlan, C., Cong, L., Shaohua, W.: An intelligent caching strategy considering time-space characteristics in vehicular nameddata networks. IEEE Trans. Intell. Transp. Syst. (2021). https://doi.org/10.1109/TITS.2021.3128012

  4. Chen, C., Zhang, Y., Wang, Z., Wan, S., Pei, Q.: Distributed computation offloading method based on deep reinforcement learning in ICV. Appl. Soft Comput. 103, 107108 (2021)

    Article  Google Scholar 

  5. Chen, C., Jiang, J., Zhou, Y., Lv, N., Liang, X., Wan, S.: An edge intelligence empowered flooding process prediction using internet of things in smart city. J. Parallel Distrib. Comput. 165, 66–78 (2022)

    Article  Google Scholar 

  6. Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans. Pattern Anal. Mach. Intell. 40(4), 834–848 (2017)

    Article  Google Scholar 

  7. Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 801–818 (2018)

  8. Chen, Z., Zhou, H., Xie, X., Lai, J.: Contour loss: Boundary-aware learning for salient object segmentation. arXiv:1908.01975 (2019)

  9. Cheng, Z., Qu, A., He, X.: Contour-aware semantic segmentation network with spatial attention mechanism for medical image. Vis. Comput. 38(3), 749–762 (2022)

    Article  Google Scholar 

  10. Cong, W., Chen, C., Qingqi, P., Zhiyuan, J., Shugong, X.: An information centric in-network caching scheme for 5g-enabled internet of connected vehicles. IEEE Trans. Mob. Comput. (2021). https://doi.org/10.1109/TMC.2021.3137219

  11. Di Martino, T., Lenormand, M., Koeniguer, E.C.: Multi-branch deep learning model for detection of settlements without electricity. In: 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, pp. 1847–1850. https://doi.org/10.1109/IGARSS47720.2021.9554286 (2021)

  12. Farhangfar, S., Rezaeian, M.: Semantic segmentation of aerial images using FCN-based network. In: 2019 27th Iranian Conference on Electrical Engineering (ICEE), pp. 1864–1868. https://doi.org/10.1109/IranianCEE.2019.8786455 (2019)

  13. Feng, C., Liu, B., Yu, K., Goudos, S.K., Wan, S.: Blockchain-empowered decentralized horizontal federated learning for 5G-enabled UAVs. IEEE Trans. Ind. Inform. 1 (2021). https://doi.org/10.1109/TII.2021.3116132

  14. Goel, A., Banerjee, B., Pizurica, A.: Hierarchical metric learning for optical remote sensing scene categorization. IEEE Geosci. Remote Sens. Lett. 1–5 (2018)

  15. Goldberg, M., Shlien, S.: A clustering scheme for multispectral images. IEEE Trans. Syst. Man Cybern. 8(2), 86–92 (1978). https://doi.org/10.1109/TSMC.1978.4309905

    Article  Google Scholar 

  16. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. IEEE

  17. Hu, J., Chen, C., Cai, L., Khosravi, M.R., Pei, Q., Wan, S.: UAV-assisted vehicular edge computing for the 6g internet of vehicles: architecture, intelligence, and challenges. IEEE Commun. Stand. Mag. 5(2), 12–18 (2021)

    Article  Google Scholar 

  18. Iandola, F.N., Moskewicz, M.W., Karayev, S., Girshick, R.B., Darrell, T., Keutzer, K.: Densenet: Implementing efficient convnet descriptor pyramids. CoRR arXiv:1404.1869 (2014)

  19. Jiang, M., Zhai, F., Kong, J.: Sparse attention module for optimizing semantic segmentation performance combined with a multi-task feature extraction network. Vis. Comput. 1–16 (2021)

  20. Kai, Y., Jiahang, L., Lu, Z.: An adaptive multi-threshold image segmentation algorithm based on object-oriented classification for high-resolution remote sensing images. In: Optical Sensing and Imaging Technology and Applications (2017)

  21. Karimi, D., Salcudean, S.E.: Reducing the hausdorff distance in medical image segmentation with convolutional neural networks. IEEE Trans. Med. Imaging 39(2), 499–513 (2020). https://doi.org/10.1109/TMI.2019.2930068

    Article  Google Scholar 

  22. Li, X., Du, Z., Huang, Y., Tan, Z.: A deep translation (GAN) based change detection network for optical and SAR remote sensing images. ISPRS J. Photogram. Remote Sens. 179, 14–34 (2021)

    Article  Google Scholar 

  23. Li, Z., Guo, Y.: Semantic segmentation of landslide images in Nyingchi region based on PSPNet network. In: 2020 7th International Conference on Information Science and Control Engineering (ICISCE), pp. 1269–1273. (2020). https://doi.org/10.1109/ICISCE50968.2020.00256

  24. Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 2999–3007 (2017). https://doi.org/10.1109/ICCV.2017.324

  25. Liu, Y., Fan, B., Wang, L., Bai, J., Xiang, S., Pan, C.: Semantic labeling in very high resolution images via a self-cascaded convolutional neural network. ISPRS J. Photogram. Remote Sens. 145, 78–95 (2018)

    Article  Google Scholar 

  26. Lu, L., Wang, C., Yin, X.: Incorporating texture into SLIC super-pixels method for high spatial resolution remote sensing image segmentation. In: 2019 8th International Conference on Agro-Geoinformatics (Agro-Geoinformatics), pp. 1–5 (2019). https://doi.org/10.1109/Agro-Geoinformatics.2019.8820692

  27. Lv, N., Ma, H., Chen, C., Pei, Q., Zhou, Y., Xiao, F., Li, J.: Remote sensing data augmentation through adversarial training. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 14, 9318–9333 (2021). https://doi.org/10.1109/JSTARS.2021.3110842

    Article  Google Scholar 

  28. Ma, J.: Segmentation loss odyssey. arXiv:2005.13449 (2020)

  29. Malik, R., Kheddam, R., Belhadj-Aissa, A.: Toward an optimal object-oriented image classification using SVM and MLLH approaches. In: 2015 First International Conference on New Technologies of Information and Communication (NTIC), pp. 1–6 (2015). https://doi.org/10.1109/NTIC.2015.7368750

  30. Milletari, F., Navab, N., Ahmadi, S.A.: V-net: Fully convolutional neural networks for volumetric medical image segmentation. In: 2016 Fourth International Conference on 3D Vision (3DV), pp. 565–571 (2016). https://doi.org/10.1109/3DV.2016.79

  31. Ming, D., Ci, T., Cai, H., Li, L., Qiao, C., Du, J.: Semivariogram-based spatial bandwidth selection for remote sensing image segmentation with mean-shift algorithm. IEEE Geosci. Remote Sens. Lett. 9(5), 813–817 (2012). https://doi.org/10.1109/LGRS.2011.2182604

    Article  Google Scholar 

  32. Mnih, V.: Mnih Massachusetts building dataset. http://www.cs.toronto.edu/~vmnih/data/ (2013)

  33. Rahman, M.A., Yang, W.: Optimizing intersection-over-union in deep neural networks for image segmentation. In: International Symposium on Visual Computing (2016)

  34. Rekkas, V.P., Sotiroudis, S., Sarigiannidis, P., Wan, S., Karagiannidis, G.K., Goudos, S.K.: Machine learning in beyond 5G/6G networks-state-of-the-art and future trends. Electronics 10(22), 2786 (2021)

    Article  Google Scholar 

  35. Ren, J., Tong, L., Li, Y., Yuan, L., Si, Y.: Improved unet combining dropout and acnet for remote sensing image change detection. In: 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, pp. 4380–4383 (2021). https://doi.org/10.1109/IGARSS47720.2021.9553666

  36. Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015, pp. 234–241. Springer International Publishing, Cham (2015)

    Google Scholar 

  37. Saxena, N., N K.B., Raman, B.: Semantic segmentation of multispectral images using res-seg-net model. In: 2020 IEEE 14th International Conference on Semantic Computing (ICSC), pp. 154–157 (2020). https://doi.org/10.1109/ICSC.2020.00030

  38. Sw, A., Sd, B., Chen, C.C.: Edge computing enabled video segmentation for real-time traffic monitoring in internet of vehicles. Pattern Recognit. (2021)

  39. Taghanaki, S.A., Zheng, Y., Kevin, Z.S., Georgescu, B., Sharma, P., Xu, D., Comaniciu, D., Hamarneh, G.: Combo loss: handling input and output imbalance in multi-organ segmentation. Comput. Med. Imaging Graphics 75, 24 (2019)

    Article  Google Scholar 

  40. Tan, M., Le, Q.: Efficientnet: Rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning, PMLR, pp. 6105–6114 (2019)

  41. Wang, F., Jiang, M., Qian, C., Yang, S., Li, C., Zhang, H., Wang, X., Tang, X.: Residual attention network for image classification. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6450–6458 (2017). https://doi.org/10.1109/CVPR.2017.683

  42. Wu, G., Guo, Z., Shao, X., Shibasaki, R.: Geoseg: A computer vision package for automatic building segmentation and outline extraction. In: IGARSS 2019–2019 IEEE International Geoscience and Remote Sensing Symposium, pp. 158–161 (2019). https://doi.org/10.1109/IGARSS.2019.8900475

  43. Wu, Z., Shen, C., Hengel, A.: Bridging category-level and instance-level semantic image segmentation. arXiv:1605.06885 (2016)

  44. Xiang, D., Tang, T., Hu, C., Li, Y., Su, Y.: A kernel clustering algorithm with fuzzy factor: Application to SAR image segmentation. IEEE Geosci. Remote Sens. Lett. 11(7), 1290–1294 (2014). https://doi.org/10.1109/LGRS.2013.2292820

    Article  Google Scholar 

  45. Xia, X., Xu, C., Nan, B.: Inception-v3 for flower classification. In: 2017 2nd International Conference on Image, Vision and Computing (ICIVC), pp. 783–787 (2017). https://doi.org/10.1109/ICIVC.2017.7984661

  46. Xu, G., Yang, L., Liu, X., Li, R.: Research of road extraction based on hough transformation and morphology. In: 2012 International Conference on Computer Science and Service System, pp. 2261–2264 (2012). https://doi.org/10.1109/CSSS.2012.561

  47. Zeng, X., Chen, I., Liu, P.: Improve semantic segmentation of remote sensing images with k-mean pixel clustering: A semantic segmentation post-processing method based on k-means clustering. In: 2021 IEEE International Conference on Computer Science, Artificial Intelligence and Electronic Engineering (CSAIEE), pp. 231–235. (2021) https://doi.org/10.1109/CSAIEE54046.2021.9543336

  48. Zhang, H., Zhu, Q., Guan, X.: Probe into image segmentation based on sobel operator and maximum entropy algorithm. In: 2012 International Conference on Computer Science and Service System, pp. 238–241 (2012). https://doi.org/10.1109/CSSS.2012.67

  49. Zhou, Z., Siddiquee, M.M.R., Tajbakhsh, N., Liang, J.: Unet++: A nested u-net architecture for medical image segmentation. In: Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, pp. 3–11. Springer (2018)

Download references

Acknowledgements

This work was supported by the National Key Research and Development Program of China (2019YFE0196600), the National Natural Science Foundation of China (62072360, 61902292, 62001357, 62072359, 62172438), the key research and development plan of Shaanxi Province (2019ZDLGY13-07, 2019ZDLGY13-04, 2020JQ-844), the Natural Science Foundation of Guangdong Province of China (2022A1515010988), the Xi’an Science and Technology Plan (20RGZN0005), and the Xi’ an Key Laboratory of Mobile Edge Computing and Security (201805052-ZD3CG36).

Author information

Authors and Affiliations

Authors

Contributions

PY performed the data analyses and wrote the manuscript. MW helped perform the analysis with constructive discussions. HY contributed to the conception of the study. CH contributed to analysis and manuscript preparation. LC contributed to analysis and manuscript review.

Corresponding author

Correspondence to Hao Yuan.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yang, P., Wang, M., Yuan, H. et al. Using contour loss constraining residual attention U-net on optical remote sensing interpretation. Vis Comput 39, 4279–4291 (2023). https://doi.org/10.1007/s00371-022-02590-3

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00371-022-02590-3

Keywords

Navigation