Abstract
Aerial image segmentation is an essential problem for land management which can be used for change detection and policy planning. However, traditional semantic segmentation methods focus on single-perspective images in road scenes, while aerial images are top-down views and objects are of a small size. Existing aerial segmentation methods tend to modify the network architectures proposed for traditional semantic segmentation problems, yet to the best of our knowledge, none of them focus on the noisy information present in the aerial images. In this work, we conduct an investigation on the effectiveness of each channels of the aerial image on the segmentation performance. Then, we propose a disentangle learning method to investigate the differences and similarities between channels and images, so that potential noisy information can be removed for higher segmentation accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017)
Boguszewski, A., et al.: Landcover. AI: dataset for automatic mapping of buildings, woodlands and water from aerial imagery. arXiv preprint arXiv:2005.02264 (2020)
Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 833–851. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_49
Cordts, M., et al.: The cityscapes dataset for semantic urban scene understanding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)
Gerard, F., et al.: Land cover change in Europe between 1950 and 2000 determined employing aerial photography. Prog. Phys. Geogr. 34(2), 183–205 (2010)
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask r-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2961–2969 (2017)
Huang, X., Liu, M.-Y., Belongie, S., Kautz, J.: Multimodal unsupervised image-to-image translation. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11207, pp. 179–196. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01219-9_11
Khalel, A., El-Saban, M.: Automatic pixelwise object labeling for aerial imagery using stacked u-nets. arXiv preprint arXiv:1803.04953 (2018)
Li, X., Jiang, Y., Peng, H., Yin, S.: An aerial image segmentation approach based on enhanced multi-scale convolutional neural network. In: 2019 IEEE International Conference on Industrial Cyber Physical Systems (ICPS). pp. 47–52. IEEE (2019)
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Liu, M.Y., Breuel, T., Kautz, J.: Unsupervised image-to-image translation networks. Adv. Neural. Inf. Process. Syst. 30, 700–708 (2017)
Liu, S., Qi, L., Qin, H., Shi, J., Jia, J.: Path aggregation network for instance segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8759–8768 (2018)
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
Maggiori, E., Tarabalka, Y., Charpiat, G., Alliez, P.: Can semantic labeling methods generalize to any city? the iNRIA aerial image labeling benchmark. In: 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), pp. 3226–3229. IEEE (2017)
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Sun, Y., Wu, X., Bandoh, Y., Kitahara, M. (2023). Aerial Image Segmentation via Noise Dispelling and Content Distilling. In: Zheng, Y., Keleş, H.Y., Koniusz, P. (eds) Computer Vision – ACCV 2022 Workshops. ACCV 2022. Lecture Notes in Computer Science, vol 13848. Springer, Cham. https://doi.org/10.1007/978-3-031-27066-6_19
Download citation
DOI: https://doi.org/10.1007/978-3-031-27066-6_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-27065-9
Online ISBN: 978-3-031-27066-6
eBook Packages: Computer ScienceComputer Science (R0)