Detail Perception Network for Semantic Segmentation in Water Scenes

Liang, Cuixiao; Cai, Wenjie; Peng, Shaowu; Liu, Qiong

doi:10.1007/978-3-031-05981-0_15

Cuixiao Liang¹³,
Wenjie Cai¹⁴,
Shaowu Peng¹³ &
…
Qiong Liu¹³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13282))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

1524 Accesses
3 Citations

Abstract

Semantic segmentation in the water scene is significant for water environment monitoring. Recent water scene segmentation methods usually regard all floating objects as only one foreground category that limits the understanding of the water scene. Considering various floating objects, we propose a Detail Perception Network (DPNet) to address two challenges for water scene segmentation with multi-categories floating objects. One is the sample imbalance among objects of different scales, which leads to low accuracy on small objects covering a few pixel samples. Another is the weak discriminability of features among the categories that are close to the blurred edge, which leads to the miss-segmentation in the blurred edge region. For sample imbalance, we design Distance Field Loss (DF Loss) to strengthen the learning of small objects by a pixel-wise weight calculated from a distance field during training. To address the weak discriminability of features among categories that are close to the blurred edge, we propose a Category Edge Perception Pyramid (CEPP) module that learns the edge feature of each category as prior knowledge to enhance edge features. For training and evaluating relative models, we also establish a dataset named ColorWater, which contains 1279 images with 9 semantic labels over various water scenes. Extensive experiments demonstrate that our model performs favorably against the state-of-the-art models on our ColorWater dataset and the public Aeroscapes dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bovcon, B., Kristan, M.: WaSR-a water segmentation and refinement maritime obstacle detection network. IEEE Trans. Cyberneti. 1–14 (2021)
Google Scholar
Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with Atrous separable convolution for semantic image segmentation. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 801–818 (2018)
Google Scholar
Chen, X., Liu, Y., Achuthan, K.: WODIS: water obstacle detection network based on image segmentation for autonomous surface vehicles in maritime environments. IEEE Trans. Instrum. Measur 70, 1–13 (2021)
Google Scholar
Chen, Y., Li, Y., Wang, J.: An end-to-end oil-spill monitoring method for multisensory satellite images based on deep semantic segmentation. Sensors 20(3), 725 (2020)
Article Google Scholar
Cheng, F., et al.: Learning directional feature maps for cardiac MRI segmentation. In: Martel, A.L., et al. (eds.) MICCAI 2020. LNCS, vol. 12264, pp. 108–117. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59719-1_11
Chapter Google Scholar
Cheng, Y., Jiang, M., Zhu, J., Liu, Y.: Are we ready for unmanned surface vehicles in inland waterways? The USVInland multisensor dataset and benchmark. IEEE Robot. Autom. Lett. 6(2), 3964–3970 (2021)
Article Google Scholar
Cui, Y., Jia, M., Lin, T.Y., Song, Y., Belongie, S.: Class-balanced loss based on effective number of samples. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9268–9277 (2019)
Google Scholar
He, J., Deng, Z., Qiao, Y.: Dynamic multi-scale filters for semantic segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3562–3572 (2019)
Google Scholar
He, J., Deng, Z., Zhou, L., Wang, Y., Qiao, Y.: Adaptive pyramid context network for semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 7519–7528 (2019)
Google Scholar
Huang, C., Li, Y., Loy, C.C., Tang, X.: Deep imbalanced learning for face recognition and attribute prediction. IEEE Trans. Pattern Anal. Mach. Intell. 42(11), 2781–2794 (2019)
Article Google Scholar
Kirillov, A., Girshick, R., He, K., Dollár, P.: Panoptic feature pyramid networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6399–6408 (2019)
Google Scholar
Lee, H.J., Kim, J.U., Lee, S., Kim, H.G., Ro, Y.M.: Structure boundary preserving segmentation for medical image with ambiguous boundary. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4817–4826 (2020)
Google Scholar
Li, B., Liu, Y., Wang, X.: Gradient harmonized single-stage detector. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 8577–8584 (2019)
Google Scholar
Li, X., et al.: Improving semantic segmentation via decoupled body and edge supervision. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12362, pp. 435–452. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58520-4_26
Chapter Google Scholar
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Google Scholar
Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
Google Scholar
Nigam, I., Huang, C., Ramanan, D.: Ensemble knowledge transfer for semantic segmentation. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1499–1508. IEEE (2018)
Google Scholar
Sun, K., Xiao, B., Liu, D., Wang, J.: Deep high-resolution representation learning for human pose estimation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5693–5703 (2019)
Google Scholar
Takikawa, T., Acuna, D., Jampani, V., Fidler, S.: Gated-SCNN: gated shape CNNs for semantic segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 5229–5238 (2019)
Google Scholar
Tian, J., Mithun, N.C., Seymour, Z., Chiu, H.P., Kira, Z.: Striking the right balance: recall loss for semantic segmentation. CoRR abs/2106.14917 (2021). https://arxiv.org/abs/2106.14917
Yuan, Y., Chen, X., Wang, J.: Object-contextual representations for semantic segmentation. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12351, pp. 173–190. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58539-6_11
Chapter Google Scholar
Zhang, Z., Zhang, X., Peng, C., Xue, X., Sun, J.: ExFuse: enhancing feature fusion for semantic segmentation. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 269–284 (2018)
Google Scholar
Štricelj, A., Kačič, Z.: Detection of objects on waters’ surfaces using CEIEMV method. Comput. Electr. Eng. 46, 511–527 (2015)
Article Google Scholar

Download references

Author information

Authors and Affiliations

South China University of Technology, Guangzhou, China
Cuixiao Liang, Shaowu Peng & Qiong Liu
Shunfeng Technology (Shenzhen) Co., Ltd., Shenzhen, China
Wenjie Cai

Authors

Cuixiao Liang
View author publications
You can also search for this author in PubMed Google Scholar
Wenjie Cai
View author publications
You can also search for this author in PubMed Google Scholar
Shaowu Peng
View author publications
You can also search for this author in PubMed Google Scholar
Qiong Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qiong Liu .

Editor information

Editors and Affiliations

Laboratory of Artificial Intelligence and Decision Support, University of Porto, Porto, Portugal
João Gama
School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, China
Tianrui Li
National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China
Yang Yu
School of Computer Science and Technology, University of Science and Technology of China, Hefei, China
Enhong Chen
JD iCity, JD Technology & JD Intelligent Cities Research, Beijing, China
Yu Zheng
School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, China
Fei Teng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liang, C., Cai, W., Peng, S., Liu, Q. (2022). Detail Perception Network for Semantic Segmentation in Water Scenes. In: Gama, J., Li, T., Yu, Y., Chen, E., Zheng, Y., Teng, F. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2022. Lecture Notes in Computer Science(), vol 13282. Springer, Cham. https://doi.org/10.1007/978-3-031-05981-0_15

Download citation

DOI: https://doi.org/10.1007/978-3-031-05981-0_15
Published: 10 May 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-05980-3
Online ISBN: 978-3-031-05981-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Detail Perception Network for Semantic Segmentation in Water Scenes