Abstract
To prevent damage caused by cracks, accurate segmentation of cracks is necessary. Deep learning models are commonly employed to achieve this goal, typically consisting of data-driven neural networks that are trained to determine classification probability for each pixel. However, these models often ignore the optimization of the binarization function, which maps the probability distribution of each pixel to a specific class. Typically, a fixed threshold of 0.5 is used, disregarding the sensitivity of crack data to the threshold. As a result, segmentation accuracy is compromised. To address this issue, we propose a multi-objective optimization method that incorporates both the conventional segmentation model’s objective function and a dynamic threshold-based binarization objective function. By doing so, we aim to improve the accuracy of the segmentation results. Specifically, we introduce a dynamic thresholding branch (DTB) to our approach, which performs a regression task to determine the optimal threshold for each crack image at the image level. This optimal threshold is then utilized in the binarization function to optimize the dynamic thresholding-based binarization objective function. We have conducted experiments to validate the effectiveness of our multi-objective optimization approach with DTB on several well-known crack segmentation models. Additionally, we have evaluated its performance on various crack segmentation datasets. The results indicate that our approach can improve the accuracy of crack segmentation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Augustauskas, R., Lipnickas, A.: Improved pixel-level pavement-defect segmentation using a deep autoencoder. Sensors 20(9), 2557 (2020)
Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017)
Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 833–851. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_49
Cheng, J., Xiong, W., Chen, W., Gu, Y., Li, Y.: Pixel-level crack detection using U-Net. In: TENCON 2018–2018 IEEE Region 10 Conference, pp. 0462–0466. IEEE (2018)
Dung, C.V., et al.: Autonomous concrete crack detection using deep fully convolutional neural network. Autom. Constr. 99, 52–58 (2019)
Fan, Z., Wu, Y., Lu, J., Li, W.: Automatic pavement crack detection based on structured prediction with the convolutional neural network. arXiv preprint arXiv:1802.02208 (2018)
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2961–2969 (2017)
Huang, H., Zhao, S., Zhang, D., Chen, J.: Deep learning-based instance segmentation of cracks from shield tunnel lining images. Struct. Infrastruct. Eng. 18(2), 183–196 (2022)
Inoue, Y., Nagayoshi, H.: Deployment conscious automatic surface crack detection. In: 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 686–694. IEEE (2019)
Inoue, Y., Nagayoshi, H.: Crack detection as a weakly-supervised problem: towards achieving less annotation-intensive crack detectors. In: 2020 25th International Conference on Pattern Recognition (ICPR), pp. 65–72. IEEE (2021)
Jenkins, M.D., Carr, T.A., Iglesias, M.I., Buggy, T., Morison, G.: A deep convolutional neural network for semantic pixel-wise segmentation of road and pavement surface cracks. In: 2018 26th European Signal Processing Conference (EUSIPCO), pp. 2120–2124. IEEE (2018)
Kang, D., Benipal, S.S., Gopal, D.L., Cha, Y.J.: Hybrid pixel-level concrete crack segmentation and quantification across complex backgrounds using deep learning. Autom. Constr. 118, 103291 (2020)
König, J., Jenkins, M., Mannion, M., Barrie, P., Morison, G.: What’s cracking? A review and analysis of deep learning methods for structural crack segmentation, detection and quantification. arXiv preprint arXiv:2202.03714 (2022)
König, J., Jenkins, M.D., Barrie, P., Mannion, M., Morison, G.: A convolutional neural network for pavement surface crack segmentation using residual connections and attention gating. In: 2019 IEEE international conference on image processing (ICIP), pp. 1460–1464. IEEE (2019)
Li, Y., Wang, H., Dang, L.M., Piran, M.J., Moon, H.: A robust instance segmentation framework for underground sewer defect detection. Measurement 190, 110727 (2022)
Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Liu, Y., Yao, J., Lu, X., Xie, R., Li, L.: DeepCrack: a deep hierarchical feature learning architecture for crack segmentation. Neurocomputing 338, 139–153 (2019)
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Safaei, N., Smadi, O., Masoud, A., Safaei, B.: An automatic image processing algorithm based on crack pixel density for pavement crack detection and classification. Int. J. Pavement Res. Technol. 15(1), 159–172 (2022)
Tabernik, D., Šela, S., Skvarč, J., Skočaj, D.: Segmentation-based deep-learning approach for surface-defect detection. J. Intell. Manuf. 31(3), 759–776 (2020)
Wu, Z., Lu, T., Zhang, Y., Wang, B., Zhao, X.: Crack detecting by recursive attention U-Net. In: 2020 3rd International Conference on Robotics, Control and Automation Engineering (RCAE), pp. 103–107. IEEE (2020)
Yang, F., Zhang, L., Yu, S., Prokhorov, D., Mei, X., Ling, H.: Feature pyramid and hierarchical boosting network for pavement crack detection. IEEE Trans. Intell. Transp. Syst. 21(4), 1525–1535 (2019)
Zhao, S., Zhang, D., Xue, Y., Zhou, M., Huang, H.: A deep learning-based approach for refined crack evaluation from shield tunnel lining images. Autom. Constr. 132, 103934 (2021)
Zhou, Z., Rahman Siddiquee, M.M., Tajbakhsh, N., Liang, J.: UNet++: a nested U-Net architecture for medical image segmentation. In: Stoyanov, D., et al. (eds.) DLMIA/ML-CDS-2018. LNCS, vol. 11045, pp. 3–11. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00889-5_1
Zou, Q., Zhang, Z., Li, Q., Qi, X., Wang, Q., Wang, S.: DeepCrack: learning hierarchical convolutional features for crack detection. IEEE Trans. Image Process. 28(3), 1498–1512 (2018)
Acknowledgements
This work is funded in part by the National Natural Science Foundation of China under Grants No. 62176029. This work also is supported in part by the Chongqing Technology Innovation and Application Development Special under Grants CSTB2022TIAD-KPX0206. We express our sincere gratitude to the above funding agencies. We also acknowledge the computational support from the Chongqing Artificial Intelligence Innovation Center.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Ethics declarations
Ethical Statement
This study followed the ethical guidelines of the National Natural Science Foundation of China and the ethical review standards of the Chongqing Artificial Intelligence Innovation Center. This study did not involve any human or animal participants, nor did it use any sensitive or private data. The purpose of this study was to promote scientific advancement and social welfare in the field of artificial intelligence, and it did not cause any harm or adverse effects to any individual or group. All authors of this study are responsible for the content of this paper, and declare that they have no conflicts of interest or academic misconduct.
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Lei, Q., Zhong, J., Wang, C., Xia, Y., Zhou, Y. (2023). Dynamic Thresholding for Accurate Crack Segmentation Using Multi-objective Optimization. In: Koutra, D., Plant, C., Gomez Rodriguez, M., Baralis, E., Bonchi, F. (eds) Machine Learning and Knowledge Discovery in Databases: Research Track. ECML PKDD 2023. Lecture Notes in Computer Science(), vol 14169. Springer, Cham. https://doi.org/10.1007/978-3-031-43412-9_23
Download citation
DOI: https://doi.org/10.1007/978-3-031-43412-9_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-43411-2
Online ISBN: 978-3-031-43412-9
eBook Packages: Computer ScienceComputer Science (R0)