A crack detection network with multi-channel attention and enhanced information interaction

Qu, Zhong; Zhou, Lihui; Yin, Xuehui; Lu, Tong

doi:10.1007/s11760-024-03581-8

A crack detection network with multi-channel attention and enhanced information interaction

Original Paper
Published: 02 December 2024

Volume 19, article number 37, (2025)
Cite this article

Signal, Image and Video Processing Aims and scope Submit manuscript

Zhong Qu¹,
Lihui Zhou¹,
Xuehui Yin¹ &
…
Tong Lu¹

175 Accesses
Explore all metrics

Abstract

Roads frequently experience cracks. It adversely impact the safe passage of vehicles and pedestrians, and have the potential to alter the road’s structure. To address this issue, we propose a novel crack detection network. The network constructs multi-channel attention and enhanced information interaction mechanisms to capture more granular semantic information. In our network, each convolutional layer is followed by a convolution combining asymmetric convolutions and criss-cross attention to enhance the feature maps post-convolution. This is followed by spatial and channel reconstruction convolutions and shuffle attention to optimize the generated side-output features. By extensively mining features from the deep network and ingeniously integrating bottom-level and top-level features through a new feature fusion module. The network achieves precise crack prediction results. Extensive experiments on the general-purpose crack image datasets Crack500, CFD and DeepCrack demonstrate the model’s effectiveness. In these three datasets, F1-score values of 0.734, 0.635, and 0.881, MIoU values of 0.773, 0.726 and 0.888.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A novel real-time pixel-level road crack segmentation network

Article 20 April 2024

Gating attention convolutional networks with dense connection for pixel-level crack detection

Article 15 October 2022

Scd-yolo: a novel object detection method for efficient road crack detection

Article 20 November 2024

Data availability

Data are available on request to the authors.

References

Krishnamoorthy, S., Zhang, C., Yanxin, Z.: Implementation of image fusion to investigate wall crack. In: International Conference on Emerging Trends in Information Technology and Engineering, pp. 1–3, Vellore, India (February 2020)
Liu, Y., Yao, J., Xiaohu, L., Xie, R., Li, L.: Deepcrack: a deep hierarchical feature learning architecture for crack segmentation. Neurocomputing 338, 139–153 (2019)
Article MATH Google Scholar
Oliveira, H., Correia, P.L.: Automatic road crack detection and characterization. IEEE Trans. Intell. Transp. Syst. 14(1), 155–168 (2012)
Article MATH Google Scholar
He, J., Wang, Y., Wang, Y., Li, R., Zhang, D., Zheng, Z.: A lightweight road crack detection algorithm based on improved yolov7 model. Signal Image Video Process. 18, 1–14 (2024)
Article MATH Google Scholar
Sheng, S., Yin, H., Yang, Y., Chong, A., Huang, H.: DUNet: dense u-blocks network for fine-grained crack detection. SIViP 18(2), 1929–1938 (2024)
Article MATH Google Scholar
Ronneberger, O., Fischer, P., Brox, T.: U-net: Convolutional networks for biomedical image segmentation. In: Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015: 18th International Conference, pp. 234–241 (2015)
Xie, Y., Zhan, N., Zhu, J., Bingli, X., Chen, H., Mao, W., Luo, X., Ya, H.: Landslide extraction from aerial imagery considering context association characteristics. Int. J. Appl. Earth Obs. Geoinf. 131, 103950 (2024)
Google Scholar
Zhu, J., Zhang, J., Chen, H., Xie, Y., Hengchao, G., Lian, H.: A cross-view intelligent person search method based on multi-feature constraints. Int. J. Digit. Earth 17(1), 2346259 (2024)
Article Google Scholar
Cao, S., Feng, D., Liu, S., Xu, W., Chen, H., Xie, Y., Zhang, H., Pirasteh, S., Zhu, J.: BEMRF-net: boundary enhancement and multiscale refinement fusion for building extraction from remote sensing imagery. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 1, 1–17 (2024)
Google Scholar
Zhang, T., Wang, D., Yang, L.: ECSNET: an accelerated real-time image segmentation CNN architecture for pavement crack detection. IEEE Trans. Intell. Transp. Syst. 24(12), 15105–15112 (2023)
Article Google Scholar
Enhui, G., Xiao, G., Lian, F., Tongyao, M., Hong, J., Liu, J.: Segmentation and evaluation of crack image from aircraft fuel tank via atrous spatial pyramid fusion and hybrid attention network. IEEE Trans. Instrum. Meas. 72, 1–14 (2023)
Google Scholar
Zhou, Q., Zhong, Q., Fang-rong, J.: A lightweight network for crack detection with split exchange convolution and multi-scale features fusion. IEEE Trans. Intell. Veh. 8(3), 2296–2306 (2023)
Article MATH Google Scholar
Yang, L., Huang, H., Kong, S., Liu, Y., Hongnian, Yu.: PAF-net: a progressive and adaptive fusion network for pavement crack segmentation. IEEE Trans. Intell. Transp. Syst. 24(11), 12686–12700 (2023)
Article MATH Google Scholar
Sun, X., Xie, Y., Jiang, L., Cao, Yu., Liu, B.: DMA-net: deeplab with multi-scale attention for pavement crack segmentation. IEEE Trans. Intell. Transp. Syst. 23(10), 18392–18403 (2022)
Article Google Scholar
Yao, H., Liu, Y., Li, X., You, Z., Feng, Yu., Weiwei, L.: A detection method for pavement cracks combining object detection and attention mechanism. IEEE Trans. Intell. Transp. Syst. 23(11), 22179–22789 (2022)
Article Google Scholar
Zhang, X., Huang, H.: Lightaunet: a lightweight fusing attention based U-Net for crack detection. In: 2022 7th International Conference on Image, Vision and Computing, pp. 178–182, Xi’an, China (2022)
Zhou, Q., Zhong, Q., Wang, S.-Y., Bao, K.-H.: A method of potentially promising network for crack detection with enhanced convolution and dynamic feature fusion. IEEE Trans. Intell. Transp. Syst. 23(10), 18736–18745 (2022)
Article Google Scholar
Jia, Y., Rong, C., Wu, C., Yang, Y.: Research on the decomposition and fusion method for the infrared and visible images based on the guided image filtering and gaussian filter. In: 2017 3rd IEEE International Conference on Computer and Communications, pp. 1797–1802, Chengdu, China (March 2017)
Huang, Zilong, Wang, Xinggang, Huang, Lichao, Huang, Chang, Wei, Yunchao, Liu, Wenyu: Ccnet: Criss-cross attention for semantic segmentation. In: Proceedings of the IEEE/CVF international conference on computer vision, pages 603–612, Long Beach, CA, USA (February 2019)
Ding, X., Guo, Y., Ding, G., Han, J.: strengthening the kernel skeletons for powerful CNN via asymmetric convolution blocks. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1911–1920, Long Beach, CA, USA (February 2019)
Li, J., Wen, Y., He, L.: SCConv: spatial and channel reconstruction convolution for feature redundancy. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6153–6162, Vancouver, Canada (August 2023)
Zhang, Q.-L. Yang, Y.-B.: SA-Net: shuffle attention for deep convolutional neural networks. In: ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 2235–2239, Toronto, Ontario, Canada (May 2021)
Shi, Y., Cui, L., Qi, Z., Meng, F., Chen, Z.: Automatic road crack detection using random structured forests. IEEE Trans. Intell. Transp. Syst. 17(12), 3434–3445 (2016)
Article Google Scholar
Yang, F., Zhang, L., Sijia, Yu., Prokhorov, D., Mei, X., Ling, H.: Feature pyramid and hierarchical boosting network for pavement crack detection. IEEE Trans. Intell. Transp. Syst. 21(4), 1525–1535 (2020)
Article MATH Google Scholar
Han, C., Tao Ma, J., Huyan, X.H., Zhang, Y.: CrackW-Net: a novel pavement crack image segmentation convolutional neural network. IEEE Trans. Intell. Transp. Syst. 23(11), 22135–22144 (2022)
Article Google Scholar
Ran, R., Xu, X., Qiu, S., Cui, X., Wu, F.: Crack-SegNet: surface crack detection in complex background using encoder–decoder architecture. In: Proceedings of the 2021 4th International Conference on Sensors, Signal and Image Processing, pp. 15–22, New York, USA (October 2021)
Zou, Q., Zhang, Z., Li, Q., Qi, X., Wang, Q., Wang, S.: Deepcrack: learning hierarchical convolutional features for crack detection. IEEE Trans. Image Process. 28(3), 1498–1512 (2019)
Liu, H., Miao, X., Mertz, C., Xu, C., Kong, H.: Crackformer: transformer network for fine-grained crack detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3763–3772, Montreal, QC, Canada (October 2021)
Xie, S., Tu, Z.: Holistically-nested edge detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1395–1403, Santiago, Chile (October 2015)
Gao, G., Guoan, X., Yi, Yu., Xie, J., Yang, J., Yue, D.: MSCFNet: a lightweight network with multi-scale context fusion for real-time semantic segmentation. IEEE Trans. Intell. Transp. Syst. 23(12), 25489–25499 (2022)
Article Google Scholar
Fan, Z., Li, C., Chen, Y., Wei, J., Loprencipe, G., Chen, X., Di Mascio, P.: Automatic crack detection on road pavements using encoder–decoder architecture. Materials 13(13), 2960 (2020)
Article MATH Google Scholar
Yang, L., Huang, H., Kong, S., Liu, Y., Hongnian, Yu.: PAF-net: a progressive and adaptive fusion network for pavement crack segmentation. IEEE Trans. Intell. Transp. Syst. 24(11), 12686–18700 (2023)
Article MATH Google Scholar
Qi, Y., He, Y., Qi, X., Zhang, Y., Yang, G.: Dynamic snake convolution based on topological geometric constraints for tubular structure segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6070–6079
Chen, L., Ying, F., Lin, G., Yan, C., Harada, T., Huang, G.: Frequency-aware feature fusion for dense image prediction. IEEE Trans. Pattern Anal. Mach. Intell. 46, 10763–10780 (2024)
Article MATH Google Scholar
Zheng, Wenwen, Jiang, Xiaoyan, Fang, Zhijun, Gao, Yongbin: TV-Net: a structure-level feature fusion network based on tensor voting for road crack segmentation. IEEE Trans. Intell. Transp. Syst. (2024). https://doi.org/10.1109/TITS.2023.3334266
Article MATH Google Scholar

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (62176034, 62471076), the Science and Technology Research Program of Chongqing Municipal Education Commission (KJZDM202300604, KJZDM202301902), the Natural Science Foundation of Chongqing (cstc2021jcyjmsxmX0518, 2023NSCQMSX1781).

Author information

Authors and Affiliations

Chongqing University of Posts and Telecommunications, Chongqing, 400065, China
Zhong Qu, Lihui Zhou, Xuehui Yin & Tong Lu

Authors

Zhong Qu
View author publications
You can also search for this author inPubMed Google Scholar
Lihui Zhou
View author publications
You can also search for this author inPubMed Google Scholar
Xuehui Yin
View author publications
You can also search for this author inPubMed Google Scholar
Tong Lu
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

Zhong Qu and Lihui Zhou wrote the main manuscript text. Xuehui Yin prepared figures 1-5. Tong Lu prepared figures 6-8. All authors reviewed the manuscript.

Corresponding author

Correspondence to Zhong Qu.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Qu, Z., Zhou, L., Yin, X. et al. A crack detection network with multi-channel attention and enhanced information interaction. SIViP 19, 37 (2025). https://doi.org/10.1007/s11760-024-03581-8

Download citation

Received: 25 September 2024
Revised: 11 November 2024
Accepted: 13 November 2024
Published: 02 December 2024
DOI: https://doi.org/10.1007/s11760-024-03581-8

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A crack detection network with multi-channel attention and enhanced information interaction

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A novel real-time pixel-level road crack segmentation network

Gating attention convolutional networks with dense connection for pixel-level crack detection

Scd-yolo: a novel object detection method for efficient road crack detection

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now