skip to main content
10.1145/3511808.3557252acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

CASA-Net: A Context-Aware Correlation Convolutional Network for Scale-Adaptive Crack Detection

Published: 17 October 2022 Publication History

Abstract

Surface cracks in infrastructure are a key indicator of structural safety and degradation. Visual-based crack detection is a critical task for the enormous application demands of infrastructure industries. Convolution operations have been widely deployed due to the strong feature learning abilities. However, global feature dependencies of multi-scale cracks are ignored due to the limited receptive field.In addition, the detection of cracks with low contrast suffers a serious performance loss.Therefore, to address the scale-adaptive crack detection problem, we propose a context-aware correlation convolutional network for scale-adaptive crack detection named CASA-Net. CASA-Net is capable of extracting multi-scale crack features for distinguishing between cracks and surface backgrounds, and evaluating feature correlations to capture global contexts. CASA-Net is composed of the multi-scale distinguishing feature extraction (MDFE) module and the context-aware feature correlation (CAFC) module. Specifically, the MDFE module consists of multiple cascaded convolutional layers and distinguishing feature extraction layers (DFLayers). The CAFC module consists of a mapping block and cascaded correlators to capture the context-aware features for long-range interactions. The performance of CASA-Net is evaluated on a benchmark crack dataset. The experimental results indicate that CASA-Net outperforms rival methods by achieving an F1-Score of 0.65 and an AP50 of 63.9%.

References

[1]
Mohammad R Jahanshahi, Farrokh Jazizadeh, Sami F Masri, and Burcin Becerik-Gerber. Unsupervised approach for autonomous pavement-defect detection and quantification using an inexpensive depth sensor. Journal of Computing in Civil Engineering, 27(6):743--754, 2013.
[2]
Yann LeCun, Yoshua Bengio, et al. Convolutional networks for images, speech, and time series. The handbook of brain theory and neural networks, 3361(10):1995, 1995.
[3]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems, 25, 2012.
[4]
Xiongwei Wu, Doyen Sahoo, and Steven CH Hoi. Recent advances in deep learning for object detection. Neurocomputing, 396:39--64, 2020.
[5]
Swarnendu Ghosh, Nibaran Das, Ishita Das, and Ujjwal Maulik. Understanding deep learning techniques for image segmentation. ACM Computing Surveys (CSUR), 52(4):1--35, 2019.
[6]
Zhiliang Peng, Wei Huang, Shanzhi Gu, Lingxi Xie, Yaowei Wang, Jianbin Jiao, and Qixiang Ye. Conformer: Local features coupling global representations for visual recognition. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 367--376, 2021.
[7]
Fisher Yu and Vladlen Koltun. Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122, 2015.
[8]
Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, and Yichen Wei. Deformable convolutional networks. In Proceedings of the IEEE international conference on computer vision, pages 764--773, 2017.
[9]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE transactions on pattern analysis and machine intelligence, 37(9):1904--1916, 2015.
[10]
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, et al. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
[11]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770--778, 2016.
[12]
Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision, pages 2980--2988, 2017.
[13]
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention, pages 234--241, 2015.
[14]
Martin Danelljan, Goutam Bhat, Fahad Shahbaz Khan, and Michael Felsberg. Eco: Efficient convolution operators for tracking. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 6638--6646, 2017.
[15]
Yang Zhang and Ka-Veng Yuen. Crack detection using fusion features-based broad learning system and image processing. Computer-Aided Civil and Infrastructure Engineering, 36(12):1568--1584, 2021.
[16]
Bubryur Kim, N Yuvaraj, KR Sri Preethaa, and R Arun Pandian. Surface crack detection using deep learning with shallow cnn architecture for enhanced computation. Neural Computing and Applications, 33(15):9289--9305, 2021.
[17]
Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278--2324, 1998.
[18]
Shadrack Fred Mahenge, Stephen Wambura, and Licheng Jiao. Robust deep representation learning for road crack detection. In 2021 The 5th International Conference on Video and Image Processing, pages 117--125, 2021.
[19]
Babloo Kumar and Sayantari Ghosh. Detection of concrete cracks using dual-channel deep convolutional network. In 2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT), pages 1--7, 2020.
[20]
Chun Li, Yu Wen, Qingxuan Shi, Fang Yang, Hongyan Ma, and Xuedong Tian. A pavement crack detection method based on multiscale attention and hfs. Computational Intelligence and Neuroscience, 2022, 2022.
[21]
Hanshen Chen and Huiping Lin. An effective hybrid atrous convolutional network for pixel-level crack detection. IEEE Transactions on Instrumentation and Measurement, 70:1--12, 2021.
[22]
Huajun Liu, Xiangyu Miao, Christoph Mertz, Chengzhong Xu, and Hui Kong. Crackformer: Transformer network for fine-grained crack detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 3783--3792, 2021.
[23]
Vijay Badrinarayanan, Alex Kendall, and Roberto Cipolla. Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE transactions on pattern analysis and machine intelligence, 39(12):2481--2495, 2017.
[24]
Tingyang Chen, Zhenhua Cai, Xi Zhao, Chen Chen, Xufeng Liang, Tierui Zou, and Pan Wang. Pavement crack detection and recognition using the architecture of segnet. Journal of Industrial Information Integration, 18:100144, 2020.
[25]
Rodrigo Rill-Garc'ia, Eva Dokládalová, and Petr Doklédal. Pixel-accurate road crack detection in presence of inaccurate annotations. Neurocomputing, 2022.
[26]
Kaige Zhang, Yingtao Zhang, and Heng-Da Cheng. Crackgan: Pavement crack detection using partially accurate ground truths based on generative adversarial learning. IEEE Transactions on Intelligent Transportation Systems, 22(2):1306--1319, 2020.
[27]
Fan Yang, Lei Zhang, Sijia Yu, Danil Prokhorov, Xue Mei, and Haibin Ling. Feature pyramid and hierarchical boosting network for pavement crack detection. IEEE Transactions on Intelligent Transportation Systems, 21(4):1525--1535, 2019.
[28]
Qiang Zhou, Zhong Qu, and Chong Cao. Mixed pooling and richer attention feature fusion for crack detection. Pattern Recognition Letters, 145:96--102, 2021.
[29]
Changyu Jin, Kai Wang, Tao Han, Yu Lu, Aixin Liu, and Dong Liu. Segmentation of ore and waste rocks in borehole images using the multi-module densely connected u-net. Computers & Geosciences, 159:105018, 2022.
[30]
Nhung Hong Thi Nguyen, Stuart Perry, Don Bone, Ha Thanh Le, and Thuy Thi Nguyen. Two-stage convolutional neural network for road crack detection and segmentation. Expert Systems with Applications, 186:115718, 2021.
[31]
Wenbo Jiang, Min Liu, Yunuo Peng, Lehui Wu, and Yaonan Wang. Hdcb-net: A neural network with the hybrid dilated convolution for pixel-level crack detection on concrete bridges. IEEE Transactions on Industrial Informatics, 17(8):5485--5494, 2020.
[32]
Jianghua Deng, Ye Lu, and Vincent Cheng-Siong Lee. Concrete crack detection with handwriting script interferences using faster region-based convolutional neural network. Computer-Aided Civil and Infrastructure Engineering, 35(4):373--388, 2020.
[33]
Guo X Hu, Bao L Hu, Zhong Yang, Li Huang, and Ping Li. Pavement crack detection method based on deep learning models. Wireless Communications and Mobile Computing, 2021, 2021.
[34]
Lijuan Duan, Jun Zeng, Junbiao Pang, and Junzhe Wang. Pavement crack detection using multi-stage structural feature extraction model. In 2021 IEEE International Conference on Image Processing, pages 969--973, 2021.
[35]
Beixin Xia, Jianbin Cao, Xu Zhang, and Yunfang Peng. Automatic concrete sleeper crack detection using a one-stage detector. International Journal of Intelligent Robotics and Applications, 4(3):319--327, 2020.
[36]
Fen Fang, Liyuan Li, Ying Gu, Hongyuan Zhu, and Joo-Hwee Lim. A novel hybrid approach for crack detection. Pattern Recognition, 107:107474, 2020.
[37]
Deeksha Arya, Hiroya Maeda, Sanjay Kumar Ghosh, Durga Toshniwal, Hiroshi Omata, Takehiro Kashiyama, and Yoshihide Sekimoto. Global road damage detection: State-of-the-art solutions. In 2020 IEEE International Conference on Big Data, pages 5533--5539, 2020.
[38]
Mark Everingham, Luc Van Gool, Christopher KI Williams, John Winn, and Andrew Zisserman. The pascal visual object classes (voc) challenge. International journal of computer vision, 88(2):303--338, 2010.
[39]
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems, 28, 2015.
[40]
Kaiwen Duan, Song Bai, Lingxi Xie, Honggang Qi, Qingming Huang, and Qi Tian. Centernet: Keypoint triplets for object detection. In Proceedings of the IEEE/CVF international conference on computer vision, pages 6569--6578, 2019.
[41]
Mingxing Tan, Ruoming Pang, and Quoc V Le. Efficientdet: Scalable and efficient object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10781--10790, 2020.
[42]
Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C Berg. Ssd: Single shot multibox detector. In European conference on computer vision, pages 21--37, 2016.
[43]
Joseph Redmon and Ali Farhadi. Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767, 2018.
[44]
Alexey Bochkovskiy, Chien-Yao Wang, and Hong-Yuan Mark Liao. Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934, 2020.
[45]
Zheng Ge, Songtao Liu, Feng Wang, Zeming Li, and Jian Sun. Yolox: Exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430, 2021.
[46]
Sadra Naddaf-Sh, M-Mahdi Naddaf-Sh, Amir R. Kashani, and Hassan Zargarzadeh. An efficient and scalable deep learning approach for road damage detection. In 2020 IEEE International Conference on Big Data (Big Data), pages 5602--5608, 2020.

Cited By

View all
  • (2024)Towards Real-world Deployment of Deep Learning Solutions for Global Road Damage Detection and Classification2024 IEEE International Conference on Big Data (BigData)10.1109/BigData62323.2024.10825129(8485-8494)Online publication date: 15-Dec-2024
  • (2024)Artificial intelligence technology in rock mechanics and rock engineeringDeep Resources Engineering10.1016/j.deepre.2024.1000081:2(100008)Online publication date: Jun-2024
  • (2023)Rockburst time warning method with blasting cycle as the unit based on microseismic information time series: a case studyBulletin of Engineering Geology and the Environment10.1007/s10064-023-03141-382:4Online publication date: 15-Mar-2023

Index Terms

  1. CASA-Net: A Context-Aware Correlation Convolutional Network for Scale-Adaptive Crack Detection

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management
    October 2022
    5274 pages
    ISBN:9781450392365
    DOI:10.1145/3511808
    • General Chairs:
    • Mohammad Al Hasan,
    • Li Xiong
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 October 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. context-aware feature correlation
    2. object detection
    3. scale-adaptive crack detection

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    CIKM '22
    Sponsor:

    Acceptance Rates

    CIKM '22 Paper Acceptance Rate 621 of 2,257 submissions, 28%;
    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)32
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Towards Real-world Deployment of Deep Learning Solutions for Global Road Damage Detection and Classification2024 IEEE International Conference on Big Data (BigData)10.1109/BigData62323.2024.10825129(8485-8494)Online publication date: 15-Dec-2024
    • (2024)Artificial intelligence technology in rock mechanics and rock engineeringDeep Resources Engineering10.1016/j.deepre.2024.1000081:2(100008)Online publication date: Jun-2024
    • (2023)Rockburst time warning method with blasting cycle as the unit based on microseismic information time series: a case studyBulletin of Engineering Geology and the Environment10.1007/s10064-023-03141-382:4Online publication date: 15-Mar-2023

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media