DOI: 10.1145/3503161.3548304
research-article

Robust Industrial UAV/UGV-Based Unsupervised Domain Adaptive Crack Recognitions with Depth and Edge Awareness: From System and Database Constructions to Real-Site Inspections

Published: 10 October 2022

ABSTRACT

Diagnosing defects in modern infrastructure is crucial to public safety. In this work, we propose a complete crack inspection system with three main components: the autonomous system setup, geographic-information-system-based 3D reconstruction, and database construction together with domain adaptive algorithm design. To address unsupervised domain adaptation (UDA) for crack recognition in infrastructure inspection, we propose a robust unsupervised domain adaptive learning strategy, termed Crack-DA, that increases the generalization capacity of the model in unseen test circumstances. First, we leverage self-supervised depth information to help the learning of semantics. Second, we use edge information to suppress non-edge background objects and noise. Third, we use data-augmentation-based consistency learning to increase prediction robustness. Finally, we use the disparity in depth to evaluate the domain gap in semantics and explicitly consider this domain gap when optimizing the network. We also propose a source database of 11,298 crack images with detailed pixel-level labels for network training in domain adaptation. Extensive experiments on UAV-captured highway cracks and real-site UAV inspections of building cracks demonstrate the robustness and effectiveness of the proposed domain adaptive crack recognition approach.
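The abstract does not specify how the consistency term is computed, so the following is only a minimal sketch of the general idea behind augmentation-based consistency learning, not the Crack-DA implementation. It assumes a horizontal flip as the augmentation and a mean squared disagreement as the penalty; `hflip`, `consistency_loss`, `pointwise_model`, and `left_neighbor_model` are hypothetical names introduced purely for illustration, and the two toy "models" stand in for a segmentation network.

```python
from typing import Callable, List

Grid = List[List[float]]  # 2D image or per-pixel crack-probability map


def hflip(grid: Grid) -> Grid:
    """Mirror each row left-to-right (an assumed, simple augmentation)."""
    return [row[::-1] for row in grid]


def consistency_loss(predict: Callable[[Grid], Grid], image: Grid) -> float:
    """Mean squared disagreement between the prediction on an image
    and the re-aligned prediction on its flipped copy."""
    p_orig = predict(image)
    # Predict on the flipped image, then flip back so pixels line up again.
    p_aug = hflip(predict(hflip(image)))
    diffs = [
        (a - b) ** 2
        for row_a, row_b in zip(p_orig, p_aug)
        for a, b in zip(row_a, row_b)
    ]
    return sum(diffs) / len(diffs)


def pointwise_model(image: Grid) -> Grid:
    """Toy stand-in: darker pixel means higher crack probability.
    It looks at each pixel alone, so flipping does not change its output."""
    return [[1.0 - v for v in row] for row in image]


def left_neighbor_model(image: Grid) -> Grid:
    """Toy stand-in that also looks at the left neighbour,
    so its output changes when the image is mirrored."""
    return [
        [1.0 - (row[j] + row[max(j - 1, 0)]) / 2 for j in range(len(row))]
        for row in image
    ]


image = [[0.9, 0.1, 0.8],
         [0.7, 0.2, 0.9]]

loss_consistent = consistency_loss(pointwise_model, image)
loss_inconsistent = consistency_loss(left_neighbor_model, image)
```

Here `loss_consistent` is zero because the pointwise toy model gives the same answer for a pixel wherever it sits, while `loss_inconsistent` is positive. In training, a term of this kind would be added to the objective on unlabeled target images so that predictions are pushed to agree across augmented views.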


