research-article

Robust Industrial UAV/UGV-Based Unsupervised Domain Adaptive Crack Recognitions with Depth and Edge Awareness: From System and Database Constructions to Real-Site Inspections

Author:
Kangcheng Liu

Nanyang Technological University & The Chinese University of Hong Kong, Singapore, Singapore

Nanyang Technological University & The Chinese University of Hong Kong, Singapore, Singapore
View Profile

MM '22: Proceedings of the 30th ACM International Conference on MultimediaOctober 2022Pages 5361–5370https://doi.org/10.1145/3503161.3548304

Published:10 October 2022Publication History

MM '22: Proceedings of the 30th ACM International Conference on Multimedia

Pages 5361–5370

ABSTRACT

The defect diagnosis of modern infrastructures is crucial to public safety. In this work, we propose a complete crack inspection system with three main components, including the autonomous system setup, the geographic-information-system-based 3D reconstruction, and the database construction as well as domain adaptive algorithms design. To fulfill the unsupervised domain adaptation (UDA) task of cracks recognition in infrastructural inspections, we propose a robust unsupervised domain adaptive learning strategy termed Crack-DA to increase the generalization capacity of the model in unseen test circumstances. Specifically, firstly, we propose leveraging the self-supervised depth information to help the learning of semantics. Secondly, we propose using the edge information to suppress the non-edge background objects and noises. Thirdly, we propose using the data augmentation-based consistency learning to increase the prediction robustness. Finally, we use the disparity in depth to evaluate the domain gap in semantics and explicitly consider the domain gap in the optimization of the network. Also, we propose a source database consisting of 11,298 crack images with detailed pixel-level labels for network training in domain adaptations. Extensive experiments on UAV-captured highway cracks and real-site UAV inspections of building cracks demonstrate the robustness and effectiveness of the proposed domain adaptive crack recognition approach.

Supplemental Material

Available for Download

mp4

ACM MM_2417_Presentation video.mp4 (32.2 MB)

References

Paul Bao, Lei Zhang, and Xiaolin Wu. 2005. Canny edge detection enhancement by scale multiplication. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 27, 9 (2005), 1485--1490.Google ScholarDigital Library
Jens Behley, Martin Garbade, Andres Milioto, Jan Quenzel, Sven Behnke, Cyrill Stachniss, and Jurgen Gall. 2019. SemanticKITTI: A dataset for semantic scene understanding of lidar sequences. In Proceedings of the IEEE International Conference on Computer Vision (ICCV). 9297--9307.Google ScholarCross Ref
Guowei Cai, BenMChen, and Tong Heng Lee. 2011. Unmanned Rotorcraft Systems. Springer Science & Business Media.Google Scholar
Guowei Cai, Ben M Chen, Kemao Peng, Miaobo Dong, and Tong H Lee. 2008. Modeling and Control of the Yaw Channel of a UAV Helicopter. IEEE Transactions on Industrial Electronics (TIE) 55, 9 (2008), 3426--3434.Google ScholarCross Ref
Ben M Chen. 2000. Robust and H ∞ Control. Springer.Google ScholarCross Ref
Fu-Chen Chen and Mohammad R Jahanshahi. 2018. NB-CNN: Deep learning-based crack detection using convolutional neural network and Naïve Bayes data fusion. IEEE Transactions on Industrial Electronics (TIE) 65, 5 (2018), 4392--4400.Google ScholarCross Ref
Liang-Chieh Chen, Jonathan T Barron, George Papandreou, Kevin Murphy, and Alan L Yuille. 2016. Semantic image segmentation with task-specific edge detection using cnns and a discriminatively trained domain transform. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 4545--4554.Google ScholarCross Ref
Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L Yuille. 2018. Deeplab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 40, 4 (2018), 834--848.Google ScholarCross Ref
Liang-Chieh Chen, Yukun Zhu, George Papandreou, Florian Schroff, and Hartwig Adam. 2018. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. In Proceedings of the European Conference on Computer Vision (ECCV). 801--818.Google ScholarDigital Library
Wooram Choi and Young-Jin Cha. 2020. SDDNet: Real-time crack segmentation. IEEE Transactions on Industrial Electronics (TIE) 67, 9 (2020), 8016--8025.Google ScholarCross Ref
Lijuan Duan, Huiling Geng, Jun Zeng, Junbiao Pang, and Qingming Huang. 2019. Fast and Accurately Measuring Crack Width via Cascade Principal Component Analysis. In Proceedings of the ACM Multimedia Asia. 1--7.Google ScholarDigital Library
Fei Gao, LuqiWang, Boyu Zhou, Xin Zhou, Jie Pan, and Shaojie Shen. 2020. Teach-repeat-replan: A complete and robust system for aggressive flight in complex environments. IEEE Transactions on Robotics 36, 5 (2020), 1526--1545.Google ScholarCross Ref
Andreas Geiger, Philip Lenz, and Raquel Urtasun. 2012. Are we ready for autonomous driving? the kitti vision benchmark suite. In 2012 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, 3354--3361.Google ScholarCross Ref
Dayan Guan, Jiaxing Huang, Aoran Xiao, and Shijian Lu. 2021. Domain adaptive video segmentation via temporal consistency regularization. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 8053--8064.Google ScholarCross Ref
Zhonghua Hong, Fan Yang, Haiyan Pan, Ruyan Zhou, Yun Zhang, Yanling Han, Jing Wang, Shuhu Yang, Peng Chen, Xiaohua Tong, et al. 2022. Highway Crack Segmentation from Unmanned Aerial Vehicle Images Using Deep Learning. IEEE Geoscience and Remote Sensing Letters (GRSL) (2022).Google Scholar
Jiaxing Huang, Dayan Guan, Aoran Xiao, and Shijian Lu. 2021. Model adaptation: Historical contrastive learning for unsupervised domain adaptation without source data. Advances in Neural Information Processing Systems 34 (2021), 3635--3649.Google Scholar
Jiaxing Huang, Dayan Guan, Aoran Xiao, and Shijian Lu. 2022. Multi-level adversarial network for domain adaptive semantic segmentation. Pattern Recognition 123 (2022), 108384.Google ScholarDigital Library
Jiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu, and Ling Shao. 2022. Category contrast for unsupervised domain adaptation in visual tasks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1203--1214.Google ScholarCross Ref
Jiaxing Huang, Shijian Lu, Dayan Guan, and Xiaobing Zhang. 2020. Contextual relation consistent domain adaptation for semantic segmentation. In European conference on computer vision. Springer, 705--722.Google ScholarDigital Library
Sophie Lambert-Lacroix and Laurent Zwald. 2016. The adaptive BerHu penalty in robust regression. Journal of Nonparametric Statistics 28, 3 (2016), 487--514.Google ScholarCross Ref
Boyi Li, Felix Wu, Ser-Nam Lim, Serge Belongie, and Kilian Q Weinberger. 2021. On feature normalization and data augmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 12383--12392.Google ScholarCross Ref
Zhaoshuo Li, Xingtong Liu, Nathan Drenkow, Andy Ding, Francis X Creighton, Russell H Taylor, and Mathias Unberath. 2021. Revisiting stereo depth estimation from a sequence-to-sequence perspective with transformers. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). 6197--6206.Google ScholarCross Ref
Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal loss for dense object detection. In Proceedings of the IEEE International Conference on Computer Vision (ICCV). 2980--2988.Google ScholarCross Ref
Kangcheng Liu, Zhi Gao, Feng Lin, and BenMChen. 2020. Fg-net: Fast large-scale lidar point clouds understanding network leveraging correlated feature mining and geometric-aware modelling. arXiv preprint arXiv:2012.09439 (2020).Google Scholar
Kangcheng Liu, Zhi Gao, Feng Lin, and Ben M Chen. 2021. FG-Conv: Large-Scale LiDAR Point Clouds Understanding Leveraging Feature Correlation Mining and Geometric-Aware Modeling. In 2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 12896--12902.Google Scholar
Kangcheng Liu, Zhi Gao, Feng Lin, and Ben M Chen. 2022. FG-Net: A Fast and Accurate Framework for Large-Scale LiDAR Point Cloud Understanding. IEEE Transactions on Cybernetics (2022).Google Scholar
Kangcheng Liu, Xiaodong Han, and Ben M Chen. 2019. Deep learning based automatic crack detection and segmentation for unmanned aerial vehicle inspections. In 2019 IEEE International Conference on Robotics and Biomimetics (ROBIO). IEEE, 381--387.Google ScholarDigital Library
Kangcheng Liu, Yanbin Qu, Hak-Man Kim, and Huihui Song. 2017. Avoiding frequency second dip in power unreserved control during wind power rotational speed recovery. IEEE transactions on power systems 33, 3 (2017), 3097--3106.Google Scholar
Kangcheng Liu, Yuzhi Zhao, Qiang Nie, Zhi Gao, and Ben M. Chen. 2022. WS3D: Weakly Supervised 3D Scene Segmentation with Region-Level Boundary Awareness and Instance Discrimination. In European Conference on Computer Vision (ECCV).Google Scholar
Kangcheng Liu, Xunkuai Zhou, and Ben M Chen. 2022. An Enhanced LiDAR Inertial Localization and Mapping System for Unmanned Ground Vehicles. In 2022 17th IEEE International Conference on Control and Automation (ICCA). IEEE.Google Scholar
Kangcheng Liu, Xunkuai Zhou, Benyun Zhao, Huosen Ou, and Ben M Chen. 2022. An Integrated Visual System for Unmanned Aerial Vehicles Following Ground Vehicles: Simulations and Experiments. In 2022 17th IEEE International Conference on Control and Automation (ICCA). IEEE.Google Scholar
Yahui Liu, Jian Yao, Xiaohu Lu, Renping Xie, and Li Li. 2019. DeepCrack: A deep hierarchical feature learning architecture for crack segmentation. Neurocomputing 338 (2019), 139--153.Google ScholarDigital Library
Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. 2021. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). 10012--10022.Google ScholarCross Ref
Zhicheng Liu, Xilong Liu, Zhiqiang Cao, Xurong Gong, Min Tan, and Junzhi Yu. 2022. High Precision Calibration for 3D Vision-Guided Robot System. IEEE Transactions on Industrial Electronics (TIE) (2022).Google Scholar
Zhengzhe Liu, Xiaojuan Qi, and Chi-Wing Fu. 2021. One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 1726--1736.Google ScholarCross Ref
Zhipeng Luo, Zhongang Cai, Changqing Zhou, Gongjie Zhang, Haiyu Zhao, Shuai Yi, Shijian Lu, Hongsheng Li, Shanghang Zhang, and Ziwei Liu. 2021. Unsupervised domain adaptive 3d detection with multi-level consistency. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 8866--8875.Google ScholarCross Ref
Lorenz Meier, Petri Tanskanen, Lionel Heng, Gim Hee Lee, Friedrich Fraundorfer, and Marc Pollefeys. 2012. PIXHAWK: A micro aerial vehicle design for autonomous flight using onboard computer vision. Autonomous Robots 33, 1 (2012), 21--39.Google ScholarDigital Library
Martin Mundt, Sagnik Majumder, Sreenivas Murali, Panagiotis Panetsos, and Visvanathan Ramesh. 2019. Meta-learning convolutional neural architectures for multi-target concrete defect classification with the concrete defect bridge image dataset. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 11196--11205.Google ScholarCross Ref
Quang-Hieu Pham, Thanh Nguyen, Binh-Son Hua, Gemma Roig, and Sai-Kit Yeung. 2019. JSIS3D: Joint semantic-instance segmentation of 3D point clouds with multi-task point-wise networks and multi-value conditional random fields. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (ICCV). 8827--8836.Google ScholarCross Ref
Paolo Sassi, Paolo Tripicchio, and Carlo Alberto Avizzano. 2019. A smart monitoring system for automatic welding defect detection. IEEE Transactions on Industrial Electronics (TIE) 66, 12 (2019), 9641--9650.Google ScholarCross Ref
Jasper RR Uijlings, Koen EA Van De Sande, Theo Gevers, and Arnold WM Smeulders. 2013. Selective search for object recognition. International Journal of Computer Vision (IJCV) 104, 2 (2013), 154--171.Google ScholarDigital Library
Qian Wan, Liang Gao, Xinyu Li, and Long Wen. 2022. Industrial Image Anomaly Localization Based on Gaussian Clustering of Pre-trained Feature. IEEE Transactions on Industrial Electronics (TIE) (2022).Google Scholar
Long Wang and Zijun Zhang. 2017. Automatic detection of wind turbine blade surface cracks based on UAV-taken images. IEEE Transactions on Industrial Electronics (TIE) 64, 9 (2017), 7293--7303.Google ScholarCross Ref
Yuxing Wang, Yawen Lu, Zhihua Xie, and Guoyu Lu. 2021. Deep Unsupervised 3D SfM Face Reconstruction Based on Massive Landmark Bundle Adjustment. In Proceedings of the 29th ACM International Conference on Multimedia. 1350--1358.Google ScholarDigital Library
Long Wen, Xinyu Li, and Liang Gao. 2021. A new reinforcement learning based learning rate scheduler for convolutional neural network in fault classification. Transactions on Industrial Electronics (TIE) 68, 12 (2021), 12890--12900.Google ScholarCross Ref
Qingsong Wen, Liang Sun, Fan Yang, Xiaomin Song, Jingkun Gao, Xue Wang, and Huan Xu. 2020. Time series data augmentation for deep learning: A survey. arXiv preprint arXiv:2002.12478 (2020).Google Scholar
Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, Jose M Alvarez, and Ping Luo. 2021. SegFormer: Simple and efficient design for semantic segmentation with transformers. Advances in Neural Information Processing Systems (Neurips) 34 (2021).Google Scholar
Guanglei Yang, Hao Tang, Mingli Ding, Nicu Sebe, and Elisa Ricci. 2021. Transformer-based attention networks for continuous pixel-wise prediction. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). 16269--16279.Google ScholarCross Ref
Ming-Jia Yang, Yu-Xiao Guo, Bin Zhou, and Xin Tong. 2021. Indoor Scene Generation from a Collection of Semantic-Segmented Depth Images. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). 15203--15212.Google ScholarCross Ref
Fangneng Zhan, Chuhui Xue, and Shijian Lu. 2019. Ga-dan: Geometry-aware domain adaptation network for scene text detection and recognition. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 9105--9115.Google ScholarCross Ref
Chong Zhang, Zongxian Li, Jingjing Liu, Peixi Peng, Qixiang Ye, Shijian Lu, Tiejun Huang, and Yonghong Tian. 2021. Self-guided adaptation: Progressive representation alignment for domain adaptive object detection. IEEE Transactions on Multimedia 24 (2021), 2246--2258.Google ScholarDigital Library
Zhenyu Zhang, Shouwei Gao, and Zheng Huang. 2021. An automatic glioma segmentation system using a multilevel attention pyramid scene parsing network. Current Medical Imaging 17, 6 (2021), 751--761.Google ScholarCross Ref
Hengshuang Zhao, Jianping Shi, Xiaojuan Qi, Xiaogang Wang, and Jiaya Jia. 2017. Pyramid scene parsing network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2881--2890.Google ScholarCross Ref
Yuzhi Zhao, Lai-Man Po, Tingyu Lin, XuehuiWang, Kangcheng Liu, Yujia Zhang, Wing-Yin Yu, Pengfei Xian, and Jingjing Xiong. 2021. Legacy photo editing with learned noise prior. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2103--2112.Google ScholarCross Ref
Mingmin Zhen, Jinglu Wang, Lei Zhou, Shiwei Li, Tianwei Shen, Jiaxiang Shang, Tian Fang, and Long Quan. 2020. Joint semantic segmentation and boundary detection using iterative pyramid contexts. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 13666--13675.Google ScholarCross Ref
Qin Zou, Zheng Zhang, Qingquan Li, Xianbiao Qi, Qian Wang, and Song Wang. 2019. Deepcrack: Learning hierarchical convolutional features for crack detection. IEEE Transactions on Image Processing (TIP) 28, 3 (2019), 1498--1512.Google ScholarDigital Library

Index Terms

Robust Industrial UAV/UGV-Based Unsupervised Domain Adaptive Crack Recognitions with Depth and Edge Awareness: From System and Database Constructions to Real-Site Inspections
1. Computer systems organization
  1. Embedded and cyber-physical systems
    1. Robotics
      1. Robotic autonomy

Recommendations

ROAD: Robust Unsupervised Domain Adaptation with Noisy Labels
MM '23: Proceedings of the 31st ACM International Conference on Multimedia

In recent years, Unsupervised Domain Adaptation (UDA) has emerged as a popular technique for transferring knowledge from a labeled source domain to an unlabeled target domain. However, almost all of the existing approaches implicitly assume that the ...
Read More
Unsupervised and Semi-Supervised Robust Spherical Space Domain Adaptation
Adversarial domain adaptation has been an effective approach for learning domain-invariant features by adversarial training. In this paper, we propose a novel adversarial domain adaptation approach defined in the spherical feature space, in which we ...
Read More
Domain consistency regularization for unsupervised multi-source domain adaptive classification
Highlights
- We propose a novel multi-source domain adaptation method for classification.
- We ...
Abstract
Deep learning-based multi-source unsupervised domain adaptation (MUDA) has been actively studied in recent years. Compared with single-source unsupervised domain adaptation (SUDA), domain shift in MUDA exists not only between the ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MM '22: Proceedings of the 30th ACM International Conference on Multimedia
October 2022
7537 pages
ISBN:9781450392037
DOI:10.1145/3503161
General Chairs:
João Magalhães
NOVA University of Lisbon, Portugal
,
Alberto del Bimbo
University of Florence, Italy
,
Shin'ichi Satoh
National Institute of Informatics, Japan
,
Nicu Sebe
University of Trento, Italy
,
Program Chairs:
Xavier Alameda-Pineda
Inria, Grenoble, France
,
Qin Jin
Renmin University of China, China
,
Vincent Oria
New Jersey Institute of Technology, USA
,
Laura Toni
University College London, UK
Copyright © 2022 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 10 October 2022
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
3D reconstructions
autonomous infrastructural inspection systems
crack segmentation and detection
unmanned aerial/ground vehicles
unsupervised domain adaptive learning
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate995of4,171submissions,24%
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 274
  Total Downloads
- Downloads (Last 12 months)121
- Downloads (Last 6 weeks)7
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Robust Industrial UAV/UGV-Based Unsupervised Domain Adaptive Crack Recognitions with Depth and Edge Awareness: From System and Database Constructions to Real-Site Inspections

MM '22: Proceedings of the 30th ACM International Conference on Multimedia

ABSTRACT

Supplemental Material

Available for Download

References

Cited By

Index Terms

Recommendations

ROAD: Robust Unsupervised Domain Adaptation with Noisy Labels

Unsupervised and Semi-Supervised Robust Spherical Space Domain Adaptation

Domain consistency regularization for unsupervised multi-source domain adaptive classification