Abstract
Stagnant water on roads has long been a major cause of traffic jams and accidents. Traditional urban waterlogging monitoring and warning systems rely mainly on large amounts of historical data and predictive networks, and they suffer from low accuracy and weak generalization ability. Because deep neural network algorithms have demonstrated strong capabilities in computer vision tasks such as object detection, we aim to apply them to road stagnant water detection. In this paper, we propose a novel weakly supervised method for automatically localizing stagnant water in visual images. First, a template matching method is applied to extract road information from the traffic image. Then, because annotating the data is complex, we locate stagnant water in the image using the Class Activation Map (CAM) mechanism, which is a weakly supervised method. The detection model consists of ResNet-18 and the Grad-CAM++ mechanism. Finally, based on the heat map and the template, we set a suitable threshold to segment the stagnant water area in the image. In the experiments, the precision and recall of the proposed model for road stagnant water classification are 99.39% and 99.60%, respectively, while the Intersection over Union (IoU) for stagnant water area segmentation reaches up to 63%. These results show that our method is effective for road stagnant water localization.
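The final segmentation step and the reported metrics can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the min-max normalization, the default threshold of 0.5, and the handling of empty masks are all assumptions made for illustration. The sketch takes a heat map (such as one produced by a CAM-style method) and a binary road mask (such as one obtained from template matching), thresholds the heat map inside the road region, and computes IoU, precision, and recall in their standard forms.

```python
import numpy as np


def segment_water(heat_map, road_mask, threshold=0.5):
    """Return a boolean mask of pixels flagged as stagnant water.

    heat_map  : 2-D float array, e.g. a class activation map.
    road_mask : 2-D boolean array, True where the pixel is on the road.
    threshold : hypothetical cut-off on the normalized heat map
                (the abstract does not state the value used).
    """
    heat_map = np.asarray(heat_map, dtype=float)
    road_mask = np.asarray(road_mask, dtype=bool)

    # Min-max normalize so the threshold is comparable across images
    # (an assumption; the paper may normalize differently).
    span = heat_map.max() - heat_map.min()
    if span == 0:
        # A flat heat map carries no localization signal.
        return np.zeros_like(road_mask)
    normalized = (heat_map - heat_map.min()) / span

    # Keep only strongly activated pixels that lie on the road.
    return (normalized >= threshold) & road_mask


def iou(pred, target):
    """Intersection over Union of two boolean masks."""
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    union = np.logical_or(pred, target).sum()
    if union == 0:
        # Both masks empty: treated as a perfect match by convention here.
        return 1.0
    return float(np.logical_and(pred, target).sum() / union)


def precision_recall(tp, fp, fn):
    """Precision and recall from true/false positive and false negative counts."""
    precision = tp / (tp + fp) if (tp + fp) > 0 else 0.0
    recall = tp / (tp + fn) if (tp + fn) > 0 else 0.0
    return precision, recall
```

In use, `segment_water(heat_map, road_mask)` would be called once per image, and `iou` would compare the result with a hand-labelled water mask. The threshold would need to be tuned on held-out data rather than fixed at the placeholder value shown here.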
Acknowledgements
This work was supported in part by the National Natural Science Foundation of China (Grant Nos. 61806208 and 62076166), in part by the General Higher Education Project of Guangdong Provincial Education Department (Grant No. 2020ZDZX3082), in part by the Guangdong Provincial Rural Science and Technology Specialists Project (Grant No. KPT20200220), and in part by the Shenzhen Science and Technology Program (Grant No. RCBS20200714114940262).
About this article
Cite this article
Zhao, Z., Zhang, H. A localization method for stagnant water in city road traffic image. Multimed Tools Appl 81, 2453–2466 (2022). https://doi.org/10.1007/s11042-021-11638-w
DOI: https://doi.org/10.1007/s11042-021-11638-w