skip to main content
10.1145/3615900.3628772acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

Multiscale Multifeature Vision Learning for Scalable and Efficient Wastewater Treatment Plant Detection using Hi-Res Satellite Imagery and OSM

Authors Info & Claims
Published:29 November 2023Publication History

ABSTRACT

Filling data gaps in various global regions requires a robust approach that can accurately provide detection results from earth observation data. One of the challenges arises from significant heterogeneity in satellite images and variation in features and characteristics for specific ground objects like Wastewater Treatment Plants (WTPs). To overcome these challenges, we propose a novel multiscale multifeature hybrid model. This model leverages the power of deep learning-based object detection models, namely Yolov6, RTMDET, EfficientDET, and Domain Adaptation, to accurately and efficiently identify WTP locations worldwide. Our approach focuses on performance enhancements, including reduced false positives (FPs) and broad coverage. The strategies for achieving these improvements involve effective data processing approaches, model tuning, and adaptation. Moreover, we optimize training data features using Volunteered Geographic Information (VGI) data. We demonstrated the effectiveness of the suggested approach for three diverse global regions: Germany, France, and Malaysia. Our study gives new insights into WTP distribution when compared to existing databases like OpenStreetMap (OSM). The resulting pipeline delivers good results even in challenging rural and urban context. Moreover, it is well-suited for generating large scale WTP datasets, which is useful for many applications such as Critical Water Infrastructure mapping, Urban Planning, Climate Action and many more.

References

  1. U.S. Environmental Protection Agency. 2016. (2016).Google ScholarGoogle Scholar
  2. Tolga Bakırman. 2023. An Assessment of YOLO Architectures for Oil Tank Detection from SPOT Imagery. International Journal of Environment and Geoinformatics 10, 1 (2023), 9 -- 15. Google ScholarGoogle ScholarCross RefCross Ref
  3. Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, and Sergey Zagoruyko. 2020. End-to-End Object Detection with Transformers. http://arxiv.org/abs/2005.12872 cite arxiv:2005.12872.Google ScholarGoogle Scholar
  4. Jiaoyan Chen, Yan Zhou, Alexander Zipf, and Hongchao Fan. 2019. Deep Learning From Multiple Crowds: A Case Study of Humanitarian Mapping. IEEE Transactions on Geoscience and Remote Sensing 57, 3 (2019), 1713--1722. Google ScholarGoogle ScholarCross RefCross Ref
  5. Jiaoyan Chen and Alexander Zipf. 2017. Deep Learning with Satellite Images and Volunteered Geographic Information. 63--78. Google ScholarGoogle ScholarCross RefCross Ref
  6. Xiaohan Ding, Xiangyu Zhang, Ningning Ma, Jungong Han, Guiguang Ding, and Jian Sun. 2021. RepVGG:Making VGG-stay ConvNets Great Again. In in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.Google ScholarGoogle Scholar
  7. P Gajalakshmi, J V Satyanarayana, G Venkat Reddy, and Sunita Dhavale. 2020. Detection of Strategic Targets of Interest in Satellite Images using YOLO. In 2020 4th International Conference on Computer, Communication and Signal Processing (ICCCSP). 1--5. Google ScholarGoogle ScholarCross RefCross Ref
  8. Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand, and Victor Lempitsky. 2016. Domain-adversarial training of neural networks. The Journal of Machine Learning Research 17, 1 (2016), 2096--2030.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Zhora Gevorgyan. 2022. SIoU Loss: More Powerful Learning for Bounding Box Regression. arXiv preprint arXiv:2205.12740 (2022). Google ScholarGoogle ScholarCross RefCross Ref
  10. Yulia Grinblat, Sukanya Randhawa, Yuze Li, and et. al. [n. d.]. Towards scalable geosmart wastewater treatment plant detection with decision-tree-based models and Sentinel-2 data. ([n. d.]). https://doi.org/TBAGoogle ScholarGoogle Scholar
  11. Mazin Hnewa and Hayder Radha. 2021. Multiscale Domain Adaptive Yolo For Cross-Domain Object Detection. In 2021 IEEE International Conference on Image Processing (ICIP). 3323--3327. Google ScholarGoogle ScholarCross RefCross Ref
  12. Toan Minh Hoang, Phong Ha Nguyen, Noi Quang Truong, Young Won Lee, and Kang Ryoung Park. 2019. Deep RetinaNet-Based Detection and Classification of Road Markings by Visible Light Camera Sensors. Sensors 19, 2 (2019). Google ScholarGoogle ScholarCross RefCross Ref
  13. Chuyi Li, Lulu Li, Hongliang Jiang, Kaiheng Weng, Yifei Geng, Liang Li, Zaidan Ke, Qingyuan Li, Meng Cheng, Weiqiang Nie, et al. 2022. YOLOv6: A single-stage object detection framework for industrial applications. arXiv preprint arXiv:2209.02976 (2022).Google ScholarGoogle Scholar
  14. Guofa Li, Zefeng Ji, Xingda Qu, Rui Zhou, and Dongpu Cao. 2022. Cross-Domain Object Detection for Autonomous Driving: A Stepwise Domain Adaptative YOLO Approach. IEEE Transactions on Intelligent Vehicles 7, 3 (2022), 603--615. Google ScholarGoogle ScholarCross RefCross Ref
  15. Hao Li, Pedram Ghamisi, Benhood Rasti, Zhaoyan Wu, Aurelie Shapiro, Michael Schultz, and Alexander Zipf. 2020. A multi-sensor fusion framework based on coupled residual convolutional neural networks. Remote Sensing 12, 12 (2020).Google ScholarGoogle Scholar
  16. Hao Li, Johannes Zech, Danfeng Hong, Pedram Ghamisi, Michael Schultz, and Alexander Zipf. 2022. Leveraging OpenStreetMap and Multimodal Remote Sensing Data with Joint Deep Learning for Wastewater Treatment Plants Detection. International Journal of Applied Earth Observation and Geoinformation 110 (2022).Google ScholarGoogle ScholarCross RefCross Ref
  17. Hao Li, Johannes Zech, Christina Ludwig, Sascha Fendrich, Aurelie Shapiro, Michael Schultz, and Alexander Zipf. 2021. Automatic mapping of national surface water with Open-StreetMap and Sentinel-2 MSI data using deep learning. International Journal of Applied Earth Observation and Geoinformation 104 (2021).Google ScholarGoogle Scholar
  18. Wenwen et. al. Li. 2020. GeoAI: Where machine learning and big data converge in GIScience. Journal of Spatial Information Science 20 (2020), 71--77. Google ScholarGoogle ScholarCross RefCross Ref
  19. Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal Loss for Dense Object Detection. In 2017 IEEE International Conference on Computer Vision (ICCV). 2999--3007. Google ScholarGoogle ScholarCross RefCross Ref
  20. Pengyuan Liu and Filip Biljecki. 2022. A review of spatially-explicit GeoAI applications in Urban Geography. International Journal of Applied Earth Observation and Geoinformation 112 (2022), 102936. Google ScholarGoogle ScholarCross RefCross Ref
  21. Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C. Berg. 2016. SSD: Single Shot MultiBox Detector. In Computer Vision - ECCV 2016, Bastian Leibe, Jiri Matas, Nicu Sebe, and Max Welling (Eds.). Springer International Publishing, Cham, 21--37.Google ScholarGoogle Scholar
  22. Z. Liu, Y. Lin, Y. Cao, H. Hu, Y. Wei, Z. Zhang, S. Lin, and B. Guo. 2021. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE Computer Society, Los Alamitos, CA, USA, 9992--10002. Google ScholarGoogle ScholarCross RefCross Ref
  23. Chengqi Lyu, Wenwei Zhang, Haian Huang, Yue Zhou, Yudong Wang, Yanyi Liu, Shilong Zhang, and Kai Chen. 2022. RTMDet: An Empirical Study of Designing Real-Time Object Detectors. arXiv:2212.07784 [cs.CV]Google ScholarGoogle Scholar
  24. Chengqi Lyu, Wenwei Zhang, Haian Huang, Yue Zhou, Yudong Wang, Yanyi Liu, Shilong Zhang, and Kai Chen. 2022. RTMDet: An Empirical Study of Designing Real-Time Object Detectors. arXiv preprint arXiv:2212.07784 (2022). Google ScholarGoogle ScholarCross RefCross Ref
  25. World Health Organization. 2019. (2019).Google ScholarGoogle Scholar
  26. Claudio Persello, Jan Wegner, Ronny Hänsch, Devis Tuia, Pedram Ghamisi, Mila Koeva, and Gustau Camps-Valls. 2022. Deep Learning and Earth Observation to Support the Sustainable Development Goals: Current Approaches, Open Challenges, and Future Opportunities. IEEE Geoscience and Remote Sensing Magazine (2022).Google ScholarGoogle Scholar
  27. M. Raifer, Rafael Troilo, and F. et al. Kowatsch. 2019. OSHDB: a framework for spatio-temporal analysis of OpenStreetMap history data. (Aug. 2019). Google ScholarGoogle ScholarCross RefCross Ref
  28. Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You Only Look Once: Unified, Real-Time Object Detection. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 779--788. Google ScholarGoogle ScholarCross RefCross Ref
  29. Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In Advances in Neural Information Processing Systems, C. Cortes, N. Lawrence, D. Lee, M. Sugiyama, and R. Garnett (Eds.), Vol. 28. Curran Associates, Inc. https://proceedings.neurips.cc/paper_files/paper/2015/file/14bfa6bb14875e45bba028a21ed38046-Paper.pdfGoogle ScholarGoogle Scholar
  30. Hamid Rezatofighi, Nathan Tsoi, JunYoung Gwak, Amir Sadeghian, Ian Reid, and Silvio Savarese. 2019. Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression. In in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.Google ScholarGoogle ScholarCross RefCross Ref
  31. Dimitrios Sykas, Maria Sdraka, Dimitrios Zografakis, and Ioannis Papoutsis. 2022. A Sentinel-2 Multiyear, Multicountry Benchmark Dataset for Crop Classification and Segmentation With Deep Learning. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 15 (2022), 3323--3339. Google ScholarGoogle ScholarCross RefCross Ref
  32. Mingxing Tan, Ruoming Pang, and Quoc V. Le. 2020. EfficientDet: Scalable and Efficient Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).Google ScholarGoogle Scholar
  33. Franklin; Stensel David Tchobanoglous, George; Burton. 2003. (2003).Google ScholarGoogle Scholar
  34. Eric Tzeng, Judy Hoffman, Kate Saenko, and Trevor Darrell. 2017. Adversarial Discriminative Domain Adaptation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google ScholarGoogle ScholarCross RefCross Ref
  35. Brandon Victor, Zhen He, and Aiden Nibali. 2022. A systematic review of the use of Deep Learning in Satellite Imagery for Agriculture.Google ScholarGoogle Scholar
  36. Zhaoxan Wu, Hao Li, and Alexander Zipf. 2020. From Historical OpenStreetMap data to customized training samples for geospatial machine learning. Proceedings of the Academic Track at the State of the Map 2020 Online Conference (2020).Google ScholarGoogle Scholar
  37. Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, and Liangpei Zhang. 2018. DOTA: A Large-Scale Dataset for Object Detection in Aerial Images. In in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.Google ScholarGoogle Scholar
  38. Jiangye Yuan, Hsiu-Han Lexie Yang, Olufemi A. Omitaomu, and Budhendra L. Bhaduri. 2016. Large-scale solar panel mapping from aerial images using deep convolutional networks. In 2016 IEEE International Conference on Big Data (Big Data). 2703--2708. Google ScholarGoogle ScholarCross RefCross Ref
  39. Haoyang Zhang, Ying Wang, Feras Dayoub, and Sünderhauf Niko. 2021. VerifocalNet: An IoU-aware Dense Object Detector. In in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.Google ScholarGoogle Scholar
  40. Barret Zoph, Ekin D. Cubuk, Golnaz Ghiasi, Tsung-Yi Lin, Jonathon Shlens, and Quoc V. Le. 2020. Learning Data Augmentation Strategies for Object Detection. In Vedaldi, A., Bischof, H., Brox, T., Frahm, JM. (eds) Computer Vision - ECCV 2020. ECCV 2020. Lecture Notes in Computer Science(), vol 12372. Springer, Cham.Google ScholarGoogle Scholar

Index Terms

  1. Multiscale Multifeature Vision Learning for Scalable and Efficient Wastewater Treatment Plant Detection using Hi-Res Satellite Imagery and OSM

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Article Metrics

      • Downloads (Last 12 months)54
      • Downloads (Last 6 weeks)3

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader