Multiscale Multifeature Vision Learning for Scalable and Efficient Wastewater Treatment Plant Detection using Hi-Res Satellite Imagery and OSM

Authors:

Sukanya Randhawa
Heidelberg Institute for Geoinformation Technology, Heidelberg, Germany
https://orcid.org/0009-0005-5068-3246

Guntaj Randhawa
GIScience Chair, Institute of Geography, Heidelberg University, Heidelberg, Germany
https://orcid.org/0009-0006-3487-9326

Olena Sivak
GIScience Chair, Institute of Geography, Heidelberg University, Heidelberg, Germany
https://orcid.org/0009-0006-8708-931X

Johannes Zech
GIScience Chair, Institute of Geography, Heidelberg University, Heidelberg, Germany
https://orcid.org/0009-0002-3139-6813

Maria Martin
Heidelberg Institute for Geoinformation Technology, Heidelberg, Germany
https://orcid.org/0009-0003-4054-781X

Alexander Zipf
GIScience Chair, Institute of Geography, Heidelberg University, Heidelberg, Germany
Heidelberg Institute for Geoinformation Technology, Heidelberg, Germany
https://orcid.org/0000-0003-4916-9838

Yuze Li
GIScience Chair, Institute of Geography, Heidelberg University, Heidelberg, Germany
https://orcid.org/0000-0001-5184-2724

UrbanAI '23: Proceedings of the 1st ACM SIGSPATIAL International Workshop on Advances in Urban-AI
November 2023, Pages 10–21
https://doi.org/10.1145/3615900.3628772

Published: 29 November 2023

ABSTRACT

Filling data gaps across global regions requires a robust approach that can accurately detect objects in Earth observation data. A key challenge is the significant heterogeneity of satellite images and the variation in the features and characteristics of specific ground objects such as Wastewater Treatment Plants (WTPs). To overcome these challenges, we propose a novel multiscale multifeature hybrid model. This model leverages deep learning-based object detection models, namely YOLOv6, RTMDet, and EfficientDet, together with domain adaptation, to accurately and efficiently identify WTP locations worldwide. Our approach focuses on performance enhancements, including fewer false positives (FPs) and broad coverage. The strategies for achieving these improvements involve effective data processing, model tuning, and adaptation. Moreover, we optimize training data features using Volunteered Geographic Information (VGI) data. We demonstrate the effectiveness of the proposed approach in three diverse regions: Germany, France, and Malaysia. Our study gives new insights into WTP distribution when compared to existing databases such as OpenStreetMap (OSM). The resulting pipeline delivers good results even in challenging rural and urban contexts. It is also well suited for generating large-scale WTP datasets, which are useful for applications such as critical water infrastructure mapping, urban planning, and climate action.
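
As a rough illustration of the comparison against OSM mentioned in the abstract, the sketch below splits detector outputs into those near an existing OSM wastewater plant and those that may be missing from OSM. It is not the authors' pipeline: the function names, the detection tuple layout `(lat, lon, score)`, the score threshold, and the 250 m matching radius are all assumptions chosen for the example.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in metres


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two WGS84 points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def split_by_osm_match(detections, osm_points, min_score=0.5, max_dist_m=250.0):
    """Split detections into (already in OSM, candidates missing from OSM).

    detections: iterable of (lat, lon, score) tuples (hypothetical layout).
    osm_points: iterable of (lat, lon) tuples for known OSM plants.
    min_score and max_dist_m are illustrative thresholds, not published values.
    """
    matched, candidates = [], []
    for lat, lon, score in detections:
        if score < min_score:
            continue  # drop low-confidence detections to limit false positives
        near = any(
            haversine_m(lat, lon, olat, olon) <= max_dist_m
            for olat, olon in osm_points
        )
        (matched if near else candidates).append((lat, lon, score))
    return matched, candidates


if __name__ == "__main__":
    osm = [(49.4000, 8.6000)]
    dets = [(49.4005, 8.6005, 0.9), (49.5000, 8.7000, 0.8), (49.4001, 8.6001, 0.3)]
    in_osm, new = split_by_osm_match(dets, osm)
    print("already in OSM:", in_osm)
    print("candidates missing from OSM:", new)
```

The linear scan over `osm_points` is fine for small regions; at national scale a spatial index would be the more natural choice.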

Index Terms

Computing methodologies → Artificial intelligence → Computer vision → Computer vision problems → Object detection

Published in

UrbanAI '23: Proceedings of the 1st ACM SIGSPATIAL International Workshop on Advances in Urban-AI
November 2023
84 pages
ISBN:9798400703621
DOI:10.1145/3615900

Copyright © 2023 Owner/Author(s)
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
object detection
GeoAI
satellite imagery
OpenStreetMap
remote sensing
computer vision
infrastructure detection
wastewater treatment plants
deep learning
YOLO
YOLOv6
EfficientDet
RTMDet
domain adaptation
multiscale learning
VGI
Qualifiers
- research-article