short-paper

Open access

WAGL: Extreme Weather Adaptive Method for Robust and Generalizable UAV-based Cross-View Geo-localization

Authors:

Chi-Man VongAuthors Info & Claims

UAVM '24: Proceedings of the 2nd Workshop on UAVs in Multimedia: Capturing the World from a New Perspective

Pages 14 - 18

https://doi.org/10.1145/3689095.3689100

Published: 28 October 2024 Publication History

Abstract

As drones become increasingly utilized across various fields, related multimedia applications are also emerging. One significant application is cross-view geo-localization, which leverages aerial drone and satellite imagery data to facilitate drone navigation and geo-localization. In this paper, we focus on the robustness and generalization of retrieval under various extreme weather conditions. Considering the significant gap between training and testing data, our research emphasizes exploring and employing a powerful self-supervised backbone and an unsupervised aggregator to achieve domain adaptation. Additionally, from a data perspective, we simulate various weather conditions to bridge the gap between training and testing drone data through data augmentation. Futhermore, a cross-weather triplet loss is utilized to minimize the domain differences between drone and satellite images under extreme weather conditions. Our method achieves 94.07% Recall@1 accuracy on University-160k-WX, and ranks 4th in the UAVM2024 Challenge. Code will be released at https://github.com/SunJ1025/WAGL.

References

[1]

Relja Arandjelovic and Andrew Zisserman. 2013. All about VLAD. In Pro- ceedings of the IEEE conference on Computer Vision and Pattern Recognition. 1578--1585.

[2]

Yoshua Bengio, Aaron Courville, and Pascal Vincent. 2013. Representation learning: A review and new perspectives. IEEE transactions on pattern analysis and machine intelligence 35, 8 (2013), 1798--1828.

Digital Library

[3]

Aditya Chattopadhay, Anirban Sarkar, Prantik Howlader, and Vineeth N Balasub- ramanian. 2018. Grad-cam: Generalized gradient-based visual explanations for deep convolutional networks. In 2018 IEEE winter conference on applications of computer vision (WACV). IEEE, 839--847.

[4]

Yireng Chen, Zihao Yang, and Quan Chen. 2023. A Cross-View Matching Method Based on Dense Partition Strategy for UAV Geolocalization. In Proceedings of the 2023 Workshop on UAVs in Multimedia: Capturing the World from a New Perspective. 19--23.

Digital Library

[5]

Nikhil Keetha, Avneesh Mishra, Jay Karhade, Krishna Murthy Jatavallabhula, Sebastian Scherer, Madhava Krishna, and Sourav Garg. 2023. Anyloc: Towards universal visual place recognition. IEEE Robotics and Automation Letters (2023).

[6]

Haoran Li, Quan Chen, Zhiwen Yang, and Jiong Yin. 2023. Drone Satellite Matching based on Multi-scale Local Pattern Network. In Proceedings of the 2023 Workshop on UAVs in Multimedia: Capturing the World from a New Perspective. 51--55.

Digital Library

[7]

Liu Liu and Hongdong Li. 2019. Lending orientation to neural networks for cross- view geo-localization. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 5624--5633.

[8]

Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El- Nouby, et al. 2023. Dinov2: Learning robust visual features without supervision. arXiv preprint arXiv:2304.07193 (2023).

[9]

Filip Radenovi´c, Giorgos Tolias, and Ond'rej Chum. 2018. Fine-tuning CNN image retrieval with no human annotation. IEEE transactions on pattern analysis and machine intelligence 41, 7 (2018), 1655--1668.

[10]

Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015. Facenet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE conference on computer vision and pattern recognition. 815--823.

[11]

Junge Shen, Jian Sun, Xin Wang, and Zhaoyong Mao. 2022. Joint metric learning of local and global features for vehicle re-identification. Complex & Intelligent Systems 8, 5 (2022), 4005--4020.

[12]

Tianrui Shen, Yingmei Wei, Lai Kang, Shanshan Wan, and Yee-Hong Yang. 2023. MCCG: A ConvNeXt-based multiple-classifier method for cross-view geo- localization. IEEE Transactions on Circuits and Systems for Video Technology (2023).

[13]

Jian Sun, Hao Sun, Lin Lei, Kefeng Ji, and Gangyao Kuang. 2024. TirSA: A Three Stage Approach for UAV-Satellite Cross-View Geo-Localization Based on Self-Supervised Feature Enhancement. IEEE Transactions on Circuits and Systems for Video Technology (2024).

[14]

Tingyu Wang, Zhedong Zheng, Chenggang Yan, Jiyong Zhang, Yaoqi Sun, Bolun Zheng, and Yi Yang. 2021. Each part matters: Local patterns facilitate cross-view geo-localization. IEEE Transactions on Circuits and Systems for Video Technology 32, 2 (2021), 867--879.

[15]

Sanghyun Woo, Shoubhik Debnath, Ronghang Hu, Xinlei Chen, Zhuang Liu, In So Kweon, and Saining Xie. 2023. Convnext v2: Co-designing and scaling convnets with masked autoencoders. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 16133--16142.

[16]

Scott Workman, Richard Souvenir, and Nathan Jacobs. 2015. Wide-area im- age geolocalization with aerial reference imagery. In Proceedings of the IEEE International Conference on Computer Vision. 3961--3969.

Digital Library

[17]

Zhedong Zheng, Yujiao Shi, Tingyu Wang, Chen Chen, Pengfei Zhu, and Richard Hartley. 2024. The 2nd Workshop on UAVs in Multimedia: Capturing the World from a New Perspective. In Proceedings of the 32nd ACM International Confer- ence on Multimedia Workshop.

[18]

Zhedong Zheng, Yujiao Shi, Tingyu Wang, Jun Liu, Jianwu Fang, Yunchao Wei, and Tat-seng Chua. 2023. UAVM'23: 2023 Workshop on UAVs in Multimedia: Capturing the World from a New Perspective. In Proceedings of the 31st ACM International Conference on Multimedia. 9715--9717.

[19]

Zhedong Zheng, Yunchao Wei, and Yi Yang. 2020. University-1652: A multi-view multi-source benchmark for drone-based geo-localization. In Proceedings of the 28th ACM international conference on Multimedia. 1395--1403.

Digital Library

[20]

Zhedong Zheng, Liang Zheng, Michael Garrett, Yi Yang, Mingliang Xu, and Yi-Dong Shen. 2020. Dual-path convolutional image-text embeddings with in- stance loss. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 16, 2 (2020), 1--23.

Digital Library

[21]

Sijie Zhu, Taojiannan Yang, and Chen Chen. 2021. Vigor: Cross-view image geo-localization beyond one-to-one retrieval. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3640--3649.

Index Terms

WAGL: Extreme Weather Adaptive Method for Robust and Generalizable UAV-based Cross-View Geo-localization
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision representations
        Image representations

Recommendations

WELN: Siamese Network-based Framework for Geo-localization in Extreme Weather
UAVM '24: Proceedings of the 2nd Workshop on UAVs in Multimedia: Capturing the World from a New Perspective

Cross-view geo-localization is a task of matching the same geographic image from differerent views, e.g., drone and satellite. Due to its GPS-free advantage, cross-view geo-localization is gaining increasing research interest, especially in drone-based ...
Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models
UAVM '24: Proceedings of the 2nd Workshop on UAVs in Multimedia: Capturing the World from a New Perspective

Cross-view geo-localization in GNSS-denied environments aims to determine an unknown location by matching drone-view images with the correct geo-tagged satellite-view images from a large gallery. Recent research shows that learning discriminative image ...
University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization
MM '20: Proceedings of the 28th ACM International Conference on Multimedia

We consider the problem of cross-view geo-localization. The primary challenge is to learn the robust feature against large viewpoint changes. Existing benchmarks can help, but are limited in the number of viewpoints. Image pairs, containing two ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

UAVM '24: Proceedings of the 2nd Workshop on UAVs in Multimedia: Capturing the World from a New Perspective

October 2024

41 pages

ISBN:9798400712067

DOI:10.1145/3689095

General Chairs:
Zhedong Zheng
University of Macau, China
,
Yujiao Shi
ShanghaiTech University, China
,
Tingyu Wang
Hangzhou Dianzi University, China
,
Chen Chen
University of Central Florida, USA
,
Pengfei Zhu
Tianjin University, China
,
Richard Hartley
Australian National University, Australia

Copyright © 2024 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 October 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Funding Sources

This work was supported by -- Shenzhen Science and Technology Innovation Committee (File No. SGDX20220530111001006), and the Hong Kong and Macau Joint Research and Development Fund of Wuyi University (File No. 2021WGALH19).

Conference

MM '24

Sponsor:

SIGMM

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne VIC, Australia

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
97
Total Downloads

Downloads (Last 12 months)97
Downloads (Last 6 weeks)59

Reflects downloads up to 06 Jan 2025

Other Metrics

View Author Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents