Abstract
Unmanned Aerial Vehicle (UAV) visual geo-localization aims to match images of the same geographic target captured from different views, i.e., the UAV view and the satellite view. The task is highly challenging due to the large appearance differences between UAV-satellite image pairs. Previous works map images captured by UAVs and satellites into a shared feature space and employ a classification framework to learn location-dependent features, while neglecting the overall distribution shift between the UAV view and the satellite view. In this paper, we address this limitation by introducing distribution alignment of the two views to shorten their distance in a common space. Specifically, we propose an end-to-end network called PVDA (Progressive View Distribution Alignment). During training, the feature encoder, the location classifier, and the view discriminator are jointly optimized by a novel progressive adversarial learning strategy. The competition between the feature encoder and the view discriminator pushes both of them to become stronger, and the adversarial learning is progressively emphasized until UAV-view images are indistinguishable from satellite-view images. As a result, PVDA learns location-dependent yet view-invariant features that generalize well to unseen images of new locations. Compared with state-of-the-art methods, the proposed PVDA requires less inference time yet achieves superior performance on the University-1652 dataset.
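To make the described training scheme concrete, below is a minimal PyTorch-style sketch of adversarial view alignment with a progressively increasing adversarial weight. It is an illustration under assumptions, not the authors' implementation: the gradient-reversal formulation and the lambda schedule are standard DANN-style components swapped in for illustration, the toy encoder stands in for the paper's backbone, and the class count (701, assumed to match the University-1652 training split) and all layer sizes are placeholders.

```python
# Hypothetical sketch of progressive adversarial view alignment (not the authors' code):
# a shared encoder feeds (i) a location classifier and (ii) a view discriminator attached
# through a gradient-reversal layer whose strength grows as training progresses.
import math

import torch
import torch.nn as nn
import torch.nn.functional as F


class GradReverse(torch.autograd.Function):
    """Identity in the forward pass; flips and scales gradients in the backward pass."""

    @staticmethod
    def forward(ctx, x, lam):
        ctx.lam = lam
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lam * grad_output, None


class PVDASketch(nn.Module):
    def __init__(self, feat_dim=512, num_locations=701):  # 701: assumed no. of training locations
        super().__init__()
        # Toy encoder standing in for the CNN backbone used in the paper.
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 64, kernel_size=3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, feat_dim), nn.ReLU(),
        )
        self.location_classifier = nn.Linear(feat_dim, num_locations)
        self.view_discriminator = nn.Linear(feat_dim, 2)  # UAV view vs. satellite view

    def forward(self, images, lam):
        feats = self.encoder(images)
        loc_logits = self.location_classifier(feats)
        # Reversed gradients push the encoder to make the two views indistinguishable.
        view_logits = self.view_discriminator(GradReverse.apply(feats, lam))
        return loc_logits, view_logits


def train_step(model, optimizer, images, loc_labels, view_labels, progress):
    """One update; `progress` in [0, 1] controls how strongly the adversary is emphasized."""
    lam = 2.0 / (1.0 + math.exp(-10.0 * progress)) - 1.0  # assumed DANN-style schedule
    loc_logits, view_logits = model(images, lam)
    loss = F.cross_entropy(loc_logits, loc_labels) + F.cross_entropy(view_logits, view_labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()


if __name__ == "__main__":
    model = PVDASketch()
    opt = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)
    images = torch.randn(8, 3, 64, 64)       # a mixed batch of UAV- and satellite-view crops
    loc_labels = torch.randint(0, 701, (8,))
    view_labels = torch.randint(0, 2, (8,))  # 0 = UAV view, 1 = satellite view
    print(train_step(model, opt, images, loc_labels, view_labels, progress=0.3))
```

In an actual setup the placeholder encoder would be replaced by the paper's backbone, and each batch would contain both UAV-view and satellite-view images of the same locations so that the discriminator sees both domains while the classifier sees shared location labels.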
This work was supported in part by the National Natural Science Foundation of China under Grant No. 62171295, in part by the Liaoning Provincial Natural Science Foundation under Grant No. 2021-MS-266, in part by the Applied Basic Research Project of Liaoning Province under Grant No. 2023JH2/101300204, and in part by the Shenyang Science and Technology Innovation Program for Young and Middle-aged Scientists under Grant No. RC210427.
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Liu, C., Liu, J., Qiu, H., Li, Z., Shi, X. (2023). View Distribution Alignment with Progressive Adversarial Learning for UAV Visual Geo-Localization. In: Jin, Z., Jiang, Y., Buchmann, R.A., Bi, Y., Ghiran, A.M., Ma, W. (eds) Knowledge Science, Engineering and Management. KSEM 2023. Lecture Notes in Computer Science, vol 14118. Springer, Cham. https://doi.org/10.1007/978-3-031-40286-9_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-40285-2
Online ISBN: 978-3-031-40286-9