PnP-UGCSuperGlue: deep learning drone image matching algorithm for visual localization

Guo, Ya; Yang, Fan; Si, Yazhong; Yang, Yipu; Zhang, Wei; Zhang, Xiaolong; Zhou, Yatong

doi:10.1007/s11227-024-06128-3

PnP-UGCSuperGlue: deep learning drone image matching algorithm for visual localization

Published: 01 May 2024

Volume 80, pages 17711–17740, (2024)
Cite this article

The Journal of Supercomputing Aims and scope Submit manuscript

Ya Guo¹,
Fan Yang¹,
Yazhong Si¹,
Yipu Yang¹,
Wei Zhang¹,
Xiaolong Zhang^2,3 &
…
Yatong Zhou¹

446 Accesses
Explore all metrics

Abstract

In response to the significant positioning errors that arise in visual localization algorithms for unmanned aerial vehicles (UAVs) when relying on drone image matching in areas devoid of satellite signals, we propose a deep learning-based algorithm named PnP-UGCSuperGlue. This algorithm employs a convolutional neural network (CNN) that is enhanced with a graph encoding module. The resulting enriched features contain vital information that refines the feature map and improves the overall accuracy of the visual localization process. The PnP-UGCSuperGlue framework initiates with the semantic feature extraction from both the real-time drone image and the geo-referenced image. This extraction process is facilitated by a CNN-based feature extractor. In the subsequent phase, a graph encoding module is integrated to aggregate the extracted features. This integration significantly enhances the quality of the generated feature keypoints and descriptors. Following this, a graph matching network is applied to leverage the generated descriptors, thereby facilitating a more precise feature point matching and filtering process. Ultimately, the perspective-n-point (PnP) method is utilized to calculate the rotation matrix and translation vector. This calculation is based on the results of the feature matching phase, as well as the camera intrinsic parameters and distortion coefficients. The proposed algorithm’s efficacy is validated through experimental evaluation, which demonstrates a mean absolute error of 0.0005 during the drone’s hovering state and 0.0083 during movement. These values indicate a significant reduction of 0.0010 and 0.0028, respectively, compared to the USuperGlue network.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Lightweight visual localization algorithm for UAVs

Article Open access 19 February 2025

UAV scale enhanced cross-modality graph matching net-USCMGM-net

Article 18 August 2023

Speeding Up SIFT for Faster Feature-Based UAV Visual Localization

Availability of data and access

The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.

References

Shendryk Y, Sofonia J, Garrard R, Rist Y, Skocaj D, Thorburn P (2020) Fine-scale prediction of biomass and leaf nitrogen content in sugarcane using UAV LiDAR and multispectral imaging. Int J Appl Earth Observ Geoinf 92:102177. https://doi.org/10.1016/j.jag.2020.102177
Article Google Scholar
Budiyono A, Higashino SI (2023) A review of the latest innovations in UAV technology. J Instrument Autom Syst 10(1):7–16. https://doi.org/10.5281/zenodo.8062292
Article Google Scholar
ZHANG J (2021) Review of the light-weighted and small UAV system for aerial photography and remote sensing. National Remote Sens Bull 25(3):708-724. https://doi.org/10.11834/jrs.20210052
Gupta A, Fernando X (2022) Simultaneous localization and mapping (slam) and data fusion in unmanned aerial vehicles: recent advances and challenges. Drones 6(4):85. https://doi.org/10.3390/drones6040085
Article Google Scholar
Zhou T, Hasheminasab SM, Habib A (2021) Tightly-coupled camera/LiDAR integration for point cloud generation from GNSS/INS-assisted UAV mapping systems. ISPRS J Photogram Remote Sens 180:336–356. https://doi.org/10.1016/j.isprsjprs.2021.08.020
Article Google Scholar
Asadzadeh S, de Oliveira WJ, de Souza Filho CR (2022) UAV-based remote sensing for the petroleum industry and environmental monitoring: state-of-the-art and perspectives. J Petrol Sci Eng 208:109633. https://doi.org/10.1016/j.petrol.2021.109633
Article Google Scholar
Pan M, Chen C, Yin X, Huang Z (2021) UAV-aided emergency environmental monitoring in infrastructure-less areas: LoRa mesh networking approach. IEEE Internet Things J 9(4):2918–2932. https://doi.org/10.1016/10.1109/JIOT.2021.3095494
Article Google Scholar
Dong J, Ren X, Han S, Luo S (2022) UAV vision aided INS/odometer integration for land vehicle autonomous navigation. IEEE Trans Veh Technol 71(5):4825–4840. https://doi.org/10.1109/TVT.2022.3151729
Article Google Scholar
Chaturvedi SK, Sekhar R, Banerjee S, Kamal H (2019) Comparative review study of military and civilian unmanned aerial vehicles (UAVs). INCAS Bull 11(3):183-198. https://doi.org/10.13111/2066-8201.2019.11.3.16
Deng L, He Y, Liu Q (2019) Research on application of fire uumanned aerial vehicles in emergency rescue. 2019 9th International Conference on Fire Science and Fire Protection Engineering (ICFSFPE), pp 1–5. https://doi.org/10.1109/ICFSFPE48751.2019.9055875
Lippiello V, Cacace J (2021) Robust visual localization of a uav over a pipe-rack based on the lie group se (3). IEEE Robot Autom Lett 7(1):295–302. https://doi.org/10.1109/LRA.2021.3125039
Article Google Scholar
Guo Y, Zhou Y, Yang F (2023) AGCosPlace: a UAV visual positioning algorithm based on transformer. Drones 7(8):498. https://doi.org/10.3390/drones7080498
Article Google Scholar
Jiang X, Ma J, Xiao G, Shao Z, Guo X (2021) A review of multimodal image matching: methods and applications. Inf Fusion 73:22–71. https://doi.org/10.1016/j.inffus.2021.02.012
Article Google Scholar
Ma J, Jiang X, Fan A, Jiang J, Yan J (2021) Image matching from handcrafted to deep features: a survey. Int J Comput Vis 129:23–79. https://doi.org/10.1007/s11263-020-01359-2
Article MathSciNet Google Scholar
Luo C, Yang W, Huang P, Zhou J (2019) Overview of image matching based on ORB algorithm. Int J Phys Conf Ser 1237(3):032020. https://doi.org/10.1088/1742-6596/1237/3/032020
Article Google Scholar
Wang S, Guo Z, Liu Y (2021) An image matching method based on sift feature extraction and FLANN search algorithm improvement. J Phys Conf Ser 2037(1):012122. https://doi.org/10.1088/1742-6596/2037/1/012122
Article Google Scholar
Zeng Q, Adu J, Liu J, Yang J, Xu Y, Gong M (2020) Real-time adaptive visible and infrared image registration based on morphological gradient and C_SIFT. J Real Time Image Process 17:1103–1115
Article Google Scholar
Sedaghat A, Ebadi H (2015) Remote sensing image matching based on adaptive binning SIFT descriptor. IEEE Trans Geosci Remote Sens 53(10):5283–5293. https://doi.org/10.1109/TGRS.2015.2420659
Article Google Scholar
Jiang Z, Liu X, Wang Q (2019) Visible and infrared image registration algorithm based on saliency and ORB. Laser Infrared 49(02):251–256
Google Scholar
Ye Y, Bruzzone L, Shan J, Bovolo F, Zhu Q (2019) Fast and robust matching for multimodal remote sensing image registration. IEEE Trans Geosci Remote Sens 57(11):9059–9070. https://doi.org/10.1109/TGRS.2019.2924684
Article Google Scholar
Zeng C, Wang J, Shi P (2013) A stereo image matching method to improve the DSM accuracy inside building boundaries. Can J Remote Sens 39(4):308–317. https://doi.org/10.5589/m13-039
Article Google Scholar
Cao SY, Shen HL, Chen SJ, Li C (2020) Boosting structure consistency for multispectral and multimodal image registration. IEEE Trans Image Process 29:5147–5162.
Article Google Scholar
Uss ML, Vozel B, Lukin VV, Chehdi K (2016) Multimodal remote sensing image registration with accuracy estimation at local and global scales. IEEE Trans Geosci Remote Sens 54(11):6587–6605.
Article Google Scholar
DeTone D, Malisiewicz T, Rabinovich A (2018) Superpoint: Self-supervised interest point detection and description. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp 224–236. https://doi.org/10.48550/arXiv.1712.07629
Sarlin PE, DeTone D, Malisiewicz T, Rabinovich A (2020) Superglue: Learning feature matching with graph neural networks. In: Proceedings of the IEEE/CVF Conference on Computer vision and Pattern Recognition, pp 4938–4947.
Lan C, Lu W, Yu J, Xu Q (2021) Deep Learning algorithm for feature matching of cross modality remote sensing images. Acta Geodaetica et Cartographica Sinica 50(2):189-202.
Guo Y, Zhou Y, Yang F (2023) UAV scale enhanced cross-modality graph matching net-USCMGM-net. Multimedia Tools Appl, pp 1–20. https://doi.org/10.1007/s11042-023-16103-4
Zhou Y, Guo Y, Lin KP, Yang F, Li L (2023) USuperGlue: an unsupervised UAV image matching network based on local self-attention. Soft Comput, pp 1–21.
Zheng Z, Wei Y, Yang Y (2020) University-1652: A multi-view multi-source benchmark for drone-based geo-localization. In Proceedings of the 28th ACM International Conference on Multimedia, pp 1395–1403. https://doi.org/10.1145/3394171.3413896
Wang Z, Liu S, Chen G, Dong W (2021) Robust visual positioning of the UAV for the under bridge inspection with a ground guided vehicle. IEEE Trans Instrument Measure 71:1–10. https://doi.org/10.1109/TIM.2021.3135544
Article Google Scholar
Sun J, Shen Z, Wang Y, Bao H, Zhou X (2021) LoFTR: Detector-free local feature matching with transformers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 8922-8931. https://doi.org/https://doi.org/10.48550/arXiv.2104.00680
Zhuang S, Zhao Z, Cao L, Wang D, Fu C, Du K (2023) A robust and fast method to the perspective-n-point problem for camera pose estimation. IEEE Sens J 23(11):11892–11906.
Article Google Scholar
Charroud A, El Moutaouakil K, Palade V, Yahyaouy A (2024) Enhanced autoencoder-based lidar localization in self-driving vehicles. Appl Soft Comput 152:111225. https://doi.org/10.1016/j.asoc.2023.111225
Article Google Scholar
Charroud A, Yahyaouy A, El Moutaouakil K, Onyekpe U (2022) Localisation and mapping of self-driving vehicles based on fuzzy K-means clustering: a non-semantic approach. In: 2022 International Conference on Intelligent Systems and Computer Vision (ISCV). IEEE, pp 1–8.
Dai M, Hu J, Zhuang J, Zheng E (2021) A transformer-based feature segmentation and region alignment method for uav-view geo-localization. IEEE Trans Circuits Syst Video Technol 32(7):4376–4389. https://doi.org/10.1109/TCSVT.2021.3135013
Article Google Scholar
Dai M, Zheng E, Feng Z, Qi L, Zhuang J, Yang W (2023) Vision-based UAV self-positioning in low-altitude urban environments. IEEE Trans Image Process, pp 1–13. https://doi.org/10.1109/TIP.2023.3346279
Berton G, Masone C, Caputo B (2022) Rethinking visual geo-localization for large-scale applications. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4878–4888. https://doi.org/10.48550/arXiv.2204.02287
Couturier A, Akhloufi MA (2021) A review on absolute visual localization for UAV. Robot Autonom Syst 135:103666. https://doi.org/10.1016/j.robot.2020.103666
Article Google Scholar
Wang Z, Shi D, Qiu C, Jin S, Li T, Shi Y, Liu Z, Qiao Z (2024) Sequence matching for Image-Based UAV-to-Satellite Geolocalization. IEEE Trans Geosci Remote Sens 62:1–15. https://doi.org/10.1109/TGRS.2024.3359605
Article Google Scholar
Pan S, Wang X (2021) A survey on perspective-n-point problem. In: 2021 40th Chinese Control Conference (CCC). IEEE, pp 2396–2401. https://doi.org/10.23919/CCC52363.2021.9549863

Download references

Acknowledgements

This work was supported by the Special Foundation for Beijing Tianjin Hebei Basic Research Cooperation (J210008, H2021202008), and the Inner Mongolia Discipline Inspection and Supervision Big Data Laboratory (IMDBD202105).

Author information

Authors and Affiliations

School of Electronic and Information Engineering, Hebei University of Technology, Tianjin, 300401, China
Ya Guo, Fan Yang, Yazhong Si, Yipu Yang, Wei Zhang & Yatong Zhou
Chinese Electronics Technology Group 54th Institute, Shijiazhuang, 050081, China
Xiaolong Zhang
Hebei Province Unmanned System Intelligent Measurement and Control Information Technology Key Laboratory, Shijiazhuang, 050081, China
Xiaolong Zhang

Authors

Ya Guo
View author publications
You can also search for this author inPubMed Google Scholar
Fan Yang
View author publications
You can also search for this author inPubMed Google Scholar
Yazhong Si
View author publications
You can also search for this author inPubMed Google Scholar
Yipu Yang
View author publications
You can also search for this author inPubMed Google Scholar
Wei Zhang
View author publications
You can also search for this author inPubMed Google Scholar
Xiaolong Zhang
View author publications
You can also search for this author inPubMed Google Scholar
Yatong Zhou
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

Conceptualization was presented by YG; methodology was provided by YG; software was developed by YG and FY; validation was approved by YG, YZ, and FY; formal analysis was performed by YG; data curation was conducted by YG and XZ; writing—original draft preparation was revised by YG; writing—review and editing were prepared by YG, YS, YY and WZ; visualization was provided YG; supervision was conducted by YZ and FY; project administration was approved by YZ; funding acquisition was analyzed by YZ. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Yatong Zhou.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Ethical and informed consent for data used

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Guo, Y., Yang, F., Si, Y. et al. PnP-UGCSuperGlue: deep learning drone image matching algorithm for visual localization. J Supercomput 80, 17711–17740 (2024). https://doi.org/10.1007/s11227-024-06128-3

Download citation

Accepted: 08 April 2024
Published: 01 May 2024
Issue Date: August 2024
DOI: https://doi.org/10.1007/s11227-024-06128-3

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

PnP-UGCSuperGlue: deep learning drone image matching algorithm for visual localization

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Lightweight visual localization algorithm for UAVs

UAV scale enhanced cross-modality graph matching net-USCMGM-net

Speeding Up SIFT for Faster Feature-Based UAV Visual Localization

Availability of data and access

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical and informed consent for data used

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now