Abstract
In response to the significant positioning errors that arise in visual localization algorithms for unmanned aerial vehicles (UAVs) when relying on drone image matching in areas devoid of satellite signals, we propose a deep learning-based algorithm named PnP-UGCSuperGlue. This algorithm employs a convolutional neural network (CNN) that is enhanced with a graph encoding module. The resulting enriched features contain vital information that refines the feature map and improves the overall accuracy of the visual localization process. The PnP-UGCSuperGlue framework initiates with the semantic feature extraction from both the real-time drone image and the geo-referenced image. This extraction process is facilitated by a CNN-based feature extractor. In the subsequent phase, a graph encoding module is integrated to aggregate the extracted features. This integration significantly enhances the quality of the generated feature keypoints and descriptors. Following this, a graph matching network is applied to leverage the generated descriptors, thereby facilitating a more precise feature point matching and filtering process. Ultimately, the perspective-n-point (PnP) method is utilized to calculate the rotation matrix and translation vector. This calculation is based on the results of the feature matching phase, as well as the camera intrinsic parameters and distortion coefficients. The proposed algorithm’s efficacy is validated through experimental evaluation, which demonstrates a mean absolute error of 0.0005 during the drone’s hovering state and 0.0083 during movement. These values indicate a significant reduction of 0.0010 and 0.0028, respectively, compared to the USuperGlue network.










Similar content being viewed by others
Availability of data and access
The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.
References
Shendryk Y, Sofonia J, Garrard R, Rist Y, Skocaj D, Thorburn P (2020) Fine-scale prediction of biomass and leaf nitrogen content in sugarcane using UAV LiDAR and multispectral imaging. Int J Appl Earth Observ Geoinf 92:102177. https://doi.org/10.1016/j.jag.2020.102177
Budiyono A, Higashino SI (2023) A review of the latest innovations in UAV technology. J Instrument Autom Syst 10(1):7–16. https://doi.org/10.5281/zenodo.8062292
ZHANG J (2021) Review of the light-weighted and small UAV system for aerial photography and remote sensing. National Remote Sens Bull 25(3):708-724. https://doi.org/10.11834/jrs.20210052
Gupta A, Fernando X (2022) Simultaneous localization and mapping (slam) and data fusion in unmanned aerial vehicles: recent advances and challenges. Drones 6(4):85. https://doi.org/10.3390/drones6040085
Zhou T, Hasheminasab SM, Habib A (2021) Tightly-coupled camera/LiDAR integration for point cloud generation from GNSS/INS-assisted UAV mapping systems. ISPRS J Photogram Remote Sens 180:336–356. https://doi.org/10.1016/j.isprsjprs.2021.08.020
Asadzadeh S, de Oliveira WJ, de Souza Filho CR (2022) UAV-based remote sensing for the petroleum industry and environmental monitoring: state-of-the-art and perspectives. J Petrol Sci Eng 208:109633. https://doi.org/10.1016/j.petrol.2021.109633
Pan M, Chen C, Yin X, Huang Z (2021) UAV-aided emergency environmental monitoring in infrastructure-less areas: LoRa mesh networking approach. IEEE Internet Things J 9(4):2918–2932. https://doi.org/10.1016/10.1109/JIOT.2021.3095494
Dong J, Ren X, Han S, Luo S (2022) UAV vision aided INS/odometer integration for land vehicle autonomous navigation. IEEE Trans Veh Technol 71(5):4825–4840. https://doi.org/10.1109/TVT.2022.3151729
Chaturvedi SK, Sekhar R, Banerjee S, Kamal H (2019) Comparative review study of military and civilian unmanned aerial vehicles (UAVs). INCAS Bull 11(3):183-198. https://doi.org/10.13111/2066-8201.2019.11.3.16
Deng L, He Y, Liu Q (2019) Research on application of fire uumanned aerial vehicles in emergency rescue. 2019 9th International Conference on Fire Science and Fire Protection Engineering (ICFSFPE), pp 1–5. https://doi.org/10.1109/ICFSFPE48751.2019.9055875
Lippiello V, Cacace J (2021) Robust visual localization of a uav over a pipe-rack based on the lie group se (3). IEEE Robot Autom Lett 7(1):295–302. https://doi.org/10.1109/LRA.2021.3125039
Guo Y, Zhou Y, Yang F (2023) AGCosPlace: a UAV visual positioning algorithm based on transformer. Drones 7(8):498. https://doi.org/10.3390/drones7080498
Jiang X, Ma J, Xiao G, Shao Z, Guo X (2021) A review of multimodal image matching: methods and applications. Inf Fusion 73:22–71. https://doi.org/10.1016/j.inffus.2021.02.012
Ma J, Jiang X, Fan A, Jiang J, Yan J (2021) Image matching from handcrafted to deep features: a survey. Int J Comput Vis 129:23–79. https://doi.org/10.1007/s11263-020-01359-2
Luo C, Yang W, Huang P, Zhou J (2019) Overview of image matching based on ORB algorithm. Int J Phys Conf Ser 1237(3):032020. https://doi.org/10.1088/1742-6596/1237/3/032020
Wang S, Guo Z, Liu Y (2021) An image matching method based on sift feature extraction and FLANN search algorithm improvement. J Phys Conf Ser 2037(1):012122. https://doi.org/10.1088/1742-6596/2037/1/012122
Zeng Q, Adu J, Liu J, Yang J, Xu Y, Gong M (2020) Real-time adaptive visible and infrared image registration based on morphological gradient and C_SIFT. J Real Time Image Process 17:1103–1115
Sedaghat A, Ebadi H (2015) Remote sensing image matching based on adaptive binning SIFT descriptor. IEEE Trans Geosci Remote Sens 53(10):5283–5293. https://doi.org/10.1109/TGRS.2015.2420659
Jiang Z, Liu X, Wang Q (2019) Visible and infrared image registration algorithm based on saliency and ORB. Laser Infrared 49(02):251–256
Ye Y, Bruzzone L, Shan J, Bovolo F, Zhu Q (2019) Fast and robust matching for multimodal remote sensing image registration. IEEE Trans Geosci Remote Sens 57(11):9059–9070. https://doi.org/10.1109/TGRS.2019.2924684
Zeng C, Wang J, Shi P (2013) A stereo image matching method to improve the DSM accuracy inside building boundaries. Can J Remote Sens 39(4):308–317. https://doi.org/10.5589/m13-039
Cao SY, Shen HL, Chen SJ, Li C (2020) Boosting structure consistency for multispectral and multimodal image registration. IEEE Trans Image Process 29:5147–5162.
Uss ML, Vozel B, Lukin VV, Chehdi K (2016) Multimodal remote sensing image registration with accuracy estimation at local and global scales. IEEE Trans Geosci Remote Sens 54(11):6587–6605.
DeTone D, Malisiewicz T, Rabinovich A (2018) Superpoint: Self-supervised interest point detection and description. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp 224–236. https://doi.org/10.48550/arXiv.1712.07629
Sarlin PE, DeTone D, Malisiewicz T, Rabinovich A (2020) Superglue: Learning feature matching with graph neural networks. In: Proceedings of the IEEE/CVF Conference on Computer vision and Pattern Recognition, pp 4938–4947.
Lan C, Lu W, Yu J, Xu Q (2021) Deep Learning algorithm for feature matching of cross modality remote sensing images. Acta Geodaetica et Cartographica Sinica 50(2):189-202.
Guo Y, Zhou Y, Yang F (2023) UAV scale enhanced cross-modality graph matching net-USCMGM-net. Multimedia Tools Appl, pp 1–20. https://doi.org/10.1007/s11042-023-16103-4
Zhou Y, Guo Y, Lin KP, Yang F, Li L (2023) USuperGlue: an unsupervised UAV image matching network based on local self-attention. Soft Comput, pp 1–21.
Zheng Z, Wei Y, Yang Y (2020) University-1652: A multi-view multi-source benchmark for drone-based geo-localization. In Proceedings of the 28th ACM International Conference on Multimedia, pp 1395–1403. https://doi.org/10.1145/3394171.3413896
Wang Z, Liu S, Chen G, Dong W (2021) Robust visual positioning of the UAV for the under bridge inspection with a ground guided vehicle. IEEE Trans Instrument Measure 71:1–10. https://doi.org/10.1109/TIM.2021.3135544
Sun J, Shen Z, Wang Y, Bao H, Zhou X (2021) LoFTR: Detector-free local feature matching with transformers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 8922-8931. https://doi.org/https://doi.org/10.48550/arXiv.2104.00680
Zhuang S, Zhao Z, Cao L, Wang D, Fu C, Du K (2023) A robust and fast method to the perspective-n-point problem for camera pose estimation. IEEE Sens J 23(11):11892–11906.
Charroud A, El Moutaouakil K, Palade V, Yahyaouy A (2024) Enhanced autoencoder-based lidar localization in self-driving vehicles. Appl Soft Comput 152:111225. https://doi.org/10.1016/j.asoc.2023.111225
Charroud A, Yahyaouy A, El Moutaouakil K, Onyekpe U (2022) Localisation and mapping of self-driving vehicles based on fuzzy K-means clustering: a non-semantic approach. In: 2022 International Conference on Intelligent Systems and Computer Vision (ISCV). IEEE, pp 1–8.
Dai M, Hu J, Zhuang J, Zheng E (2021) A transformer-based feature segmentation and region alignment method for uav-view geo-localization. IEEE Trans Circuits Syst Video Technol 32(7):4376–4389. https://doi.org/10.1109/TCSVT.2021.3135013
Dai M, Zheng E, Feng Z, Qi L, Zhuang J, Yang W (2023) Vision-based UAV self-positioning in low-altitude urban environments. IEEE Trans Image Process, pp 1–13. https://doi.org/10.1109/TIP.2023.3346279
Berton G, Masone C, Caputo B (2022) Rethinking visual geo-localization for large-scale applications. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4878–4888. https://doi.org/10.48550/arXiv.2204.02287
Couturier A, Akhloufi MA (2021) A review on absolute visual localization for UAV. Robot Autonom Syst 135:103666. https://doi.org/10.1016/j.robot.2020.103666
Wang Z, Shi D, Qiu C, Jin S, Li T, Shi Y, Liu Z, Qiao Z (2024) Sequence matching for Image-Based UAV-to-Satellite Geolocalization. IEEE Trans Geosci Remote Sens 62:1–15. https://doi.org/10.1109/TGRS.2024.3359605
Pan S, Wang X (2021) A survey on perspective-n-point problem. In: 2021 40th Chinese Control Conference (CCC). IEEE, pp 2396–2401. https://doi.org/10.23919/CCC52363.2021.9549863
Acknowledgements
This work was supported by the Special Foundation for Beijing Tianjin Hebei Basic Research Cooperation (J210008, H2021202008), and the Inner Mongolia Discipline Inspection and Supervision Big Data Laboratory (IMDBD202105).
Author information
Authors and Affiliations
Contributions
Conceptualization was presented by YG; methodology was provided by YG; software was developed by YG and FY; validation was approved by YG, YZ, and FY; formal analysis was performed by YG; data curation was conducted by YG and XZ; writing—original draft preparation was revised by YG; writing—review and editing were prepared by YG, YS, YY and WZ; visualization was provided YG; supervision was conducted by YZ and FY; project administration was approved by YZ; funding acquisition was analyzed by YZ. All authors have read and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Ethical and informed consent for data used
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Guo, Y., Yang, F., Si, Y. et al. PnP-UGCSuperGlue: deep learning drone image matching algorithm for visual localization. J Supercomput 80, 17711–17740 (2024). https://doi.org/10.1007/s11227-024-06128-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-024-06128-3