Loop Closure Detection Based on Local and Global Descriptors with Sinkhorn Algorithm

Xiao, Wei; Zhu, Dong

doi:10.1007/978-3-031-53401-0_29

Wei Xiao¹⁸ &
Dong Zhu¹⁸

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 553))

Included in the following conference series:

International Conference on 6GN for Future Wireless Networks

78 Accesses

Abstract

This paper presents a novel loop closure detection pipeline for SLAM systems, addressing the limitations of current deep learning methods in maintaining 3D point cloud structure and extracting high-quality semantic features. We utilize U-Net and FPN for feature extraction, with a descriptor generator that learns from local descriptors. The Sinkhorn algorithm is incorporated for 6DOF transformation matching between point clouds, effectively managing occlusions and aligning source and target clouds. Our method, evaluated on the KITTI dataset, outperforms traditional and other deep learning methods in computational efficiency and real-time performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Aoki, Y., Goforth, H., Srivatsan, R.A., Lucey, S.: PointNetLK: robust & efficient point cloud registration using PointNet. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7163–7172 (2019)
Google Scholar
Arandjelovic, R., Gronat, P., Torii, A., Pajdla, T., Sivic, J.: NetVLAD: CNN architecture for weakly supervised place recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5297–5307. IEEE (2016)
Google Scholar
Behley, J., et al.: SemanticKITTI: a dataset for semantic scene understanding of LiDAR sequences. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) (2019)
Google Scholar
Besl, P.J., McKay, N.D.: Method for registration of 3-D shapes. In: Sensor Fusion IV: Control Paradigms and Data Structures, vol. 1611, pp. 586–606. SPIE (1992)
Google Scholar
Cattaneo, D., Vaghi, M., Valada, A.: LCDNET: deep loop closure detection and point cloud registration for lidar slam. IEEE Trans. Rob. 38(4), 2074–2093 (2022)
Article Google Scholar
Chen, X., Läbe, T., Milioto, A., Röhling, T., Behley, J., Stachniss, C.: OverlapNet: a Siamese network for computing LiDAR scan similarity with applications to loop closing and localization. Auton. Robot 46, 1–21 (2022)
Google Scholar
Chen, Y., Medioni, G.: Object modelling by registration of multiple range images. Image Vis. Comput. 10(3), 145–155 (1992)
Article Google Scholar
Cummins, M., Newman, P.: FAB-MAP: probabilistic localization and mapping in the space of appearance. Int. J. Rob. Res. 27(6), 647–665 (2008)
Article Google Scholar
Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)
Article MathSciNet Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
He, L., Wang, X., Zhang, H.: M2DP: a novel 3D point cloud descriptor and its application in loop closure detection. In: 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE (2016)
Google Scholar
Jégou, H., Perronnin, F., Douze, M., Sánchez, J., Pérez, P., Schmid, C.: Aggregating local image descriptors into compact codes. IEEE Trans. Pattern Anal. Mach. Intell. 34(9), 1704–1716 (2011)
Article Google Scholar
Kim, G., Choi, S., Kim, A.: Scan Context++: structural place recognition robust to rotation and lateral variations in urban environments. IEEE Trans. Rob. 38(3), 1856–1874 (2021)
Article Google Scholar
Komorowski, J.: MinkLoc3D: point cloud based large-scale place recognition. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 1790–1799 (2021)
Google Scholar
Kong, X., et al.: Semantic graph based place recognition for 3D point clouds. In: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 8216–8223. IEEE, October 2020
Google Scholar
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Google Scholar
Milletari, F., Navab, N., Ahmadi, S.A.: V-Net: fully convolutional neural networks for volumetric medical image segmentation. In: 2016 Fourth International Conference on 3D Vision (3DV). IEEE (2016)
Google Scholar
Mur-Artal, R., Tardós, J.D.: ORB-SLAM2: an open-source slam system for monocular, stereo, and RGB-D cameras. IEEE Trans. Rob. 33(5), 1255–1262 (2017). https://doi.org/10.1109/TRO.2017.2705103
Article Google Scholar
Oktay, O., et al.: Attention U-Net: learning where to look for the pancreas. arXiv preprint arXiv:1804.03999 (2018)
Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. Int. J. Comput. Vision 42(3), 145–175 (2001)
Article Google Scholar
Pham, K., Le, K., Ho, N., Pham, T., Bui, H.: On unbalanced optimal transport: an analysis of Sinkhorn algorithm. In: International Conference on Machine Learning, pp. 7673–7682. PMLR, November 2020
Google Scholar
Qiao, Z., Wang, H., Zhu, Y., Wang, H.: PLReg3D: learning 3D local and global descriptors jointly for global localization. In: 2021 27th International Conference on Mechatronics and Machine Vision in Practice (M2VIP), pp. 121–126. IEEE (2021)
Google Scholar
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Chapter Google Scholar
Rusu, R.B., Blodow, N., Beetz, M.: Fast point feature histograms (FPFH) for 3D registration. In: 2009 IEEE International Conference on Robotics and Automation. IEEE (2009)
Google Scholar
Shi, S., et al.: PV-RCNN: point-voxel feature set abstraction for 3D object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10529–10538 (2020)
Google Scholar
Wang, H., Wang, C., Xie, L.: Intensity scan context: coding intensity and geometry relations for loop closure detection. In: 2020 IEEE International Conference on Robotics and Automation (ICRA), pp. 2095–2101. IEEE, May 2020
Google Scholar
Wang, Y., Sun, Z., Xu, C.Z., Sarma, S.E., Yang, J., Kong, H.: LiDAR Iris for loop-closure detection. In: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 5769–5775. IEEE, October 2020
Google Scholar
Wang, Y., Solomon, J.M.: Deep closest point: learning representations for point cloud registration. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3523–3532 (2019)
Google Scholar
Yang, H., Shi, J., Carlone, L.: TEASER: fast and certifiable point cloud registration. IEEE Trans. Rob. 37(2), 314–333 (2020)
Article Google Scholar
Yew, Z.J., Lee, G.H.: RPM-Net: robust point matching using learned features. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11824–11833 (2020)
Google Scholar
Zhang, Z.: Iterative point matching for registration of free-form curves and surfaces. Int. J. Comput. Vision 13(2), 119–152 (1994)
Article Google Scholar
Zhou, Q.-Y., Park, J., Koltun, V.: Fast global registration. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 766–782. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_47
Chapter Google Scholar

Download references

Acknowledgment

This work was supported by the National Natural Science Foundation of China (grant 61802247).

Author information

Authors and Affiliations

School of Electronic and Information, Shanghai Dianji University, Shanghai, China
Wei Xiao & Dong Zhu

Authors

Wei Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Dong Zhu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wei Xiao .

Editor information

Editors and Affiliations

Shanghai Dianji University, Shanghai, China
Jingchao Li
Kanagawa University, Yokohama, Japan
Bin Zhang
Shanghai University of Electric Power, Yangpu, China
Yulong Ying

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xiao, W., Zhu, D. (2024). Loop Closure Detection Based on Local and Global Descriptors with Sinkhorn Algorithm. In: Li, J., Zhang, B., Ying, Y. (eds) 6GN for Future Wireless Networks. 6GN 2023. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 553. Springer, Cham. https://doi.org/10.1007/978-3-031-53401-0_29

Download citation

DOI: https://doi.org/10.1007/978-3-031-53401-0_29
Published: 09 March 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-53400-3
Online ISBN: 978-3-031-53401-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Loop Closure Detection Based on Local and Global Descriptors with Sinkhorn Algorithm