Skip to main content

Loop Closure Detection Based on Local and Global Descriptors with Sinkhorn Algorithm

  • Conference paper
  • First Online:
6GN for Future Wireless Networks (6GN 2023)

Abstract

This paper presents a novel loop closure detection pipeline for SLAM systems, addressing the limitations of current deep learning methods in maintaining 3D point cloud structure and extracting high-quality semantic features. We utilize U-Net and FPN for feature extraction, with a descriptor generator that learns from local descriptors. The Sinkhorn algorithm is incorporated for 6DOF transformation matching between point clouds, effectively managing occlusions and aligning source and target clouds. Our method, evaluated on the KITTI dataset, outperforms traditional and other deep learning methods in computational efficiency and real-time performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Aoki, Y., Goforth, H., Srivatsan, R.A., Lucey, S.: PointNetLK: robust & efficient point cloud registration using PointNet. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7163–7172 (2019)

    Google Scholar 

  2. Arandjelovic, R., Gronat, P., Torii, A., Pajdla, T., Sivic, J.: NetVLAD: CNN architecture for weakly supervised place recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5297–5307. IEEE (2016)

    Google Scholar 

  3. Behley, J., et al.: SemanticKITTI: a dataset for semantic scene understanding of LiDAR sequences. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) (2019)

    Google Scholar 

  4. Besl, P.J., McKay, N.D.: Method for registration of 3-D shapes. In: Sensor Fusion IV: Control Paradigms and Data Structures, vol. 1611, pp. 586–606. SPIE (1992)

    Google Scholar 

  5. Cattaneo, D., Vaghi, M., Valada, A.: LCDNET: deep loop closure detection and point cloud registration for lidar slam. IEEE Trans. Rob. 38(4), 2074–2093 (2022)

    Article  Google Scholar 

  6. Chen, X., Läbe, T., Milioto, A., Röhling, T., Behley, J., Stachniss, C.: OverlapNet: a Siamese network for computing LiDAR scan similarity with applications to loop closing and localization. Auton. Robot 46, 1–21 (2022)

    Google Scholar 

  7. Chen, Y., Medioni, G.: Object modelling by registration of multiple range images. Image Vis. Comput. 10(3), 145–155 (1992)

    Article  Google Scholar 

  8. Cummins, M., Newman, P.: FAB-MAP: probabilistic localization and mapping in the space of appearance. Int. J. Rob. Res. 27(6), 647–665 (2008)

    Article  Google Scholar 

  9. Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)

    Article  MathSciNet  Google Scholar 

  10. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)

    Google Scholar 

  11. He, L., Wang, X., Zhang, H.: M2DP: a novel 3D point cloud descriptor and its application in loop closure detection. In: 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE (2016)

    Google Scholar 

  12. Jégou, H., Perronnin, F., Douze, M., Sánchez, J., Pérez, P., Schmid, C.: Aggregating local image descriptors into compact codes. IEEE Trans. Pattern Anal. Mach. Intell. 34(9), 1704–1716 (2011)

    Article  Google Scholar 

  13. Kim, G., Choi, S., Kim, A.: Scan Context++: structural place recognition robust to rotation and lateral variations in urban environments. IEEE Trans. Rob. 38(3), 1856–1874 (2021)

    Article  Google Scholar 

  14. Komorowski, J.: MinkLoc3D: point cloud based large-scale place recognition. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 1790–1799 (2021)

    Google Scholar 

  15. Kong, X., et al.: Semantic graph based place recognition for 3D point clouds. In: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 8216–8223. IEEE, October 2020

    Google Scholar 

  16. Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)

    Google Scholar 

  17. Milletari, F., Navab, N., Ahmadi, S.A.: V-Net: fully convolutional neural networks for volumetric medical image segmentation. In: 2016 Fourth International Conference on 3D Vision (3DV). IEEE (2016)

    Google Scholar 

  18. Mur-Artal, R., Tardós, J.D.: ORB-SLAM2: an open-source slam system for monocular, stereo, and RGB-D cameras. IEEE Trans. Rob. 33(5), 1255–1262 (2017). https://doi.org/10.1109/TRO.2017.2705103

    Article  Google Scholar 

  19. Oktay, O., et al.: Attention U-Net: learning where to look for the pancreas. arXiv preprint arXiv:1804.03999 (2018)

  20. Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. Int. J. Comput. Vision 42(3), 145–175 (2001)

    Article  Google Scholar 

  21. Pham, K., Le, K., Ho, N., Pham, T., Bui, H.: On unbalanced optimal transport: an analysis of Sinkhorn algorithm. In: International Conference on Machine Learning, pp. 7673–7682. PMLR, November 2020

    Google Scholar 

  22. Qiao, Z., Wang, H., Zhu, Y., Wang, H.: PLReg3D: learning 3D local and global descriptors jointly for global localization. In: 2021 27th International Conference on Mechatronics and Machine Vision in Practice (M2VIP), pp. 121–126. IEEE (2021)

    Google Scholar 

  23. Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28

    Chapter  Google Scholar 

  24. Rusu, R.B., Blodow, N., Beetz, M.: Fast point feature histograms (FPFH) for 3D registration. In: 2009 IEEE International Conference on Robotics and Automation. IEEE (2009)

    Google Scholar 

  25. Shi, S., et al.: PV-RCNN: point-voxel feature set abstraction for 3D object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10529–10538 (2020)

    Google Scholar 

  26. Wang, H., Wang, C., Xie, L.: Intensity scan context: coding intensity and geometry relations for loop closure detection. In: 2020 IEEE International Conference on Robotics and Automation (ICRA), pp. 2095–2101. IEEE, May 2020

    Google Scholar 

  27. Wang, Y., Sun, Z., Xu, C.Z., Sarma, S.E., Yang, J., Kong, H.: LiDAR Iris for loop-closure detection. In: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 5769–5775. IEEE, October 2020

    Google Scholar 

  28. Wang, Y., Solomon, J.M.: Deep closest point: learning representations for point cloud registration. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3523–3532 (2019)

    Google Scholar 

  29. Yang, H., Shi, J., Carlone, L.: TEASER: fast and certifiable point cloud registration. IEEE Trans. Rob. 37(2), 314–333 (2020)

    Article  Google Scholar 

  30. Yew, Z.J., Lee, G.H.: RPM-Net: robust point matching using learned features. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11824–11833 (2020)

    Google Scholar 

  31. Zhang, Z.: Iterative point matching for registration of free-form curves and surfaces. Int. J. Comput. Vision 13(2), 119–152 (1994)

    Article  Google Scholar 

  32. Zhou, Q.-Y., Park, J., Koltun, V.: Fast global registration. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 766–782. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_47

    Chapter  Google Scholar 

Download references

Acknowledgment

This work was supported by the National Natural Science Foundation of China (grant 61802247).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wei Xiao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Xiao, W., Zhu, D. (2024). Loop Closure Detection Based on Local and Global Descriptors with Sinkhorn Algorithm. In: Li, J., Zhang, B., Ying, Y. (eds) 6GN for Future Wireless Networks. 6GN 2023. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 553. Springer, Cham. https://doi.org/10.1007/978-3-031-53401-0_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-53401-0_29

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-53400-3

  • Online ISBN: 978-3-031-53401-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics