Skip to main content

Probability Loop Closure Detection with Fisher Kernel Framework for Visual SLAM

  • Conference paper
  • First Online:
Data Science (ICPCSEE 2022)

Abstract

A typical approach to describe an image in loop closure detection for visual SLAM is to extract a set of local patch descriptors and encode them into a co-occurrence vector. The most common patch encoding strategy is known as bag-of-visual-words (BoVW) representation, which consists of clustering the local descriptors into visual vocabulary. The distinctiveness of images is difficult to represent since most of them contain similar texture information, which may lead to false positive results. In this paper, the vocabulary is used as a whole by adopting the Fisher kernel (FK) framework. The new representation describes the image as the gradient vector of the likelihood function. The efficiently computed vectors can be compressed with a minimal loss of accuracy using product quantization and perform well in the task of loop closure detection. The proposed method achieves a higher recall rate with 100% precision in loop closure detection compared with state-of-the-art methods, and the detection on bidirectional loops is also enhanced. vSLAM systems may perceive the environment more efficiently by constructing a globally consistent map with the proposed loop closure detection method, which is potentially valuable for applications such as autonomous driving.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Arroyo, R., Alcantarilla, P.F., Bergasa, L.M., Yebes, J.J., Gamez, S.: Bidirectional loop closure detection on panoramas for visual navigation. In: Proceedings of IEEE Intelligent Vehicles Symposium (IV), pp. 1378–1383 (2014)

    Google Scholar 

  2. Arshad, S., Kim, G.W.: Role of deep learning in loop closure detection for visual and Lidar SLAM: a survey. Sensors 21(4), 1243 (2021)

    Article  Google Scholar 

  3. Bai, D., Wang, C., Bo, Z., Xiaodong, Y.I., Yang, X.: CNN feature boosted SeqSLAM for real-time loop closure detection. Chin. J. Electron. 27(3), 488–499 (2018)

    Article  Google Scholar 

  4. Bampis, L., Amanatiadis, A., Gasteratos, A.: Fast loop-closure detection using visual-word-vectors from image sequences. Int. J. Robot. Res. 37(1), 62–82 (2018)

    Article  Google Scholar 

  5. Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006). https://doi.org/10.1007/11744023_32

    Chapter  Google Scholar 

  6. Blanco, J.L., Moreno, F.A., González, J.: A collection of outdoor robotic datasets with centimeter-accuracy ground truth. Auton. Robot. 27(4), 327–351 (2009)

    Article  Google Scholar 

  7. Bonarini, A., Burgard, W., Fontana, G., Matteucci, M., Sorrenti, D.G., Tardos, J.D.: RAWSEEDS: robotics advancement through web-publishing of sensorial and elaborated extensive data sets. In: Proceedings of International Conference on Intelligent Robots and Systems Workshop on Benchmarks in Robotics Research (ICIRS) (2009)

    Google Scholar 

  8. Boureau, Y.L., Bach, F., Lecun, Y., Ponce, J.: Learning mid-level features for recognition. In: Computer Vision & Pattern Recognition, pp. 2559–2566 (2010)

    Google Scholar 

  9. Burri, M., et al.: The EuRoC micro aerial vehicle datasets. Int. J. Robot. Res. 35(10), 1157–1163 (2016)

    Article  Google Scholar 

  10. Cadena, C., et al.: Past, present, and future of simultaneous localization and mapping: toward the robust-perception age. IEEE Trans. Rob. 32(6), 1309–1332 (2016)

    Article  Google Scholar 

  11. Calonder, M., Lepetit, V., Strecha, C., Fua, P.: BRIEF: binary robust independent elementary features. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 778–792. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15561-1_56

    Chapter  Google Scholar 

  12. Dong, H., Yang, L., Wang, X.: Robust semi-supervised support vector machines with Laplace kernel-induced correntropy loss functions. Appl. Intell. 51(21), 1–15 (2021)

    Google Scholar 

  13. Galvez-López, D., Tardos, J.D.: Bags of binary words for fast place recognition in image sequences. IEEE Trans. Rob. 28(5), 1188–1197 (2012)

    Article  Google Scholar 

  14. Gao, X., Zhang, T.: Unsupervised learning to detect loops using deep neural networks for visual SLAM system. Auton. Robot. 41(1), 1–18 (2017)

    Article  MathSciNet  Google Scholar 

  15. Garcia-Fidalgo, E., Ortiz, A.: iBoW-LCD: an appearance-based loop closure detection approach using incremental bags of binary words. IEEE Robot. Autom. Lett. 99, 3051–3057 (2018)

    Article  Google Scholar 

  16. Ge, Z., Xiaoqiang, Y., Yangdong, Y.: Loop closure detection via maximization of mutual information. IEEE Access 7, 124217–124232 (2019)

    Article  Google Scholar 

  17. Geiger, A., Lenz, P., Stiller, C., Urtasun, R.: Vision meets robotics: the KITTI dataset. Int. J. Robot. Res. 32(11), 1231–1237 (2013)

    Article  Google Scholar 

  18. Van Gemert, J.C., Veenman, C.J., Smeulders, A.W., Geusebroek, J.M.: Visual word ambiguity. IEEE Trans. Pattern Anal. Mach. Intell. 32(7), 1271–1283 (2010)

    Article  Google Scholar 

  19. Gray, R.M., Neuhoff, D.L.: Quantization. IEEE Trans. Inf. Theory 44(6), 2325–2383 (1998)

    Article  Google Scholar 

  20. Guclu, O., Can, A.B.: Fast and effective loop closure detection to improve SLAM performance. J. Intell. Rob. Syst. 93(3), 495–517 (2019)

    Article  Google Scholar 

  21. Gupta, A., Barbu, A.: Parameterized principal component analysis. Pattern Recogn. 78, 215–227 (2018)

    Article  Google Scholar 

  22. Han, D., Li, Y., Song, T., Liu, Z.: Multi-objective optimization of loop closure detection parameters for indoor 2D simultaneous localization and mapping. Sensors 20(7), 1906 (2020)

    Article  Google Scholar 

  23. Haosheng, C., Ge, Z., Yangdong, Y.: Semantic loop closure detection with instance-level inconsistency removal in dynamic industrial scenes. IEEE Trans. Ind. Inf. 17(3), 2030–2040 (2021)

    Article  Google Scholar 

  24. Hervé, J., Matthijs, D., Cordelia, S.: Product quantization for nearest neighbor search. IEEE Trans. Pattern Anal. Mach. Intell. 33(1), 117–128 (2010)

    Google Scholar 

  25. Jégou, H., Douze, M., Schmid, C., Perez, P.: Aggregating local descriptors into a compact image representation. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3304–3311, July 2010

    Google Scholar 

  26. Klein, G., Murray, D.: Parallel tracking and mapping for small AR workspaces. In: IEEE International Symposium on Mixed and Augmented Reality (ISMAR), pp. 1–10 (2008)

    Google Scholar 

  27. Ksibi, S., Mejdoub, M., Amar, C.B.: Deep salient-Gaussian Fisher vector encoding of the spatio-temporal trajectory structures for person re-identification. Multimed. Tools Appl. 78(2), 1583–1611 (2018). https://doi.org/10.1007/s11042-018-6200-5

    Article  Google Scholar 

  28. Kwon, H., Yousef, K.M.A., Kak, A.C.: Building 3D visual maps of interior space with a new hierarchical sensor fusion architecture. Robot. Auton. Syst. 61(8), 749–767 (2013)

    Article  Google Scholar 

  29. Labbe, M., Michaud, F.: Appearance-based loop closure detection for online large-scale and long-term operation. IEEE Trans. Rob. 29(3), 734–745 (2013)

    Article  Google Scholar 

  30. Latif, Y., Cadena, C., Neira, J.: Robust loop closing over time for pose graph SLAM. Int. J. Robot. Res. 32(14), 1611–1626 (2013)

    Article  Google Scholar 

  31. Marr, D., Hildreth, E.: Theory of edge detection. In: Proceedings of the Royal Society of London, vol. 207, pp. 187–217 (1980)

    Google Scholar 

  32. Memon, A.R., Wang, H., Hussain, A.: Loop closure detection using supervised and unsupervised deep neural networks for monocular SLAM systems. Robot. Auton. Syst. 126, 103470 (2020)

    Google Scholar 

  33. Mur-Artal, R., Tardós, J.D.: Fast relocalisation and loop closing in keyframe-based slam. In: Proceedings of IEEE International Conference on Robotics and Automation (ICRA), pp. 846–853 (2014)

    Google Scholar 

  34. Perronnin, F., Dance, C.R.: Fisher kernels on visual vocabularies for image categorization. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–8 (2008)

    Google Scholar 

  35. Perronnin, F., Dance, C., Csurka, G., Bressan, M.: Adapted vocabularies for generic visual categorization. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 464–475. Springer, Heidelberg (2006). https://doi.org/10.1007/11744085_36

    Chapter  Google Scholar 

  36. Perronnin, F., Yan, L., Sánchez, J., Poirier, H.: Large-scale image retrieval with compressed fisher vectors. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3384–3391 (2010)

    Google Scholar 

  37. Pire, T., Fischer, T., Civera, J., Cristóforis, P.D., Berlles, J.J.: Stereo parallel tracking and mapping for robot localization. In: Proceedings of International Conference on Intelligent Robots and Systems (ICIRS) (2015)

    Google Scholar 

  38. Rosten, E., Drummond, T.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006). https://doi.org/10.1007/11744023_34

    Chapter  Google Scholar 

  39. Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to sift or SURF. In: International Conference on Computer Vision, pp. 2564–2571 (2012)

    Google Scholar 

  40. Safarinejadian, B., Mozaffari, M.: A distributed averaging-based evidential expectation-maximization algorithm for density estimation in unreliable sensor networks. Measurement 165, 108–162 (2020)

    Article  Google Scholar 

  41. Sánchez, J., Perronnin, F., Mensink, T., Verbeek, J.: Image classification with the fisher vector: theory and practice. Int. J. Comput. Vis. 105(3), 222–245 (2013)

    Article  MathSciNet  Google Scholar 

  42. Sünderhauf, N., et al.: Place recognition with convnet landmarks: viewpoint-robust, condition-robust, training-free. In: Robotics: Science and Systems, p. 296 (2015)

    Google Scholar 

  43. Tonellotto, N., Gotta, A., Nardini, F.M., Gadler, D., Silvestri, F.: Neural network quantization in federated learning at the edge. Inf. Sci. 575(4), 417–436 (2021)

    Article  MathSciNet  Google Scholar 

  44. Uchida, Y., Sakazawa, S.: Image retrieval with fisher vectors of binary features. In: Proceedings of IAPR Asian Conference on Pattern Recognition (ACPR), pp. 1–11 (2017)

    Google Scholar 

  45. Wang, J., Yang, J., Kai, Y., Lv, F., Huang, T.S., Gong, Y.: Locality-constrained linear coding for image classification. In: Computer Vision & Pattern Recognition, pp. 3360–3367 (2010)

    Google Scholar 

  46. Yan, X., Ye, Y., Qiu, X., Yu, H.: Synergetic information bottleneck for joint multi-view and ensemble clustering. Inf. Fusion 56, 15–27 (2020)

    Article  Google Scholar 

  47. Yang, Y., Mémin, E.: Estimation of physical parameters under location uncertainty using an ensemble2-expectation-maximization algorithm. Q. J. R. Meteorol. Soc. 145(719), 418–433 (2019)

    Article  Google Scholar 

  48. Younes, G., Asmar, D., Shammas, E., Zelek, J.: Keyframe-based monocular SLAM: design, survey, and future directions. Robot. Auton. Syst. 98, 67–88 (2017)

    Article  Google Scholar 

  49. Zhou, W., Zhang, L., Gao, S., Lou, X.: Gradient-based feature extraction from raw Bayer pattern images. IEEE Trans. Image Process. 30, 5122–5137 (2021)

    Article  MathSciNet  Google Scholar 

  50. Zhu, Z., Xu, X., Liu, X., Jiang, Y.: LFM: a lightweight LCD algorithm based on feature matching between similar key frames. Sensors 21(13), 4499 (2021)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ge Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, G., Zuo, Q., Dang, H. (2022). Probability Loop Closure Detection with Fisher Kernel Framework for Visual SLAM. In: Wang, Y., Zhu, G., Han, Q., Wang, H., Song, X., Lu, Z. (eds) Data Science. ICPCSEE 2022. Communications in Computer and Information Science, vol 1628. Springer, Singapore. https://doi.org/10.1007/978-981-19-5194-7_17

Download citation

  • DOI: https://doi.org/10.1007/978-981-19-5194-7_17

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-19-5193-0

  • Online ISBN: 978-981-19-5194-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics