Abstract
A typical approach to describe an image in loop closure detection for visual SLAM is to extract a set of local patch descriptors and encode them into a co-occurrence vector. The most common patch encoding strategy is known as bag-of-visual-words (BoVW) representation, which consists of clustering the local descriptors into visual vocabulary. The distinctiveness of images is difficult to represent since most of them contain similar texture information, which may lead to false positive results. In this paper, the vocabulary is used as a whole by adopting the Fisher kernel (FK) framework. The new representation describes the image as the gradient vector of the likelihood function. The efficiently computed vectors can be compressed with a minimal loss of accuracy using product quantization and perform well in the task of loop closure detection. The proposed method achieves a higher recall rate with 100% precision in loop closure detection compared with state-of-the-art methods, and the detection on bidirectional loops is also enhanced. vSLAM systems may perceive the environment more efficiently by constructing a globally consistent map with the proposed loop closure detection method, which is potentially valuable for applications such as autonomous driving.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Arroyo, R., Alcantarilla, P.F., Bergasa, L.M., Yebes, J.J., Gamez, S.: Bidirectional loop closure detection on panoramas for visual navigation. In: Proceedings of IEEE Intelligent Vehicles Symposium (IV), pp. 1378–1383 (2014)
Arshad, S., Kim, G.W.: Role of deep learning in loop closure detection for visual and Lidar SLAM: a survey. Sensors 21(4), 1243 (2021)
Bai, D., Wang, C., Bo, Z., Xiaodong, Y.I., Yang, X.: CNN feature boosted SeqSLAM for real-time loop closure detection. Chin. J. Electron. 27(3), 488–499 (2018)
Bampis, L., Amanatiadis, A., Gasteratos, A.: Fast loop-closure detection using visual-word-vectors from image sequences. Int. J. Robot. Res. 37(1), 62–82 (2018)
Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006). https://doi.org/10.1007/11744023_32
Blanco, J.L., Moreno, F.A., González, J.: A collection of outdoor robotic datasets with centimeter-accuracy ground truth. Auton. Robot. 27(4), 327–351 (2009)
Bonarini, A., Burgard, W., Fontana, G., Matteucci, M., Sorrenti, D.G., Tardos, J.D.: RAWSEEDS: robotics advancement through web-publishing of sensorial and elaborated extensive data sets. In: Proceedings of International Conference on Intelligent Robots and Systems Workshop on Benchmarks in Robotics Research (ICIRS) (2009)
Boureau, Y.L., Bach, F., Lecun, Y., Ponce, J.: Learning mid-level features for recognition. In: Computer Vision & Pattern Recognition, pp. 2559–2566 (2010)
Burri, M., et al.: The EuRoC micro aerial vehicle datasets. Int. J. Robot. Res. 35(10), 1157–1163 (2016)
Cadena, C., et al.: Past, present, and future of simultaneous localization and mapping: toward the robust-perception age. IEEE Trans. Rob. 32(6), 1309–1332 (2016)
Calonder, M., Lepetit, V., Strecha, C., Fua, P.: BRIEF: binary robust independent elementary features. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 778–792. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15561-1_56
Dong, H., Yang, L., Wang, X.: Robust semi-supervised support vector machines with Laplace kernel-induced correntropy loss functions. Appl. Intell. 51(21), 1–15 (2021)
Galvez-López, D., Tardos, J.D.: Bags of binary words for fast place recognition in image sequences. IEEE Trans. Rob. 28(5), 1188–1197 (2012)
Gao, X., Zhang, T.: Unsupervised learning to detect loops using deep neural networks for visual SLAM system. Auton. Robot. 41(1), 1–18 (2017)
Garcia-Fidalgo, E., Ortiz, A.: iBoW-LCD: an appearance-based loop closure detection approach using incremental bags of binary words. IEEE Robot. Autom. Lett. 99, 3051–3057 (2018)
Ge, Z., Xiaoqiang, Y., Yangdong, Y.: Loop closure detection via maximization of mutual information. IEEE Access 7, 124217–124232 (2019)
Geiger, A., Lenz, P., Stiller, C., Urtasun, R.: Vision meets robotics: the KITTI dataset. Int. J. Robot. Res. 32(11), 1231–1237 (2013)
Van Gemert, J.C., Veenman, C.J., Smeulders, A.W., Geusebroek, J.M.: Visual word ambiguity. IEEE Trans. Pattern Anal. Mach. Intell. 32(7), 1271–1283 (2010)
Gray, R.M., Neuhoff, D.L.: Quantization. IEEE Trans. Inf. Theory 44(6), 2325–2383 (1998)
Guclu, O., Can, A.B.: Fast and effective loop closure detection to improve SLAM performance. J. Intell. Rob. Syst. 93(3), 495–517 (2019)
Gupta, A., Barbu, A.: Parameterized principal component analysis. Pattern Recogn. 78, 215–227 (2018)
Han, D., Li, Y., Song, T., Liu, Z.: Multi-objective optimization of loop closure detection parameters for indoor 2D simultaneous localization and mapping. Sensors 20(7), 1906 (2020)
Haosheng, C., Ge, Z., Yangdong, Y.: Semantic loop closure detection with instance-level inconsistency removal in dynamic industrial scenes. IEEE Trans. Ind. Inf. 17(3), 2030–2040 (2021)
Hervé, J., Matthijs, D., Cordelia, S.: Product quantization for nearest neighbor search. IEEE Trans. Pattern Anal. Mach. Intell. 33(1), 117–128 (2010)
Jégou, H., Douze, M., Schmid, C., Perez, P.: Aggregating local descriptors into a compact image representation. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3304–3311, July 2010
Klein, G., Murray, D.: Parallel tracking and mapping for small AR workspaces. In: IEEE International Symposium on Mixed and Augmented Reality (ISMAR), pp. 1–10 (2008)
Ksibi, S., Mejdoub, M., Amar, C.B.: Deep salient-Gaussian Fisher vector encoding of the spatio-temporal trajectory structures for person re-identification. Multimed. Tools Appl. 78(2), 1583–1611 (2018). https://doi.org/10.1007/s11042-018-6200-5
Kwon, H., Yousef, K.M.A., Kak, A.C.: Building 3D visual maps of interior space with a new hierarchical sensor fusion architecture. Robot. Auton. Syst. 61(8), 749–767 (2013)
Labbe, M., Michaud, F.: Appearance-based loop closure detection for online large-scale and long-term operation. IEEE Trans. Rob. 29(3), 734–745 (2013)
Latif, Y., Cadena, C., Neira, J.: Robust loop closing over time for pose graph SLAM. Int. J. Robot. Res. 32(14), 1611–1626 (2013)
Marr, D., Hildreth, E.: Theory of edge detection. In: Proceedings of the Royal Society of London, vol. 207, pp. 187–217 (1980)
Memon, A.R., Wang, H., Hussain, A.: Loop closure detection using supervised and unsupervised deep neural networks for monocular SLAM systems. Robot. Auton. Syst. 126, 103470 (2020)
Mur-Artal, R., Tardós, J.D.: Fast relocalisation and loop closing in keyframe-based slam. In: Proceedings of IEEE International Conference on Robotics and Automation (ICRA), pp. 846–853 (2014)
Perronnin, F., Dance, C.R.: Fisher kernels on visual vocabularies for image categorization. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–8 (2008)
Perronnin, F., Dance, C., Csurka, G., Bressan, M.: Adapted vocabularies for generic visual categorization. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 464–475. Springer, Heidelberg (2006). https://doi.org/10.1007/11744085_36
Perronnin, F., Yan, L., Sánchez, J., Poirier, H.: Large-scale image retrieval with compressed fisher vectors. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3384–3391 (2010)
Pire, T., Fischer, T., Civera, J., Cristóforis, P.D., Berlles, J.J.: Stereo parallel tracking and mapping for robot localization. In: Proceedings of International Conference on Intelligent Robots and Systems (ICIRS) (2015)
Rosten, E., Drummond, T.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006). https://doi.org/10.1007/11744023_34
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to sift or SURF. In: International Conference on Computer Vision, pp. 2564–2571 (2012)
Safarinejadian, B., Mozaffari, M.: A distributed averaging-based evidential expectation-maximization algorithm for density estimation in unreliable sensor networks. Measurement 165, 108–162 (2020)
Sánchez, J., Perronnin, F., Mensink, T., Verbeek, J.: Image classification with the fisher vector: theory and practice. Int. J. Comput. Vis. 105(3), 222–245 (2013)
Sünderhauf, N., et al.: Place recognition with convnet landmarks: viewpoint-robust, condition-robust, training-free. In: Robotics: Science and Systems, p. 296 (2015)
Tonellotto, N., Gotta, A., Nardini, F.M., Gadler, D., Silvestri, F.: Neural network quantization in federated learning at the edge. Inf. Sci. 575(4), 417–436 (2021)
Uchida, Y., Sakazawa, S.: Image retrieval with fisher vectors of binary features. In: Proceedings of IAPR Asian Conference on Pattern Recognition (ACPR), pp. 1–11 (2017)
Wang, J., Yang, J., Kai, Y., Lv, F., Huang, T.S., Gong, Y.: Locality-constrained linear coding for image classification. In: Computer Vision & Pattern Recognition, pp. 3360–3367 (2010)
Yan, X., Ye, Y., Qiu, X., Yu, H.: Synergetic information bottleneck for joint multi-view and ensemble clustering. Inf. Fusion 56, 15–27 (2020)
Yang, Y., Mémin, E.: Estimation of physical parameters under location uncertainty using an ensemble2-expectation-maximization algorithm. Q. J. R. Meteorol. Soc. 145(719), 418–433 (2019)
Younes, G., Asmar, D., Shammas, E., Zelek, J.: Keyframe-based monocular SLAM: design, survey, and future directions. Robot. Auton. Syst. 98, 67–88 (2017)
Zhou, W., Zhang, L., Gao, S., Lou, X.: Gradient-based feature extraction from raw Bayer pattern images. IEEE Trans. Image Process. 30, 5122–5137 (2021)
Zhu, Z., Xu, X., Liu, X., Jiang, Y.: LFM: a lightweight LCD algorithm based on feature matching between similar key frames. Sensors 21(13), 4499 (2021)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhang, G., Zuo, Q., Dang, H. (2022). Probability Loop Closure Detection with Fisher Kernel Framework for Visual SLAM. In: Wang, Y., Zhu, G., Han, Q., Wang, H., Song, X., Lu, Z. (eds) Data Science. ICPCSEE 2022. Communications in Computer and Information Science, vol 1628. Springer, Singapore. https://doi.org/10.1007/978-981-19-5194-7_17
Download citation
DOI: https://doi.org/10.1007/978-981-19-5194-7_17
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-5193-0
Online ISBN: 978-981-19-5194-7
eBook Packages: Computer ScienceComputer Science (R0)