Abstract
The success of visual loop closure detection depends on the discrimination ability of the image descriptions. Different sources of image descriptions may carry complementary information as well as redundant information. Though integrating them properly can be beneficial, a main obstacle is the lack of analytical quality indicators to weigh different descriptions jointly. Inspired by the linear discriminant analysis, we propose an efficacy index to evaluate the weighted linear combinations of multiple image descriptions for loop closure detection. When a collection of image descriptions is given, the optimal weights maximizing the efficacy index are deduced analytically. As negative weights may negatively affect the performance of detection, a gradient descent algorithm is further proposed to jointly optimize the nonnegative weights. We use the proposed weighting strategies to combine the image descriptions extracted from multiple local image patches by multiple descriptor extractors. It is experimentally demonstrated that our weighted combinations of image descriptions can greatly improve the performance of loop closure detection by emphasizing informative components and de-emphasizing redundant components.
Similar content being viewed by others
References
Bay, H, Ess, A, Tuytelaars, T, Van Gool, L: Speeded-up robust features (surf). Comput. Vis. Image Underst. 110(3), 346–359 (2008)
Lowe, D G: Distinctive image features from scale-invariant keypoints. Int. J Comput. Vis. 60(2), 91–110 (2004)
Sivic, J, Zisserman, A, et al.: Video google: A text retrieval approach to object matching in videos. Iccv 2, 1470–1477 (2003)
Datar, M, Immorlica, N, Indyk, P, Mirrokni, VS: Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of the Twentieth Annual Symposium on Computational Geometry, pp. 253–262. ACM (2004)
Angeli, A, Filliat, D, Doncieux, S, Meyer, J.-A.: Fast and incremental method for loop-closure detection using bags of visual words. IEEE Trans. Robot. 24(5), 1027–1037 (2008)
Shahbazi, H., Zhang, H: Application of locality sensitive hashing to realtime loop closure detection. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1228–1233. IEEE (2011)
Cummins, M, Newman, P : Fab-map: Probabilistic localization and mapping in the space of appearance. Int. J. Robot. Res. 27(6), 647–665 (2008)
Milford, M.J, Wyeth, G.F: Seqslam: Visual route-based navigation for sunny summer days and stormy winter nights. In: 2012 IEEE International Conference on Robotics and Automation (ICRA), pp. 1643–1649. IEEE (2012)
Engel, J, Schöps, T, Cremers, D: Lsd-slam: Large-scale direct monocular slam. In: European Conference on Computer Vision, pp. 834–849. Springer (2014)
Mur-Artal, R, Montiel, J M M, Tardos, J D: Orb-slam: A versatile and accurate monocular slam system. IEEE Trans. Robot. 31(5), 1147–1163 (2015)
Valgren, C, Lilienthal, A J: Sift, surf and seasons: Long-term outdoor localization using local features. In: European Conference on Mobile Robots (ECMR), pp. 253–258 (2007)
Oliva, A, Torralba, A.: Modeling the shape of the scene: A holistic representation of the spatial envelope. Int. J. Comput. Vis. 42(3), 145–175 (2001)
Calonder, M, Lepetit, V, Ozuysal, M, Trzcinski, T, Strecha, C, Fua, P: Brief: Computing a local binary descriptor very fast. IEEE Trans. Pattern Anal. Mach. Intell. 34(7), 1281–1298 (2012)
Ojala, T, Pietikainen, M, Maenpaa, T: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 971–987 (2002)
Rublee, E, Rabaud, V, Konolige, K, Bradski, G: Orb: An efficient alternative to sift or surf. In: 2011 IEEE International conference on computer vision (ICCV), pp. 2564–2571. IEEE (2011)
Liu, Y, Zhang, H: Visual loop closure detection with a compact image descriptor. In: 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1051–1056. IEEE (2012)
Sünderhauf, N, Protzel, P: Brief-gist: Closing the loop by simple means. In: 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1234–1241. IEEE (2011)
Campos, F.M., Correia, L, Calado, J.M.F: Loop closure detection with a holistic image feature. In: Portuguese Conference on Artificial Intelligence, pp. 247–258. Springer (2013)
Arroyo, R, Alcantarilla, P F, Bergasa, L M, Javier Yebes, J, Gámez, S: Bidirectional loop closure detection on panoramas for visual navigation. In: 2014 Intelligent Vehicles Symposium Proceedings IEEE, pp. 1378–1383. IEEE (2014)
Perronnin, F, Dance, C R: Fisher kernels on visual vocabularies for image categorization. Comput Vis. Pattern Recogn. 1–8 (2007)
Jegou, H, Douze, M, Schmid, C, Perez, P: Aggregating local descriptors into a compact image representation. Comput. Vis. Pattern Recog.,3304–3311 (2010)
Arandjelovic, R, Zisserman, A: All about vlad. Comput. Vis. Pattern Recog., 1578–1585 (2013)
Yi, H, Zhang, H, Zhou, S: Convolutional neural network-based image representation for visual loop closure detection. In: 2015 IEEE International Conference on Information and Automation, pp. 2238–2245. IEEE (2015)
Sünderhauf, N, Shirazi, S, Dayoub, F, Upcroft, B, Milford, M: On the performance of convnet features for place recognition. In: 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 4297–4304. IEEE (2015)
Kittler, J, Hatef, M, Duin, R PW, Matas, J: On combining classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 20(3), 226–239 (1998)
Campos, F M, Correia, L, Calado, J.M.F: Robot visual localization through local feature fusion: An evaluation of multiple classifiers combination approaches. J. Intell. Robot. Syst. 77(2), 377–390 (2015)
Peng, H, Long, F, Ding, C: Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 27(8), 1226–1238 (2005)
Levner, I, Zhang, H: Classification-driven watershed segmentation. IEEE Trans. Image Process. 16(5), 1437–1445 (2007)
Li, W, Mao, K, Zhang, H, Chai, T: Selection of gabor filters for improved texture feature extraction. In: 2010 17th IEEE International conference on Image Processing (ICIP), pp. 361–364. IEEE (2010)
Guyon, I, Elisseeff, A: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)
Quanquan, G, Li, Z, Han, J: Generalized fisher score for feature selection. arXiv:1202.3725 (2012)
Fisher, R A: The use of multiple measurements in taxonomic problems. Ann. Eugenics 7(2), 179–188 (1936)
Duda, R O, Hart, P E, Stork, D G: Pattern Classification. Wiley (2012)
Li, Q, Ke, L, You, X, Shuhui, B, Liu, Z: Place recognition based on deep feature and adaptive weighting of similarity matrix. Neurocomputing 199, 114–127 (2016)
Tola, E, Lepetit, V, Fua, P: Daisy: An efficient dense descriptor applied to wide-baseline stereo. IEEE Trans. Pattern Anal. Mach. Intell. 32(5), 815–830 (2010)
Lazebnik, S, Schmid, C, Ponce, J: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2169–2178. IEEE (2006)
The Norwegian Broadcasting Corporation: The Nordlandsbanen Dataset. http://nrkbeta.no/2013/01/15/nordlandsbanen-minute-by-minute-season-by-season/ (2013)
Glover, AJ, Maddern, WP, Milford, MJ, Wyeth, GF: Fab-map+ ratslam: Appearance-based slam for multiple times of day. In: 2010 IEEE International Conference on Robotics and Automation (ICRA), pp. 3507–3512. IEEE (2010)
Davis, J, Goadrich, M: The relationship between precision-recall and roc curves. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 233–240. ACM (2006)
Wang, X, Zhang, H, Peng, G: A chordiogram image descriptor using local edgels. J. Vis. Commun. Represent. 49, 129–140 (2017)
Toshev, A, Taskar, B, Daniilidis, K: Shape-based object detection via boundary structure segmentation. Int. J. Comput. Vis. 99(2), 123–146 (2012)
Ce, L, Yuen, J, Torralba, A: Sift flow: Dense correspondence across scenes and its applications. IEEE Trans. Pattern Anal. Mach. Intell. 33(5), 978–994 (2011)
Boyd, S, Vandenberghe, L: Convex Optimization. Cambridge University Press (2004)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Wang, X., Peng, G. & Zhang, H. Combining Multiple Image Descriptions for Loop Closure Detection. J Intell Robot Syst 92, 565–585 (2018). https://doi.org/10.1007/s10846-017-0755-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10846-017-0755-7