Abstract
This paper presents a novel appearance-based technique for topological robot localization and place recognition. A vocabulary of visual words is formed automatically, representing local features that frequently occur in the set of training images. Using the vocabulary, a spatial pyramid representation is built for each image by repeatedly subdividing it and computing histograms of visual words at increasingly fine resolutions. An information maximization technique is then applied to build a hierarchical classifier for each class by learning informative features. While top-level features in the hierarchy are selected from the coarsest resolution of the representation, capturing the holistic statistical properties of the images, child features are selected from finer resolutions, encoding more local characteristics, redundant with the information coded by their parents. Exploiting the redundancy in the data enables the localization system to achieve greater reliability against dynamic variations in the environment. Achieving an average classification accuracy of 88.9% on a challenging topological localization database, consisting of twenty seven outdoor places, demonstrates the advantages of our hierarchical framework for dealing with dynamic variations that cannot be learned during training.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Brank, J., Grobelnik, M., Milic-Frayling, N., Mladenic, D.: Interaction of feature selection methods and linear classification models. In: Proceedings of the International Conference on Machine Learning, Workshop on Text Learning (2002)
Chang, C., Lin, C.: LIBSVM: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm (2001)
Dance, C., Willamowski, J., Fan, L., Bray, C., Csurka, G.: Visual categorization with bags of keypoints. In: Proceedings of the European Conference on Computer Vision, International Workshop on Statistical Learning in Computer Vision (2004)
Epshtein, B., Ullman, S.: Feature hierarchies for object classification. In: Proceedings of the International Conference on Computer Vision (2005)
Fazl-Ersi, E., Elder, J.H., Tsotsos, J.K.: Hierarchical appearance based classifiers for qualitative spatial localization. In: Proceedings of the International Conference on Intelligent Robots and Systems, pp. 3987–3992 (2009)
Fei-Fei, L., Perona, P.: A bayesian hierarchical model for learning natural scene categories. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 524–531 (2005)
Freund, Y., Schapire, R.: A decision-theoretic generalization of on-line learning and an application to boosting. In: Proceedings of the Second European Conference on Computational Learning Theory, pp. 23–37 (1995)
Friedman, S., Hanna, P., Fox, D.: Voronoi random fields: extracting topological structure of indoor environments via place labeling. In: Proceedings of the International Joint Conference on Artificial Intelligence, pp. 2109–2114 (2007)
Galindo, C., Saffiotti, A., Coradeschi, S., Buschka, P., Fernandez-Madrigal, J.: Multi-hierarchical semantic maps for mobile robotics. In: Proceedings of the International Conference on Intelligent Robots and Systems, pp. 2278–2283 (2005)
Jurie, F., Triggs, B.: Creating efficient codebooks for visual recognition. In: Proceedings of the International Conference on Computer Vision, pp. 604–610 (2005)
Lazebnik, S., Raginsky, M.: Supervised learning of quantizer codebooks by information loss minimization. IEEE Trans. Pattern Anal. Mach. Intell. 31(7), 1294–1309 (2009)
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 2169–2178 (2006)
Leibe, B.: Interleaved object categorization and segmentation. PhD Thesis, ETH, Zurich (2004)
Liu, C., Yuen, J., Torralba, A., Sivic, J., Freeman, W.: SIFT Flow: dense correspondence across difference scenes. In: European Conference on Computer Vision (2008)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Mladenic, D., Brank, J., Grobelnik, M., Milic-Frayling, N.: Feature selection using linear classifier weights: interaction with classification models. In: Proceedings of the Annual ACM Conference on Research and Development in Information Retrieval, pp. 234–241 (2004)
Martínez-Mozos, O., Stachniss, C., Burgard, W.: Supervised learning of places from range data using Adaboost. In: Proceedings of the International Conference on Robotics and Automation (2005)
Martínez-Mozos, O., Burgard, W.: Supervised learning of topological maps using semantic information extracted from range data. In: Proceedings of the International Conference on Intelligent Robots and Systems, pp. 2772–2777 (2006)
Nowak, E., Jurie, F.: Vehicle categorization: parts for speed and accuracy. In: Proceedings of the International Conference on Computer Vision, VS-PETS workshop (2005)
Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. Int. J. Comput. Vis. 42(3), 145–175 (2001)
Pronobis, A., Caputo, B., Jesfelt, P., Christensen, H.I.: A discriminative approach to robust visual place recognition. In: Proceeding of the International Conference on Robots and Systems, pp. 3829–3836 (2006)
Pronobis, A., Caputo, B.: COLD: COsy localization database. Int. J. Rob. Res. 28(5) (2009)
Ranganathan, A.: PLISS: detecting and labeling places using online change-point detection. In: Proceedings of the Robotics: Science and Systems (2010)
Rottmann, A., Martínez-Mozos, O., Stachniss, C., Burgard, W.: Semantic place classification of indoor environments with mobile robots using boosting. In: Proceedings of the National Conference on Artificial Intelligence, pp. 1306–1311 (2005)
Siagian, C., Itti, L.: Rapid biologically-inspired scene classification using features shared with visual attention. EEE Trans. Pattern Anal. Mach. Intell. 29(2), 300–312 (2007)
Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: Proceedings of the International Conference on Computer Vision (2003)
Torralba, A., Murphy, K.P., Freeman, W.T., Rubin, M.A.: Context-based vision system for place and object recognition. In: Proccedings of the IEEE International Conference on Computer Vision, p. 273 (2003)
Tsotsos, J., Verghese, G., Dickinson, S., Jenkin, M., Jepson, A., Milios, E., Nu?o, F., Stevenson, S., Black, M., Metaxas, D., Culhane, S., Ye, Y., Mann, R.: PLAYBOT: a visually guided robot to assist physically disabled children in play. Image Vis. Comput.: Special Issue on Vision for the Disabled 16(4), 275–292 (1998)
Ullah, M., Pronobis, A., Caputo, B., Luo, J., Jensfelt, P., Christensen, H.: Towards robust place recognition for robot localization. In: Proceedings of the International Conference on Robotics and Automation (2008)
Vapnik, V.: Statistical Learning Theory. Wiley, New York (1998)
Vasudevan, S., Gachter, S., Nguyen, V.T., Siegwart, R.: Cognitive maps for mobile robots—an object based approach. Robot. Auton. Syst. 55(5), 359–371 (2007)
Vidal-Naquet, M., Ullman, S.: Object recognition with informative features and linear classification. In: Proceedings of the International Conference on Computer Vision, pp. 281–288 (2003)
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 511–518 (2001)
Wu, J., Rehg, J.M.: CENTRIST: a visual descriptor for scene categorization. IEEE Trans. Pattern Anal. Mach. Intell. 33(8), 1489–1501 (2011)
Zender, H., Martínez-Mozos, O., Jensfeltc, P., Kruijffa, G., Burgard, W.: Conceptual spatial representations for indoor mobile robots. Robot. Auton. Syst. 56(6), 493–502 (2008)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Fazl-Ersi, E., Elder, J.H. & Tsotsos, J.K. Hierarchical Classifiers for Robust Topological Robot Localization. J Intell Robot Syst 68, 147–163 (2012). https://doi.org/10.1007/s10846-012-9671-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10846-012-9671-z