Skip to main content

Kernel Visual Keyword Description for Object and Place Recognition

  • Conference paper
  • First Online:
Advances in Signal Processing and Intelligent Recognition Systems

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 425))

Abstract

The most important aspects in computer and mobile robotics are both visual object and place recognition; they have been used to tackle numerous applications via different techniques as established previously in the literature, however, combining the machine learning techniques for learning objects to obtain best possible recognition and as well as to obtain its image descriptors for describing the content of the image fully is considered as another vital way which can be used in computer vision. Thus, the ability of the system is to learn and describe the structural features of objects or places more effectively, which in turn; it leads to a correct recognition of objects. This paper introduces a method that uses Naive Base to combine the Kernel Principle Component (KPCA) features with HOG features from the visual scene. According to this approach, a set of SURF features and Histogram of Gradient (HOG) are extracted from a given image. The minimum Euclidean Distance between all SURF features is computed from the visual codebook which was constructed by K-means previously to be combined with HOG features. A classification method such as Support Vector Machine (SVM) was used for data analysis and the results indicate that KPCA with HOG method significantly outperforms bag of visual keyword (BOW) approach on Caltech-101 object dataset and IDOL visual place dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Wenyu, C., Wenzhi, X., Ru, Z.: Method of item recognition based on SIFT and SURF, Mathematical Structures in Computer Science 24(5) (2014)

    Google Scholar 

  2. Suaib, N.M., Marhaban, M.H., Saripan, M.I., Ahmad, S.A.: Performance evaluation of feature detection and feature matching for stereo visual odometry using SIFT and SURF. In: 2014 IEEE Region 10 Symposium, pp. 200–203 (2014)

    Google Scholar 

  3. Sivic, J., Zisserman, A.: Video google: a text retrieval approach to object matching in videos. In: ICCV 2003: Proceedings of the Ninth IEEE International Conference on Computer Vision, p. 1470 (2003)

    Google Scholar 

  4. Jiang, Y.-G., Ngo, C.-W., Yang, J.: Towards optimal bag-of-features for object categorization and semantic video retrieval. In: CIVR 2007: Proceedings of the 6th ACM International Conference on Image and Video Retrieval, pp. 494–501 (2007)

    Google Scholar 

  5. Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: improving particular object retrieval in large scale image databases. In: Proceedings of the IEEE Conference on Computer Visionand Pattern Recognition (2008)

    Google Scholar 

  6. Huang, J., Kumar, S.R., Mitra, M., Zhu, W.J., Zabih, R.: Image indexingusing color correlograms. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p. 762 (1997)

    Google Scholar 

  7. Gandhali, P.S., Debasis, M.: Correlogram Method for Comparing Bio-Sequences. Technical Report FIT-CS-2006-01, Master’s Thesis, Florida Institute of Technology (2006)

    Google Scholar 

  8. Csurka, G., Dance, C., Fan, L., Bray, C.: Visual categorization with bag of keypoints. In: The 8th European Conference on Computer Vision, pp. 513–516 (2004)

    Google Scholar 

  9. Perronnin, F., Dance, C., Csurka, G., Bressan, M.: Adapted vocabulariesfor generic visual categorization. In: European Conference on Computer Vision (ECCV 2006), pp. 464–475 (2006)

    Google Scholar 

  10. Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Computing Surveys 31(3), 264–323 (1999)

    Article  Google Scholar 

  11. Forstner, W., Moonen, B.: A metric for covariance matrices. Technical report, Dept. of Geodesy and Geoinformatics, Stuttgart University (1999)

    Google Scholar 

  12. Tian, J., Qiuxia, H., Xiaoyi, M., Mingyu, H.: An Improved KPCA/GA-SVM Classification Model for Plant Leaf Disease Recognition. Journal of Computational Information Systems 8(18), 7737–7745 (2012)

    Google Scholar 

  13. Schölkopf, B., Smola, A.J., Müller, K.-R.: Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation 5, 1299–1319 (1998)

    Article  Google Scholar 

  14. Baudat, G., Anouar, F.: Generalized Discriminant Analysis Using a Kernel Approach. Neural Computation 12(10), 2385–2404 (2000)

    Article  Google Scholar 

  15. Artač, M., Jogan, M., Leonardis, A.: Mobile robot localization using an incremental eigenspace model. In: IEEE International Conferenceon Robotics and Automation, Washington, D.C., pp. 1025–1030 (2002)

    Google Scholar 

  16. Dzati, A.R., Salwani, I., Haryati, J.: Robust palm print verification system based on evolution kernel principal component analysis. In: IEEE International Conference on Control System, Computing and Engineering 2014 (ICCSCE 2014) (2014)

    Google Scholar 

  17. Jogan, M., Leonardis, A., Wildenauer, H., Bischof, H.: Mobile robot localization under varying illumination. In: The 16th International on Pattern Recognition, pp. 2385–2404 (2000)

    Google Scholar 

  18. Kröse, B., Bunschoten, R.: Probabilistic localization by appearance models and active vision. In: Proceedings of the IEEE International Conference on Robotics and Automation, pp. 2255–2260 (1999)

    Google Scholar 

  19. Hong, M.L., Dong, M.Z., Ren, C.N., Xiang, L., Hai, Y.D.: Face Recognition Using KPCA and KFDA. AMM, pp. 380–384:3850–3853 (2013)

    Google Scholar 

  20. Sim, R., Dudek, G.: Learning landmarks for robot localization. In: Proceedings of the National Conference on Artificial Intelligence SIGART/AAAI Doctoral Consortium, Austin, TX, SIGART/AAAI, pp. 1110–1111. AAAI Press (2000)

    Google Scholar 

  21. Phiwmal, N., Sanguansat, P.: An Improved Feature Extraction and Combination of Multiple Classifiers for Query-by-Humming. The International Arab Journal of Information and Technology 11(1) 103–110 (2014)

    Google Scholar 

  22. Bay, H., Tuytelaars, T., Van Gool, L.: Speeded up robust features. ETH Zurich, Katholieke Universiteit Leuven, vol. 3951, pp 404–417. Springer, Heidelberg (2006)

    Google Scholar 

  23. Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proc. of CVPR 2006 (2006)

    Google Scholar 

  24. Suvi, T., Kai, N., Mikko, T., Antti, K., Tapio, S.: ECG-derived respiration methods: Adapted ICA and PCA. Medical Engineering & Physics (2015)

    Google Scholar 

  25. Vipsita, S., Shee, B.K., Rath, S.K.: Protein superfamily classification using kernel principal component analysis and probabilistic neural networks. In: 2011 Annual IEEE India Conference (INDICON) (2011)

    Google Scholar 

  26. Pronobis, A., Caputo, B., Jensfelt, P., Christensen, I.: A realistic benchmark for visual indoor place recognition. Robotics and Autonomous System 58(1), 81–96 (2009)

    Article  Google Scholar 

  27. Lu, L., Jianhua, Y., Evrim, T., Ronald, M.S.: Multilevel Image Recognition using Discriminative Patches and Kernel Covariance, SPIE Medical Imaging (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tarik A. Rashid .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Ali, A.M., Rashid, T.A. (2016). Kernel Visual Keyword Description for Object and Place Recognition. In: Thampi, S., Bandyopadhyay, S., Krishnan, S., Li, KC., Mosin, S., Ma, M. (eds) Advances in Signal Processing and Intelligent Recognition Systems. Advances in Intelligent Systems and Computing, vol 425. Springer, Cham. https://doi.org/10.1007/978-3-319-28658-7_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-28658-7_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-28656-3

  • Online ISBN: 978-3-319-28658-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics