Kernel Visual Keyword Description for Object and Place Recognition

Ali, Abbas M.; Rashid, Tarik A.

doi:10.1007/978-3-319-28658-7_3

Abbas M. Ali⁸ &
Tarik A. Rashid⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 425))

1681 Accesses
1 Citations

Abstract

The most important aspects in computer and mobile robotics are both visual object and place recognition; they have been used to tackle numerous applications via different techniques as established previously in the literature, however, combining the machine learning techniques for learning objects to obtain best possible recognition and as well as to obtain its image descriptors for describing the content of the image fully is considered as another vital way which can be used in computer vision. Thus, the ability of the system is to learn and describe the structural features of objects or places more effectively, which in turn; it leads to a correct recognition of objects. This paper introduces a method that uses Naive Base to combine the Kernel Principle Component (KPCA) features with HOG features from the visual scene. According to this approach, a set of SURF features and Histogram of Gradient (HOG) are extracted from a given image. The minimum Euclidean Distance between all SURF features is computed from the visual codebook which was constructed by K-means previously to be combined with HOG features. A classification method such as Support Vector Machine (SVM) was used for data analysis and the results indicate that KPCA with HOG method significantly outperforms bag of visual keyword (BOW) approach on Caltech-101 object dataset and IDOL visual place dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Wenyu, C., Wenzhi, X., Ru, Z.: Method of item recognition based on SIFT and SURF, Mathematical Structures in Computer Science 24(5) (2014)
Google Scholar
Suaib, N.M., Marhaban, M.H., Saripan, M.I., Ahmad, S.A.: Performance evaluation of feature detection and feature matching for stereo visual odometry using SIFT and SURF. In: 2014 IEEE Region 10 Symposium, pp. 200–203 (2014)
Google Scholar
Sivic, J., Zisserman, A.: Video google: a text retrieval approach to object matching in videos. In: ICCV 2003: Proceedings of the Ninth IEEE International Conference on Computer Vision, p. 1470 (2003)
Google Scholar
Jiang, Y.-G., Ngo, C.-W., Yang, J.: Towards optimal bag-of-features for object categorization and semantic video retrieval. In: CIVR 2007: Proceedings of the 6th ACM International Conference on Image and Video Retrieval, pp. 494–501 (2007)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: improving particular object retrieval in large scale image databases. In: Proceedings of the IEEE Conference on Computer Visionand Pattern Recognition (2008)
Google Scholar
Huang, J., Kumar, S.R., Mitra, M., Zhu, W.J., Zabih, R.: Image indexingusing color correlograms. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p. 762 (1997)
Google Scholar
Gandhali, P.S., Debasis, M.: Correlogram Method for Comparing Bio-Sequences. Technical Report FIT-CS-2006-01, Master’s Thesis, Florida Institute of Technology (2006)
Google Scholar
Csurka, G., Dance, C., Fan, L., Bray, C.: Visual categorization with bag of keypoints. In: The 8th European Conference on Computer Vision, pp. 513–516 (2004)
Google Scholar
Perronnin, F., Dance, C., Csurka, G., Bressan, M.: Adapted vocabulariesfor generic visual categorization. In: European Conference on Computer Vision (ECCV 2006), pp. 464–475 (2006)
Google Scholar
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Computing Surveys 31(3), 264–323 (1999)
Article Google Scholar
Forstner, W., Moonen, B.: A metric for covariance matrices. Technical report, Dept. of Geodesy and Geoinformatics, Stuttgart University (1999)
Google Scholar
Tian, J., Qiuxia, H., Xiaoyi, M., Mingyu, H.: An Improved KPCA/GA-SVM Classification Model for Plant Leaf Disease Recognition. Journal of Computational Information Systems 8(18), 7737–7745 (2012)
Google Scholar
Schölkopf, B., Smola, A.J., Müller, K.-R.: Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation 5, 1299–1319 (1998)
Article Google Scholar
Baudat, G., Anouar, F.: Generalized Discriminant Analysis Using a Kernel Approach. Neural Computation 12(10), 2385–2404 (2000)
Article Google Scholar
Artač, M., Jogan, M., Leonardis, A.: Mobile robot localization using an incremental eigenspace model. In: IEEE International Conferenceon Robotics and Automation, Washington, D.C., pp. 1025–1030 (2002)
Google Scholar
Dzati, A.R., Salwani, I., Haryati, J.: Robust palm print verification system based on evolution kernel principal component analysis. In: IEEE International Conference on Control System, Computing and Engineering 2014 (ICCSCE 2014) (2014)
Google Scholar
Jogan, M., Leonardis, A., Wildenauer, H., Bischof, H.: Mobile robot localization under varying illumination. In: The 16th International on Pattern Recognition, pp. 2385–2404 (2000)
Google Scholar
Kröse, B., Bunschoten, R.: Probabilistic localization by appearance models and active vision. In: Proceedings of the IEEE International Conference on Robotics and Automation, pp. 2255–2260 (1999)
Google Scholar
Hong, M.L., Dong, M.Z., Ren, C.N., Xiang, L., Hai, Y.D.: Face Recognition Using KPCA and KFDA. AMM, pp. 380–384:3850–3853 (2013)
Google Scholar
Sim, R., Dudek, G.: Learning landmarks for robot localization. In: Proceedings of the National Conference on Artificial Intelligence SIGART/AAAI Doctoral Consortium, Austin, TX, SIGART/AAAI, pp. 1110–1111. AAAI Press (2000)
Google Scholar
Phiwmal, N., Sanguansat, P.: An Improved Feature Extraction and Combination of Multiple Classifiers for Query-by-Humming. The International Arab Journal of Information and Technology 11(1) 103–110 (2014)
Google Scholar
Bay, H., Tuytelaars, T., Van Gool, L.: Speeded up robust features. ETH Zurich, Katholieke Universiteit Leuven, vol. 3951, pp 404–417. Springer, Heidelberg (2006)
Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proc. of CVPR 2006 (2006)
Google Scholar
Suvi, T., Kai, N., Mikko, T., Antti, K., Tapio, S.: ECG-derived respiration methods: Adapted ICA and PCA. Medical Engineering & Physics (2015)
Google Scholar
Vipsita, S., Shee, B.K., Rath, S.K.: Protein superfamily classification using kernel principal component analysis and probabilistic neural networks. In: 2011 Annual IEEE India Conference (INDICON) (2011)
Google Scholar
Pronobis, A., Caputo, B., Jensfelt, P., Christensen, I.: A realistic benchmark for visual indoor place recognition. Robotics and Autonomous System 58(1), 81–96 (2009)
Article Google Scholar
Lu, L., Jianhua, Y., Evrim, T., Ronald, M.S.: Multilevel Image Recognition using Discriminative Patches and Kernel Covariance, SPIE Medical Imaging (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

Software Engineering Department, College of Engineering, Salahuddin University, Erbil, Kurdistan, Iraq
Abbas M. Ali & Tarik A. Rashid

Authors

Abbas M. Ali
View author publications
You can also search for this author in PubMed Google Scholar
Tarik A. Rashid
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tarik A. Rashid .

Editor information

Editors and Affiliations

and Management – Kerala (IIITM-K), Indian Inst. of Information Technology, Trivandrum, Kerala, India
Sabu M. Thampi
Machine Intelligence Unit, Indian Statistical Institute, Kolkata, West Bengal, India
Sanghamitra Bandyopadhyay
Department of Electrical, Ryerson University, Toronto, Ontario, Canada
Sri Krishnan
Providence University, Taichung, Taiwan
Kuan-Ching Li
Computer Engineering Department, Vladimir State University, Vladimir Region, Russia
Sergey Mosin
School of Electrical, Nanyang Technological University, Singapore, Singapore
Maode Ma

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ali, A.M., Rashid, T.A. (2016). Kernel Visual Keyword Description for Object and Place Recognition. In: Thampi, S., Bandyopadhyay, S., Krishnan, S., Li, KC., Mosin, S., Ma, M. (eds) Advances in Signal Processing and Intelligent Recognition Systems. Advances in Intelligent Systems and Computing, vol 425. Springer, Cham. https://doi.org/10.1007/978-3-319-28658-7_3

Download citation

DOI: https://doi.org/10.1007/978-3-319-28658-7_3
Published: 25 December 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28656-3
Online ISBN: 978-3-319-28658-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics