Scene Classification Based on Local Binary Pattern and Improved Bag of Visual Words

Montazer, Gholam Ali; Giveki, Davar; Soltanshahi, Mohammad Ali

doi:10.1007/978-3-319-19258-1_21

Gholam Ali Montazer¹⁶,
Davar Giveki¹⁷ &
Mohammad Ali Soltanshahi¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9094))

Included in the following conference series:

International Work-Conference on Artificial Neural Networks

2064 Accesses

Abstract

Today, image classification is considered as one of the most important and challenging tasks in computer vision. This paper presents a new method for image classification using Bag Of Visual Words and Local Binary Patterns (LBP). The bag-of-visual-words (BoVW) model has been proven to be very efficient for image classification and image retrieval. However, most proposals directly use local features extracted from an image while ignoring hidden information that could be extracted from an image. To solve this problem, we propose a novel image classification method using information extracted from different channels of the image and the grayscale version of the image. In this way more discriminant information is extracted from the image and as a result the constructed BoVW model gives highly discriminative features that considerably increases the classification performance. In this work we embed features extracted using LBP into BoVW model to construct our proposed scene classification model. The choice of LBP as image feature descriptor is because of the fact that the content of most of the scene images contains textural information so extracting LBP features is a very wise choice compared to other popular image features like Scale Invariant Feature Transform (SIFT) that fails to capture image information in homogeneous areas or textual images. Experiments on Oliva and Torralba (OT) dataset demonstrate the effectiveness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Scene image classification based on visual words concatenation of local and global features

Article 27 September 2021

Image Classification Based on Modified BOW Model

Image Classification Using Spatial Difference Descriptor Under Spatial Pyramid Matching Framework

References

Henderson, J.: Introduction to real-world scene perception. Visual Cognition 12(3), 849–851 (2005)
Article Google Scholar
Heitz, G., Koller, D.: Learning spatial context: using stuff to find things. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 30–43. Springer, Heidelberg (2008)
Chapter Google Scholar
Oliva, A., Torralba, A.: Building the gist of a scene: the role of global image features in recognition. Progress in Brain Research: Visual Perception 155, 23–36 (2006)
Google Scholar
Chang, E., Kingshy, G., Sychay, G., Gang, W.: Content-based soft annotation for multimodal image retrieval using Bayes point machines. IEEE Transactions on Circuits and Systems for Video Technology 13(1), 6–38 (2003)
Article Google Scholar
Vailaya, A., Figueiredo, M., Jain, A., Zhang, H.J.: Content-based hierarchical classification of vacation images. In: IEEE International Conference on Multimedia Computing and Systems, vol. 1, pp. 518–523 (1999)
Google Scholar
Siagian, C., Itti, L.: Gist: a mobile robotics application of context-based vision in outdoor environment. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 3, pp. 1063–1069 (2005)
Google Scholar
Torralba, A., Oliva, A.: Statistics of natural image categories. Network: Computation in Neural Systems 14(3), 391–412 (2003)
Article Google Scholar
Vogel, J., Schiele, B.: A semantic typicality measure for natural scene categorization. In: Rasmussen, C.E., Bülthoff, H.H., Schölkopf, B., Giese, M.A. (eds.) DAGM 2004. LNCS, vol. 3175, pp. 195–203. Springer, Heidelberg (2004)
Chapter Google Scholar
Fergus, R., Fei-Fei, L., Perona, P., Zisserman, A.: Learning object categories from google’s image search. In: Tenth IEEE International Conference on Computer Vision, vol. 2, pp. 1816–1823 (2005)
Google Scholar
Agarwal, S., Awan, A., Roth, D.: Learning to detect objects in images via a sparse, part-based representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 26(11), 1475–1490 (2004)
Article Google Scholar
Sivic, J., Zisserman, A.: Video google: a text retrieval approach to object matching in videos. In: Ninth IEEE International Conference on Computer Vision, pp. 1470–1477 (2003)
Google Scholar
Fei-Fei, L., Perona, P.: A Bayesian hierarchical model for learning natural scene categories. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 524–531 (2005)
Google Scholar
Quelhas, P., Monay, F., Odobez, J.M., Gatica-Perez, D., Tuytelaars, T., Van Gool, L.: Modeling scenes with local descriptors and latent aspects. In: Tenth IEEE International Conference on Computer Vision, vol. 1, pp. 883–890 (2005)
Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2169–2178 (2006)
Google Scholar
Grauman, K., Darrell, T.: The pyramid match kernel: discriminative classification with sets of image features. In: Tenth IEEE International Conference on Computer Vision, vol. 2, pp. 1458–1465 (2005)
Google Scholar
Bosch, A., Munoz, X., Marti, R.: Which is the best way to organize/classify images by content? Image and Vision Computing 25(6), 778–791 (2007)
Article Google Scholar
Bosch, A., Zisserman, A., Muñoz, X.: Scene classification via pLSA. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 517–530. Springer, Heidelberg (2006)
Chapter Google Scholar
Csurka, G., Bray, C., Dance, C., Fan, L.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning, pp. 1–22. ECCV Computer Vision (2004)
Google Scholar
Opelt, A., Pinz, A., Zisserman, A.: A boundary-fragment-model for scene detection. In: European Conference on Computer Vision, vol. 2, pp. 575–588 (2006)
Google Scholar
Yu, J., Qin, Z., Wan, T., Zhang, X.: Feature integration analysis of bag-of-features model for image retrieval. Neurocomputing 120, 355–364 (2013)
Article Google Scholar
Penatti Otávio, A., Silva, B., Fernanda, B., Valle, E., Gouet-Brunet, V., Torres, R.da S.: Visual word spatial arrangement for image retrieval and classification. Pattern Recognition 47, 705–720 (2014)
Article Google Scholar
Zhang, S., Tian, Q., Hua, G., Huang, Q., Gao, W.: ScenePatchNet: Towards scalable and semantic image annotation and retrieval. Computer Vision and Image Understanding 118, 16–29 (2014)
Article Google Scholar
Zhang, H., Berg, A., Maire, M., Malik, J.: SVM-KNN: discriminative nearest neighbor classification for visual category recognition. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2126–2136 (2006)
Google Scholar
Leung, T., Malik, J.: Representing and recognizing the visual appearance of materials using three-dimensional textons. International Journal of Computer Vision 43(1), 29–44 (2001)
Article MATH Google Scholar
Lowe, D.G.: Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Article Google Scholar
Ojala, D., Pietikäinen, M., Mäenpää, T.: Multiresolution gray scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24, 971–987 (2002)
Article Google Scholar
Maenpaa, T.: The Local Binary Pattern Approach to Texture Analysis - Extensions and Applications. Oulu University Press (2003)
Google Scholar
Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. International Journal of Computer Vision 42(3), 145–175 (2001)
Article MATH Google Scholar
Zhang, J., Marszałek, M., Lazebnik, C., Schmid, S.: Local features and kernels for classification of texture and Scene categories: a comprehensive study. International Journal of Computer Vision 73(2), 213–238 (2007)
Article Google Scholar
Wang, H., Liang, W., Wu, X., Teng, P.: Scene image retrieval via re-ranking semantic and packed dense interest points. Neurocomputing 119, 65–73 (2013)
Article Google Scholar
Kim, J., Grauman, K.: Asymmetric region-to-image matching for comparing images with generic object categories. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2344–2351 (2010)
Google Scholar
Dai, D., Wut, T., Zhu, S.: Discovering scene categories by information projection and cluster sampling. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3455–3462 (2010)
Google Scholar
Wang, H., Teng, P., Liang, W.: Packed dense interest points for scene image retrieval. In: Sixth IEEE International Conference on Image and Graphics (ICIG), pp. 789–794 (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Technology Engineering, School of Engineering, Tarbiat Modares University, Tehran, Iran
Gholam Ali Montazer
Department of Information Technology, Iranian Research Institute for Information Science and Technology (Iran Doc), Tehran, Iran
Davar Giveki
Department of Computer Science, University of Tehran, Tehran, Iran
Mohammad Ali Soltanshahi

Authors

Gholam Ali Montazer
View author publications
You can also search for this author in PubMed Google Scholar
Davar Giveki
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Ali Soltanshahi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gholam Ali Montazer .

Editor information

Editors and Affiliations

University of Granada, Granada, Spain
Ignacio Rojas
University of Malaga, Malaga, Spain
Gonzalo Joya
Polytechnic University of Catalonia, Vilanova i la Geltrú, Spain
Andreu Catala

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Montazer, G.A., Giveki, D., Soltanshahi, M.A. (2015). Scene Classification Based on Local Binary Pattern and Improved Bag of Visual Words. In: Rojas, I., Joya, G., Catala, A. (eds) Advances in Computational Intelligence. IWANN 2015. Lecture Notes in Computer Science(), vol 9094. Springer, Cham. https://doi.org/10.1007/978-3-319-19258-1_21

Download citation

DOI: https://doi.org/10.1007/978-3-319-19258-1_21
Published: 06 June 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19257-4
Online ISBN: 978-3-319-19258-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics