Abstract
Today, image classification is considered as one of the most important and challenging tasks in computer vision. This paper presents a new method for image classification using Bag Of Visual Words and Local Binary Patterns (LBP). The bag-of-visual-words (BoVW) model has been proven to be very efficient for image classification and image retrieval. However, most proposals directly use local features extracted from an image while ignoring hidden information that could be extracted from an image. To solve this problem, we propose a novel image classification method using information extracted from different channels of the image and the grayscale version of the image. In this way more discriminant information is extracted from the image and as a result the constructed BoVW model gives highly discriminative features that considerably increases the classification performance. In this work we embed features extracted using LBP into BoVW model to construct our proposed scene classification model. The choice of LBP as image feature descriptor is because of the fact that the content of most of the scene images contains textural information so extracting LBP features is a very wise choice compared to other popular image features like Scale Invariant Feature Transform (SIFT) that fails to capture image information in homogeneous areas or textual images. Experiments on Oliva and Torralba (OT) dataset demonstrate the effectiveness of the proposed method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Henderson, J.: Introduction to real-world scene perception. Visual Cognition 12(3), 849–851 (2005)
Heitz, G., Koller, D.: Learning spatial context: using stuff to find things. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 30–43. Springer, Heidelberg (2008)
Oliva, A., Torralba, A.: Building the gist of a scene: the role of global image features in recognition. Progress in Brain Research: Visual Perception 155, 23–36 (2006)
Chang, E., Kingshy, G., Sychay, G., Gang, W.: Content-based soft annotation for multimodal image retrieval using Bayes point machines. IEEE Transactions on Circuits and Systems for Video Technology 13(1), 6–38 (2003)
Vailaya, A., Figueiredo, M., Jain, A., Zhang, H.J.: Content-based hierarchical classification of vacation images. In: IEEE International Conference on Multimedia Computing and Systems, vol. 1, pp. 518–523 (1999)
Siagian, C., Itti, L.: Gist: a mobile robotics application of context-based vision in outdoor environment. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 3, pp. 1063–1069 (2005)
Torralba, A., Oliva, A.: Statistics of natural image categories. Network: Computation in Neural Systems 14(3), 391–412 (2003)
Vogel, J., Schiele, B.: A semantic typicality measure for natural scene categorization. In: Rasmussen, C.E., Bülthoff, H.H., Schölkopf, B., Giese, M.A. (eds.) DAGM 2004. LNCS, vol. 3175, pp. 195–203. Springer, Heidelberg (2004)
Fergus, R., Fei-Fei, L., Perona, P., Zisserman, A.: Learning object categories from google’s image search. In: Tenth IEEE International Conference on Computer Vision, vol. 2, pp. 1816–1823 (2005)
Agarwal, S., Awan, A., Roth, D.: Learning to detect objects in images via a sparse, part-based representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 26(11), 1475–1490 (2004)
Sivic, J., Zisserman, A.: Video google: a text retrieval approach to object matching in videos. In: Ninth IEEE International Conference on Computer Vision, pp. 1470–1477 (2003)
Fei-Fei, L., Perona, P.: A Bayesian hierarchical model for learning natural scene categories. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 524–531 (2005)
Quelhas, P., Monay, F., Odobez, J.M., Gatica-Perez, D., Tuytelaars, T., Van Gool, L.: Modeling scenes with local descriptors and latent aspects. In: Tenth IEEE International Conference on Computer Vision, vol. 1, pp. 883–890 (2005)
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2169–2178 (2006)
Grauman, K., Darrell, T.: The pyramid match kernel: discriminative classification with sets of image features. In: Tenth IEEE International Conference on Computer Vision, vol. 2, pp. 1458–1465 (2005)
Bosch, A., Munoz, X., Marti, R.: Which is the best way to organize/classify images by content? Image and Vision Computing 25(6), 778–791 (2007)
Bosch, A., Zisserman, A., Muñoz, X.: Scene classification via pLSA. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 517–530. Springer, Heidelberg (2006)
Csurka, G., Bray, C., Dance, C., Fan, L.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning, pp. 1–22. ECCV Computer Vision (2004)
Opelt, A., Pinz, A., Zisserman, A.: A boundary-fragment-model for scene detection. In: European Conference on Computer Vision, vol. 2, pp. 575–588 (2006)
Yu, J., Qin, Z., Wan, T., Zhang, X.: Feature integration analysis of bag-of-features model for image retrieval. Neurocomputing 120, 355–364 (2013)
Penatti Otávio, A., Silva, B., Fernanda, B., Valle, E., Gouet-Brunet, V., Torres, R.da S.: Visual word spatial arrangement for image retrieval and classification. Pattern Recognition 47, 705–720 (2014)
Zhang, S., Tian, Q., Hua, G., Huang, Q., Gao, W.: ScenePatchNet: Towards scalable and semantic image annotation and retrieval. Computer Vision and Image Understanding 118, 16–29 (2014)
Zhang, H., Berg, A., Maire, M., Malik, J.: SVM-KNN: discriminative nearest neighbor classification for visual category recognition. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2126–2136 (2006)
Leung, T., Malik, J.: Representing and recognizing the visual appearance of materials using three-dimensional textons. International Journal of Computer Vision 43(1), 29–44 (2001)
Lowe, D.G.: Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Ojala, D., Pietikäinen, M., Mäenpää, T.: Multiresolution gray scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24, 971–987 (2002)
Maenpaa, T.: The Local Binary Pattern Approach to Texture Analysis - Extensions and Applications. Oulu University Press (2003)
Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. International Journal of Computer Vision 42(3), 145–175 (2001)
Zhang, J., Marszałek, M., Lazebnik, C., Schmid, S.: Local features and kernels for classification of texture and Scene categories: a comprehensive study. International Journal of Computer Vision 73(2), 213–238 (2007)
Wang, H., Liang, W., Wu, X., Teng, P.: Scene image retrieval via re-ranking semantic and packed dense interest points. Neurocomputing 119, 65–73 (2013)
Kim, J., Grauman, K.: Asymmetric region-to-image matching for comparing images with generic object categories. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2344–2351 (2010)
Dai, D., Wut, T., Zhu, S.: Discovering scene categories by information projection and cluster sampling. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3455–3462 (2010)
Wang, H., Teng, P., Liang, W.: Packed dense interest points for scene image retrieval. In: Sixth IEEE International Conference on Image and Graphics (ICIG), pp. 789–794 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Montazer, G.A., Giveki, D., Soltanshahi, M.A. (2015). Scene Classification Based on Local Binary Pattern and Improved Bag of Visual Words. In: Rojas, I., Joya, G., Catala, A. (eds) Advances in Computational Intelligence. IWANN 2015. Lecture Notes in Computer Science(), vol 9094. Springer, Cham. https://doi.org/10.1007/978-3-319-19258-1_21
Download citation
DOI: https://doi.org/10.1007/978-3-319-19258-1_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19257-4
Online ISBN: 978-3-319-19258-1
eBook Packages: Computer ScienceComputer Science (R0)