A Multi-modal SPM Model for Image Classification

Zheng, Peng; Zhao, Zhong-Qiu; Gao, Jun

doi:10.1007/978-3-319-63315-2_46

Peng Zheng¹⁷,
Zhong-Qiu Zhao¹⁷ &
Jun Gao¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10363))

Included in the following conference series:

International Conference on Intelligent Computing

2426 Accesses

Abstract

The BoF (bag-of-features) model is one of the most famous models applied to many fields in computer vision and has achieved impressive results. However, the SIFT/HOG visual words have a limit discriminative power which is partly due to the fact that it only describes the local gradient distribution. In the meanwhile, there is still redundancy and hidden information existed in the formed histogram. Considering these respects, we propose a multi-modal SPM model which fuses global features to complement traditional local ones and conducts dimensionality reduction in local spaces for mining possible feature dependencies. Experimental results show the efficiency of the proposed method in comparison with the existing counterparts.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Locality constrained encoding of frequency and spatial information for image classification

Article 01 March 2018

Component SPD matrices: A low-dimensional discriminative data descriptor for image set classification

Article Open access 04 August 2018

Improved Soft Assignment Coding for Image Classification

References

Bosch, A., Zisserman, A., Muoz, X.: Scene classification using a hybrid generative/discriminative approach. IEEE Trans. Pattern Anal. Mach. Intell. 30(4), 712–727 (2008)
Article Google Scholar
Cao, L., Ji, R., Gao, Y., Yang, Y., Tian, Q.: Weakly supervised sparse coding with geometric consistency pooling. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012, pp. 3578–3585. IEEE (2012)
Google Scholar
Chum, O., Philbin, J., Sivic, J., Isard, M., Zisserman, A.: Total recall: automaticquery expansion with a generative feature model for object retrieval. In: ICCV, pp. 1–8 (2007)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Computer Vision and Pattern Recognition. vol. 1, pp. 886–893. IEEE (2005)
Google Scholar
Elad, M., Aharon, M.: Image denoising via sparse and redundant representations over learned dictionaries. IEEE Trans. Image Proc. 15(12), 3736–3745 (2006)
Article MathSciNet Google Scholar
Fei-Fei, L., Perona, P.: A bayesian hierarchical model for learning natural scene categories. In: CVPR, vol. 2, pp. 524–531. IEEE (2005)
Google Scholar
Gao, S., Tsang, I.W., Chia, L.T., Zhao, P.: Local features are not lonely–laplacian sparse coding for image classification. In: CVPR, pp. 3555–3561. IEEE (2010)
Google Scholar
Griffin, G., Holub, A., Perona, P.: Caltech-256 object category dataset (2007)
Google Scholar
Jurie, F., Triggs, B.: Creating efficient codebooks for visual recognition. In: ICCV, vol. 1, pp. 604–610. IEEE (2005)
Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: CVPR, pp. 2169–2178 (2006)
Google Scholar
Li, F.F., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. Comput. Vis. Image Underst. 106(1), 59–70 (2007)
Article Google Scholar
Long, M., Ding, G., Wang, J., Sun, J., Guo, Y., Yu, P.S.: Transfer sparse coding for robust image representation. In: CVPR, pp. 407–414. IEEE (2013)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Article Google Scholar
Manjunath, B., Ma, W.: Texture features for browsing and retrieval of image data. IEEE Trans. Pattern Anal. Mach. Intell. 18(8), 837–842 (1996)
Article Google Scholar
Oliva, A., Torralba, A.: Building the gist of a scene: the role of global image features in recognition. Prog. Brain Res. 155, 23–36 (2006)
Article Google Scholar
Quelhas, P., Monay, F., Odobez, J.M., Gatica-Perez, D., Tuytelaars, T., Van Gool, L.: Modeling scenes with local descriptors and latent aspects. In: ICCV, vol. 1, pp. 883–890. IEEE (2005)
Google Scholar
Stricker, M., Orengo, M.: Similarity of color images. In: SPIE Conference on Storage and Retrieval for Image and Video Databases, vol. 2420, pp. 381–392, San Jose, USA (1995)
Google Scholar
Wang, D., Lu, H., Chen, Y.W.: Object tracking by multi-cues spatial pyramid matching. In: ICIP, pp. 3957–3960. IEEE (2010)
Google Scholar
Wang, M., Gao, Y., Lu, K., Rui, Y.: View-based discriminative probabilistic modeling for 3d object retrieval and recognition. IEEE Trans. Image Proc. 22(4), 1395–1407 (2013)
Article MathSciNet Google Scholar
Wang, M., Li, W., Liu, D., Ni, B., Shen, J., Yan, S.: Facilitating image search with a scalable and compact semantic mapping. IEEE Trans. Cybern. 45(8), 1561–1574 (2015)
Article Google Scholar
Wang, M., Liu, X., Wu, X.: Visual classification by l1-hypergraph modeling. IEEE Trans. Knowl. Data Eng. 27(9), 2564–2574 (2015)
Article Google Scholar
Wu, J.X., Rehg, J.M.: Where am i: place instance and category recognition using spatial pact. In: CVPR, pp. 1–8. IEEE (2008)
Google Scholar
Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: CVPR, pp. 1794–1801. IEEE (2009)
Google Scholar
Yin, H., Cao, Y., Sun, H.: Combining pyramid representation and adaboost for urban scene classification using high-resolution synthetic aperture radar images. Radar Sonar Navig. IET 5(1), 58–64 (2011)
Article Google Scholar
Yuan, X.T., Liu, X., Yan, S.: Visual classification with multitask joint sparse representation. IEEE Trans. Image Proc. 21(10), 4349–4360 (2012)
Article MathSciNet Google Scholar
Zhao, Z.Q., Glotin, H., Xie, Z., Gao, J., Wu, X.D.: Cooperative sparse representation in two opposite directions for semi-supervised image annotation. IEEE Trans. Image Proc. 21(9), 4218–4231 (2012)
Article MathSciNet Google Scholar
Zheng, L., Wang, S., Liu, Z., Tian, Q.: Packing and padding: coupled multi-index for accurate image retrieval. In: CVPR (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

College of Computer and Information, Hefei University of Technology, Hefei, China
Peng Zheng, Zhong-Qiu Zhao & Jun Gao

Authors

Peng Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Zhong-Qiu Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Jun Gao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhong-Qiu Zhao .

Editor information

Editors and Affiliations

Tongji University, Shanghai, China
De-Shuang Huang
Liverpool John Moores University, Liverpool, United Kingdom
Abir Hussain
Inha University, Incheon, Korea (Republic of)
Kyungsook Han
Indian Institute of Technology Madras, Chennai, India
M. Michael Gromiha

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zheng, P., Zhao, ZQ., Gao, J. (2017). A Multi-modal SPM Model for Image Classification. In: Huang, DS., Hussain, A., Han, K., Gromiha, M. (eds) Intelligent Computing Methodologies. ICIC 2017. Lecture Notes in Computer Science(), vol 10363. Springer, Cham. https://doi.org/10.1007/978-3-319-63315-2_46

Download citation

DOI: https://doi.org/10.1007/978-3-319-63315-2_46
Published: 21 July 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63314-5
Online ISBN: 978-3-319-63315-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics