skip to main content
10.1145/3015166.3015182acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicspsConference Proceedingsconference-collections
research-article

Extended Discriminative Spatial Pyramid

Authors Info & Claims
Published:21 November 2016Publication History

ABSTRACT

In this paper, we introduce a novel model for embedding image spatial information into a feature vector based on an extension of spatial pyramid model (SPM). Our novel model considers the spatial distributions of both visual words and visual word combinations, extending the original SPM with a new explanation. The popular combination "spatial pyramid + max pooling + linear SVMs" for image classification and some existing works can be seen as simple implementations of our novel model, and we propose another one for better illustration. Three simple implementations are contrastively analyzedon Caltech 101, 15 Scenes and UIUC-Sports datasets, and our proposed one slightly outperforms the others.

References

  1. Y. Huang, Z. Wu, L. Wang, and T. Tan, "Feature Coding in Image Classification: A Comprehensive Study," IEEE TPAMI, vol. 36, no. 3, pp. 493--506, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. G. Lowe, "Distinctive image features from scale-invariant keypoints," IJCV, vol. 60, pp. 91--110, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Yang, K. Yu, and T. Huang, "Supervised Translation-Invariant Sparse Coding," in CVPR, 2010, pp. 3517--3524.Google ScholarGoogle Scholar
  4. J. Feng, B. Ni, Q. Tian, and S. Yan, "Geometric lp-norm Feature Pooling for Image Classification," in CVPR, 2011, pp. 2697--2704. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Y.L. Boureau, F. Bach, Y. LeCun, and J. Ponce, "Learning Mid-Level Features For Recognition," in CVPR, 2010, pp. 2559--2566.Google ScholarGoogle Scholar
  6. T. Harada, Y. Ushiku, Y. Yamashita, and Y. Kuniyoshi, "Discriminative Spatial Pyramid," in CVPR, 2011, pp. 1617--1624. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Lazebnik, C. Schmid, and J. Ponce, "Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories," in CVPR, 2006, pp. 2169--2178. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Y. Huang, Z. Wu, L. Wang, and C. Song, "Multiple spatial pooling for visual object recognition," Neurocomputing, vol. 129, no. 4, pp. 225--231, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. V. Viitaniemi and J. Laaksonen, "Spatial extensions to bag of visual words," in CIVR, 2009, pp. 1--8. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. F. Sadeghi and M.F. Tappen, "Latent Pyramidal Regions for Recognizing Scenes," in ECCV, 2012, pp. 228--241. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Y. Jia, C. Huang and T. Darrell, "Beyond Spatial Pyramids: Receptive Field Learning for Pooled Image Features," in CVPR, 2012, pp. 3370--3377. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. Dammak, M. Mejdoub, and C.B. Amar, "Histogram of dense subgraphs for image representation," IET Image Processing, vol. 9, no. 3, pp. 184--191, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  13. S. Zhang, Q. Tian, G. Hua, Q. Huang, and W. Gao, "Generating Descriptive Visual words and Visual Phrases for Large-Scale Image Applications," IEEE TIP, vol. 20, no. 9, pp. 2664--2667, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. T. Chen, K. Yap, and L. Chau, "From Universal Bag-of-Words to Adaptive Bag-of-Phrases for Mobile Scene Recognition," in ICIP, 2011, pp. 825--828.Google ScholarGoogle Scholar
  15. S. Yan, D. Xu, B. Zhang, H. Zhang, Q. Yang, and S. Lin,"Graph embedding and extensions: A general framework for dimensionality reduction,"IEEE TPAMI, vol. 29, no. 1, pp. 40--51, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, and Y. Gong, "Locality-Constrained Linear Coding for Image Classification," in CVPR, 2010, pp. 3360--3367.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICSPS 2016: Proceedings of the 8th International Conference on Signal Processing Systems
    November 2016
    235 pages
    ISBN:9781450347907
    DOI:10.1145/3015166

    Copyright © 2016 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 21 November 2016

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    ICSPS 2016 Paper Acceptance Rate46of83submissions,55%Overall Acceptance Rate46of83submissions,55%
  • Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader