skip to main content
10.1145/2009916.2009987acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Integrating hierarchical feature selection and classifier training for multi-label image annotation

Published:24 July 2011Publication History

ABSTRACT

It is well accepted that using high-dimensional multi-modal visual features for image content representation and classifier training may achieve more sufficient characterization of the diverse visual properties of the images and further result in higher discrimination power of the classifiers. However, training the classifiers in a high-dimensional multi-modal feature space requires a large number of labeled training images, which will further result in the problem of curse of dimensionality. To tackle this problem, a hierarchical feature subset selection algorithm is proposed to enable more accurate image classification, where the processes for feature selection and classifier training are seamlessly integrated in a single framework. First, a feature hierarchy (i.e., concept tree for automatic feature space partition and organization) is used to automatically partition high-dimensional heterogeneous multi-modal visual features into multiple low-dimensional homogeneous single-modal feature subsets according to their certain physical meanings and each of them is used to characterize one certain type of the diverse visual properties of the images. Second, principal component analysis (PCA) is performed on each homogeneous singlemodal feature subset to select the most representative feature dimensions and a weak classifier is learned simultaneously. After the weak classifiers and their representative feature dimensions are available for all these homogeneous single-modal feature subsets, they are combined to generate an ensemble image classifier and achieve hierarchical feature subset selection. Our experiments on a specific domain of natural images have also obtained very positive results.

References

  1. K. Barnard and D. Forsyth. Learning the semantics of words and pictures. ICCV, pages 408--415, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  2. K. Bennet and A. Demiriz. Semi-supervised support vector machines. NIPS, pages 368--374, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. Blei and M. Jordan. Modeling annotated data. ACM SIGIR, pages 127--134, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. L. Breiman. Bagging predictors. Machine Learning, 24:123--140, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. C. Carson, S. Belongie, H. Greenspan, and J. Malik. Blobworld: Image segmentation using expectation maximization and its application to image querying. IEEE Trans. on PAMI, 24:1026--1038, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Y. Deng and M. Manjunath. Unsupervised segmentation for color-texture regions in images and video. IEEE Trans. on PAMI, 23:800--810, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P. Duygulu, K. Barnard, J. Freitas, and D. Forsyth. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. ECCV, pages 97--112, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Fan, Y. Gao, and H. Luo. Multi-level annotation of natural scenes using dominant image components and semantic image concepts. ACM Multimedia, pages 540--547, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. W. Fan, S. Stolfo, J. Zhang, and P. Chan. Adacost: Misclassification cost-sensitive boosting. ICML, pages 99--105, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Y. Freund and R. Schapire. Experiments with a new boosting algorithm. ICML, pages 148--156, 1996.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Y. Gao, J. Fan, H. Luo, X. Xue, and R. Jain. Automatic image annotation by incorporating feature hierarchy and boosting to scale-up svm classifiers. ACM Multimedia, pages 901--910, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K. Ghahremani, C. Shahabi, S. Yao, and R. Zimmermann. Yima: real-time multimedia storage and retrieval. ACM Multimedia, pages 668--669, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Jeon, V. Lavrenko, and R. Manmatha. Automatic image annotation and retrieval using cross-media relevance models. ACM SIGIR, pages 119--126, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Y. Jin, L. Khan, L. Wang, and M. Awad. Image annotations by combining multiple evidence and wordnet. ACM Multimedia, pages 706--715, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. T. Joachims. Transductive inference for text classification using support vector machines. ICML, pages 200--209, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. V. Lavrenko, R. Manmatha, and J. Jeon. A model for learning the semantics of pictures. NIPS, pages 553--560, 2003.Google ScholarGoogle Scholar
  17. A. Makadia, V. Pavlovic, and S. Kumar. A new baseline for image annotation. ECCV, pages 316--329, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. D. Modha and W. S. Spangler. Feature weighting in k-means clustering. Machine Learning, 52:217--237, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. K. Nigam, A. McCallum, S. Thrun, and T. Mitchell. Text classification from labeled and unlabeled documents using em. Machine Learning, pages 103--134, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. O'Sullivan, J. Langford, R. Caruana, and A. Blum. Featureboost: A meta learning algorithm that improves model robustness. ICML, pages 703--710, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Platt. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Adavances in Large Margin Classifiers, MIT Press, 1999.Google ScholarGoogle Scholar
  22. B. Ripley. Neural network and related methods for classification. Journal of the Royal Statistical Society, Series B, 56:409--456, 1994.Google ScholarGoogle Scholar
  23. A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. IEEE Trans. on PAMI, 22:1349--1380, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. C. Sutton, M. Sindelar, and A. McCallum. Feature boosting: Preventing weight undertraining in structured discriminative learning. CIIR TR-IR-402 University of Masschusetts, 2005.Google ScholarGoogle Scholar
  25. M. Szummer and T. Jaakkola. Information regularization with partially labeled data. NIPS, pages 1025--1032, 2002.Google ScholarGoogle Scholar
  26. A. Torralba and A. Oliva. Semantic organization of scenes using discriminant structural templates. ICCV, pages 1253--1258, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. V. Vapnik. Statistical learning theory. 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. N. Vasconcelos and M. Vasconcelos. Scalable discriminant feature selection for image retrieval and recognition. CVPR, pages 770--775, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. P. Viola and M. Jones. Robust real-time face detection. Intl. J. Computer Vision, 57:137--154, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. X.-J. Wang, W.-Y. Ma, L. Zhang, and X. Li. Iteratively clustering web images based on link and attribute reinforcements. ACM Multimedia, pages 122--131, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. M. Weber, M. Welling, and P. Perona. Unsupervised learning of models for recognition. ECCV, pages 18--32, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. C. Yang, M. Dong, and J. Hua. Region-based image annotation using asymmetrical support vector machine-based multiple-instance learning. CVPR, pages 2057--2063, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. L. Yu and H. Liu. Feature selection for high-dimensional data: A fast correlation-based filter solution. ICML, pages 856--863, 2003.Google ScholarGoogle Scholar
  34. R. Zhang, Z. Zhang, M. Li, W.-Y. Ma, and H. Zhang. A probabilistic semantic model for image annotation and multi-modal image retrieva. ICCV, pages 846--851, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. S. Zhang, J. Huang, Y. Huang, Y. Yu, H. Li, and D. Metaxas. Automatic image annotation using group sparsity. CVPR, pages 3312--3319, 2010.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Integrating hierarchical feature selection and classifier training for multi-label image annotation

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        SIGIR '11: Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
        July 2011
        1374 pages
        ISBN:9781450307574
        DOI:10.1145/2009916

        Copyright © 2011 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 24 July 2011

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate792of3,983submissions,20%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader