skip to main content
10.1145/1386352.1386370acmconferencesArticle/Chapter ViewAbstractPublication PagescivrConference Proceedingsconference-collections
research-article

Non-negative matrix factorisation for object class discovery and image auto-annotation

Authors Info & Claims
Published:07 July 2008Publication History

ABSTRACT

In information retrieval, sub-space techniques are usually used to reveal the latent semantic structure of a data-set by projecting it to a low dimensional space. Non-negative matrix factorisation (NMF), which generates a non-negative representation of data through matrix decomposition, is one such technique. It is different from other similar techniques, such as singular vector decomposition (SVD), in its non-negativity constraints which lead to its parts-based representation characteristic. In this paper, we present the novel use of NMF in two tasks; object class detection and automatic annotation of images. Experimental results imply that NMF is a promising sub-space technique for discovering the latent structure of image data-sets, with the ability of encoding the latent topics that correspond to object classes in the basis vectors generated.

References

  1. Jia-Yu Pan, Hyung-Jeong Yang, Pinar Duygulu, and Christos Faloutsos, "Automatic image captioning," in Proceedings of the 2004 IEEE International Conference on Multimedia and Expo (ICME 2004), 2004, pp. 1987--1990.Google ScholarGoogle Scholar
  2. Jonathon S. Hare and Paul H. Lewis, "Saliency-based models of image content and their application to auto-annotation by semantic propagation," in Proceedings of Multimedia and the Semantic Web / European Semantic Web Conference 2005, 2005.Google ScholarGoogle Scholar
  3. D. D. Lee and H. S. Seung, "Learning the parts of objects by non-negative matrix factorization," Nature, vol. 401, pp. 788, october 1999.Google ScholarGoogle ScholarCross RefCross Ref
  4. S. Tsuge, M. Shishibori, S. Kuroiwa, and K. Kita, "Dimensionality reduction using non-negative matrix factorization for information retrieval," in IEEE International Conference on Systems, Man, and Cybernetics, 2001, pp. 960--965.Google ScholarGoogle Scholar
  5. Wei Xu, Xin Liu, and Yihong Gong, "Document clustering based on non-negative matrix factorization," in SIGIR '03: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval. 2003, pp. 267--273, ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. David Guillamet, Jordi Vitrià, and Bernt Schiele, "Introducing a weighted non-negative matrix factorization for image classification," Pattern Recognition Letters, vol. 24, no. 14, pp. 2447--2454, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Weixiang Liu and Nanning Zheng, "Non-negative matrix factorization based methods for object recognition," Pattern Recognition Letters, vol. 25, no. 8, pp. 893--897, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Daniel D. Lee and H. Sebastian Seung, "Algorithms for non-negative matrix factorization," in Advances in Neural Information Processing Systems, 2001, pp. 556--562.Google ScholarGoogle Scholar
  9. Patrik O. Hoyer, "Non-negative matrix factorization with sparseness constraints," The Journal of Machine Learning Research, vol. 5, pp. 1457--1469, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Eric Gaussier and Cyril Goutte, "Relation between plsa and nmf and implications," in Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, 2005, pp. 601 -- 602. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Chris Ding, Tao Li, and Wei Peng, "Nmf and plsi: equivalence and a hybrid algorithm," in Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, 2006, pp. 641 -- 642. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Jonathon S. Hare and Paul H. Lewis, "Salient regions for query by image content.," in CIVR '04: Proceedings of the 6th ACM international conference on Image and video retrieval, 2004, pp. 317--325.Google ScholarGoogle Scholar
  13. David G. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision, vol. 60, no. 2, pp. 91--110, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Chih-Jen Lin, "Projected gradient methods for non-negative matrix factorization," Tech. Rep. Information and Support Service ISSTECH-95-013, Department of Computer Science, National Taiwan University, 2005.Google ScholarGoogle Scholar
  15. Jianbo Shi and Jitendra Malik, "Normalized cuts and image segmentation," IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), pp. 888--905, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Bryan C. Russell, Alexei A. Efros, Josef Sivic, William T. Freeman, and Andrew Zisserman, "Using multiple segmentations to discover objects and their extent in image collections," in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, June 2006, pp. 1605--1614. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, and Richard Harshman, "Indexing by latent semantic analysis," Journal of the American Society for Information Science, vol. 41, pp. 391 -- 407, 1990.Google ScholarGoogle ScholarCross RefCross Ref
  18. Kobus Barnard, Pinar Duygulu, Nando de Freitas, David Forsyth, David Blei, and Michael I. Jordan, "Matching words and pictures," Journal of Machine Learning Research, vol. 3, pp. 1107--1135, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Jonathon S. Hare, Paul H. Lewis, Peter G. B. Enser, and Christine J. Sandom, "A linear-algebraic technique with an application in semantic image retrieval," in CIVR '06: Proceedings of the 6th ACM international conference on Image and video retrieval, 2006, pp. 31--40. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Non-negative matrix factorisation for object class discovery and image auto-annotation

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        CIVR '08: Proceedings of the 2008 international conference on Content-based image and video retrieval
        July 2008
        674 pages
        ISBN:9781605580708
        DOI:10.1145/1386352

        Copyright © 2008 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 7 July 2008

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader