skip to main content
10.1145/2393347.2396306acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
poster

Memorable basis: towards human-centralized sparse representation

Published: 29 October 2012 Publication History

Abstract

Previous studies of sparse representation in multimedia research focus on developing reliable and efficient dictionary learning algorithms. Despite the sparse prior, how to integrate other related perceptual factors of human being into dictionary learning process was seldom studied. In this paper, we investigate the influence of image memorability for human-centralized sparse representation. Based on the results of a photo memory game, we are able to quantitatively characterize an image's memorability which allows us to train sparse bases from the most memorable images instead of randomly selected natural images. We believed that such kind of basis is more consistent with neural networks in human brain and hence can better predict where human looks. To test our hypothesis, we choose human eye-fixation prediction problem for quantitative evaluation. The experimental results demonstrate the superior performance of our Memorable Basis compared to traditional sparse basis trained from unselected images.

References

[1]
G. Davis, S. Mallat, and M. Avellaneda. Adaptive greedy approximations. J. Construct. Approx., vol. 13, pp. 57--98, 1997.
[2]
S. Mallat and Z. Zhang. Matching pursuits with time-frequency dictionaries. IEEE Trans. Signal Process., vol. 41, no. 12, pp. 3397--3415, 1993.
[3]
J. A. Tropp. Greed is good: Algorithmic results for sparse approximation. IEEE TIT, vol. 50, pp. 2231--2242, 2004.
[4]
S. S. Chen, D. L. Donoho, and M. A. Saunders. Atomic decomposition by basis pursuit. SIAM Rev., vol. 43, no. 1, pp. 129--159, 2001.
[5]
A. Bell and T. Sejnowski. The independent components of natural scenes are edge filters. Vision Research, vol. 37, no. 23, pp. 3327--3338, 1997.
[6]
M. Aharon, M. Elad, and A. Bruckstein. K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process., vol. 54, no. 11, pp. 4311--4322, 2006.
[7]
P. Isola, J. Xiao, A. Torralba and A. Oliva. What makes an image memorable? IEEE CVPR, pp. 145--152, 2011.
[8]
P. Isola, D. Parikh, A. Torralba and A. Oliva. Understanding the Intrinsic Memorability of Images. NIPS, 2011.
[9]
L. Standing. Learning 10,000 pictures. Quarterly Journal of Experimental Psychology, 1973.
[10]
T.F. Brady, T. Konkle, G. A. Alvarez, and A. Oliva. Visual long-term memory has a massive storage capacity for object details. Proc. Natl. Acad. Sci., 2008.
[11]
T. Konkle, T. F. Brady, G. A. Alvarez, and A. Oliva. Conceptual distinctiveness supports detailed visual long-term memory for real-world objects. Journal of Experimental Psychology: General, vol. 139, no. 3, pp. 558--578, 2010
[12]
J. Xiao, J. Hayes, K. Ehinger, A. Oliva, and A. Torralba. Sun database: Large-scale scene recognition from abbey to zoo. IEEE CVPR, 2010.
[13]
A.Hyvärinen, E. Oja, A fast fixed-point algorithm for independent component analysis, Neural Computation, pp, 1483--1492, 1997.
[14]
N. Bruce, J. Tsotsos, Saliency, attention, and visual search: An information thretic approach, Journal of Vision, vol. 9, no. 3, pp.1--24, 2009.
[15]
X. Hou and L. Zhang. Dynamic visual attention: searching for coding length increments. NIPS, pp. 681--688, 2008
[16]
J. Yan, M. Zhu, H. Liu and Y. Liu. Visual saliency detection via sparsity pursuit. IEEE Signal Processing Letters, vol. 17, no. 8, pp. 739--742, 2010.
[17]
C. Lang, G. Liu, J. Yu and S. Yan. Saliency detection by multi-task sparsity pursuit. IEEE Trans. Image Process. 2011.
[18]
N. Bruce and J. Tsotsos. Saliency based on information maximization, NIPS, pp. 155--162, 2006
[19]
L. Zhang, M. Tong, T. Marks, H. Shan, and G. Cottrell. Sun: A Bayesian framework for saliency using natural statistics, Journal of Vision, vol. 8, no. 7, pp. 1--20, 2008.

Index Terms

  1. Memorable basis: towards human-centralized sparse representation

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '12: Proceedings of the 20th ACM international conference on Multimedia
    October 2012
    1584 pages
    ISBN:9781450310895
    DOI:10.1145/2393347
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 October 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. eye fixation prediction
    2. human centralized sparse representation
    3. image memorability
    4. visual attention

    Qualifiers

    • Poster

    Conference

    MM '12
    Sponsor:
    MM '12: ACM Multimedia Conference
    October 29 - November 2, 2012
    Nara, Japan

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 173
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 01 Mar 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media