skip to main content
10.1145/2393347.2396514acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

What you see is what you should get

Published: 29 October 2012 Publication History

Abstract

This paper summarizes our work on optimization of video content compression in the context of the end user. I will briefly explain motivation and novelty of the work that presents the bulk of my dissertation research. Preliminary experiments with promising results are encouraging further steps that should lead to completion of a model for video compression that uses most of available bits for the video content that is actually seen. By this we mean the content that user attends to. Everything else is coded with much less bits, leading to significant savings compared to state-of-the-art coding techniques. Our work should be regarded as extension and not replacement of the hybrid coding paradigm.

References

[1]
Cisco, Inc. 2012. Visual Networking Index Services Adoption (VNI SA) Forecast, 2011--2016. Whitepaper.
[2]
Bin, L., Sullivan, G. J. and Xu J. 2012. Compression Performance of High Efficiency Video Coding (HEVC) Working Draft 4. IEEE International Symposium on Circuits and Systems (ISCAS).
[3]
Miller, G.A. 1956. The magical number seven, plus or minus two: some limits on our capacity for processing information. Psychological Review, Vol 63(2), 81--97.
[4]
Sherrington, C.S. 1897. On the reciprocal action in the retina as studied by means of some rotating discs. J Physiol. 21, 33--54.
[5]
McDougall, W. 1904. The sensations excited by a single momentary stimulation of the eye. Brit J Psychol 1, 78--113.
[6]
Girod, B. 1989. The information theoretical significance of spatial and temporal masking in video signals. Proc. SPIE Human Vision, Visual Processing and Digital Display, vol. 1077, pp. 178--187.
[7]
Tam, W. J., Stelmach, L. B., Wang, L., Lauzon, D. and Gray, P. 1995. Visual masking at video scene cuts. Proc. SPIE Human Vision, Visual Processing and Digital Display, vol. 2411, pp. 111--119.
[8]
Hu Q., Klein S.A. and Carney T. 1993. Masking of high-spatial-frequency information after a scene cut. Society for Informational Display 93 Digest. 24:521--523.
[9]
Pastrana-Vidal R.R., Gicquel, J.-C., Colomes, C. and H. Cherifi. 2004. Temporal Masking Effect on Dropped Frames at Video Scene Cuts. Proc. SPIE Human Vision and Electronic Imaging IX, vol. 5292, pp. 194--201.
[10]
Quan H.-T., Ghanbari, M. 2008. Asymmetrical temporal masking near video scene change. 15th IEEE International Conference on Image Processing, vol., no., pp.2568--2571.
[11]
Itti, L. and Baldi, P. 2005. A principled approach to detecting surprising events in video. Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition.
[12]
Ha, H., Park, J., Lee S. and Bovik, A.C. 2011. Perceptually Scalable Extension of H.264. IEEE Transactions on Circuits and Systems for Video Technology, vol.21, no.11, pp.1667--1678.
[13]
Rensink, R. A. 2002. A model of saliency-based visual attention for rapid science analysis. ACM Proc. 2nd Int. Symp. Smart Graphics, New York, pp. 63--70.
[14]
Lee, J.-B. and Eleftheriadis, A. 2005. Spatio-temporal model-assisted very low-bit-rate coding with compatibility. IEEE Transactions on Circuits and Systems for Video Technology, vol.15, no.12, pp. 1517- 1532.
[15]
Rensink R. A., O' Regan J. K. and Clark, J. J. 1997. To see or not to see: The need for attention to perceive changes in scenes. Psychol. Sci., vol. 8, pp. 368--373.
[16]
Bridgeman, G., Hendry, D. and Stark, L. 1975. Failure to detect displacement of visual world during saccadic eye movements Vision Research, 15, 719--722.
[17]
Mizukoshi, K., Fabian, P. and Stahle, J. 1977. Optokinetic Test Comprising Both Acceleration and Constant Velocity Stimulation. Acta Oto-laryng., Vol. 84, No. 1--6., p155--165
[18]
Lindemann, L., Wenger, S. and Magnor, M. 2012. Evaluation of video artifact perception using event-related potentials. Proceedings of the ACM SIGGRAPH Symposium on Applied Perception in Graphics and Visualization (APGV '11), New York, NY, USA, 53--58.
[19]
Scholler, S., Bosse, S., Treder, M.S., Blankertz, B., Curio, G., Muller, K.-R. and Wiegand, T. 2012. Toward a Direct Measure of Video Quality Perception Using EEG. IEEE Transactions on Image Processing, vol.21, n.5, p.2619--262

Cited By

View all
  • (2014)PIVP 2014Proceedings of the 22nd ACM international conference on Multimedia10.1145/2647868.2647879(1263-1264)Online publication date: 3-Nov-2014
  • (2013)Temporal visual masking for HEVC/H.265 perceptual optimization2013 Picture Coding Symposium (PCS)10.1109/PCS.2013.6737775(430-433)Online publication date: Dec-2013

Index Terms

  1. What you see is what you should get

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '12: Proceedings of the 20th ACM international conference on Multimedia
    October 2012
    1584 pages
    ISBN:9781450310895
    DOI:10.1145/2393347
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 October 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. cognition
    2. perception
    3. video compression

    Qualifiers

    • Research-article

    Conference

    MM '12
    Sponsor:
    MM '12: ACM Multimedia Conference
    October 29 - November 2, 2012
    Nara, Japan

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 13 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2014)PIVP 2014Proceedings of the 22nd ACM international conference on Multimedia10.1145/2647868.2647879(1263-1264)Online publication date: 3-Nov-2014
    • (2013)Temporal visual masking for HEVC/H.265 perceptual optimization2013 Picture Coding Symposium (PCS)10.1109/PCS.2013.6737775(430-433)Online publication date: Dec-2013

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media