skip to main content
10.1145/2393347.2396514acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

What you see is what you should get

Published:29 October 2012Publication History

ABSTRACT

This paper summarizes our work on optimization of video content compression in the context of the end user. I will briefly explain motivation and novelty of the work that presents the bulk of my dissertation research. Preliminary experiments with promising results are encouraging further steps that should lead to completion of a model for video compression that uses most of available bits for the video content that is actually seen. By this we mean the content that user attends to. Everything else is coded with much less bits, leading to significant savings compared to state-of-the-art coding techniques. Our work should be regarded as extension and not replacement of the hybrid coding paradigm.

References

  1. Cisco, Inc. 2012. Visual Networking Index Services Adoption (VNI SA) Forecast, 2011--2016. Whitepaper.Google ScholarGoogle Scholar
  2. Bin, L., Sullivan, G. J. and Xu J. 2012. Compression Performance of High Efficiency Video Coding (HEVC) Working Draft 4. IEEE International Symposium on Circuits and Systems (ISCAS).Google ScholarGoogle Scholar
  3. Miller, G.A. 1956. The magical number seven, plus or minus two: some limits on our capacity for processing information. Psychological Review, Vol 63(2), 81--97.Google ScholarGoogle ScholarCross RefCross Ref
  4. Sherrington, C.S. 1897. On the reciprocal action in the retina as studied by means of some rotating discs. J Physiol. 21, 33--54.Google ScholarGoogle ScholarCross RefCross Ref
  5. McDougall, W. 1904. The sensations excited by a single momentary stimulation of the eye. Brit J Psychol 1, 78--113.Google ScholarGoogle Scholar
  6. Girod, B. 1989. The information theoretical significance of spatial and temporal masking in video signals. Proc. SPIE Human Vision, Visual Processing and Digital Display, vol. 1077, pp. 178--187.Google ScholarGoogle ScholarCross RefCross Ref
  7. Tam, W. J., Stelmach, L. B., Wang, L., Lauzon, D. and Gray, P. 1995. Visual masking at video scene cuts. Proc. SPIE Human Vision, Visual Processing and Digital Display, vol. 2411, pp. 111--119.Google ScholarGoogle Scholar
  8. Hu Q., Klein S.A. and Carney T. 1993. Masking of high-spatial-frequency information after a scene cut. Society for Informational Display 93 Digest. 24:521--523.Google ScholarGoogle Scholar
  9. Pastrana-Vidal R.R., Gicquel, J.-C., Colomes, C. and H. Cherifi. 2004. Temporal Masking Effect on Dropped Frames at Video Scene Cuts. Proc. SPIE Human Vision and Electronic Imaging IX, vol. 5292, pp. 194--201..Google ScholarGoogle Scholar
  10. Quan H.-T., Ghanbari, M. 2008. Asymmetrical temporal masking near video scene change. 15th IEEE International Conference on Image Processing, vol., no., pp.2568--2571.Google ScholarGoogle Scholar
  11. Itti, L. and Baldi, P. 2005. A principled approach to detecting surprising events in video. Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Ha, H., Park, J., Lee S. and Bovik, A.C. 2011. Perceptually Scalable Extension of H.264. IEEE Transactions on Circuits and Systems for Video Technology, vol.21, no.11, pp.1667--1678. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Rensink, R. A. 2002. A model of saliency-based visual attention for rapid science analysis. ACM Proc. 2nd Int. Symp. Smart Graphics, New York, pp. 63--70.Google ScholarGoogle Scholar
  14. Lee, J.-B. and Eleftheriadis, A. 2005. Spatio-temporal model-assisted very low-bit-rate coding with compatibility. IEEE Transactions on Circuits and Systems for Video Technology, vol.15, no.12, pp. 1517- 1532. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Rensink R. A., O' Regan J. K. and Clark, J. J. 1997. To see or not to see: The need for attention to perceive changes in scenes. Psychol. Sci., vol. 8, pp. 368--373.Google ScholarGoogle ScholarCross RefCross Ref
  16. Bridgeman, G., Hendry, D. and Stark, L. 1975. Failure to detect displacement of visual world during saccadic eye movements Vision Research, 15, 719--722.Google ScholarGoogle Scholar
  17. Mizukoshi, K., Fabian, P. and Stahle, J. 1977. Optokinetic Test Comprising Both Acceleration and Constant Velocity Stimulation. Acta Oto-laryng., Vol. 84, No. 1--6., p155--165Google ScholarGoogle ScholarCross RefCross Ref
  18. Lindemann, L., Wenger, S. and Magnor, M. 2012. Evaluation of video artifact perception using event-related potentials. Proceedings of the ACM SIGGRAPH Symposium on Applied Perception in Graphics and Visualization (APGV '11), New York, NY, USA, 53--58. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Scholler, S., Bosse, S., Treder, M.S., Blankertz, B., Curio, G., Muller, K.-R. and Wiegand, T. 2012. Toward a Direct Measure of Video Quality Perception Using EEG. IEEE Transactions on Image Processing , vol.21, n.5, p.2619--262Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. What you see is what you should get

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MM '12: Proceedings of the 20th ACM international conference on Multimedia
      October 2012
      1584 pages
      ISBN:9781450310895
      DOI:10.1145/2393347

      Copyright © 2012 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 29 October 2012

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate995of4,171submissions,24%

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader