research-article

What you see is what you should get

Author:

Velibor AdzicAuthors Info & Claims

MM '12: Proceedings of the 20th ACM international conference on Multimedia

Pages 1441 - 1444

https://doi.org/10.1145/2393347.2396514

Published: 29 October 2012 Publication History

Abstract

This paper summarizes our work on optimization of video content compression in the context of the end user. I will briefly explain motivation and novelty of the work that presents the bulk of my dissertation research. Preliminary experiments with promising results are encouraging further steps that should lead to completion of a model for video compression that uses most of available bits for the video content that is actually seen. By this we mean the content that user attends to. Everything else is coded with much less bits, leading to significant savings compared to state-of-the-art coding techniques. Our work should be regarded as extension and not replacement of the hybrid coding paradigm.

References

[1]

Cisco, Inc. 2012. Visual Networking Index Services Adoption (VNI SA) Forecast, 2011--2016. Whitepaper.

[2]

Bin, L., Sullivan, G. J. and Xu J. 2012. Compression Performance of High Efficiency Video Coding (HEVC) Working Draft 4. IEEE International Symposium on Circuits and Systems (ISCAS).

[3]

Miller, G.A. 1956. The magical number seven, plus or minus two: some limits on our capacity for processing information. Psychological Review, Vol 63(2), 81--97.

[4]

Sherrington, C.S. 1897. On the reciprocal action in the retina as studied by means of some rotating discs. J Physiol. 21, 33--54.

[5]

McDougall, W. 1904. The sensations excited by a single momentary stimulation of the eye. Brit J Psychol 1, 78--113.

[6]

Girod, B. 1989. The information theoretical significance of spatial and temporal masking in video signals. Proc. SPIE Human Vision, Visual Processing and Digital Display, vol. 1077, pp. 178--187.

[7]

Tam, W. J., Stelmach, L. B., Wang, L., Lauzon, D. and Gray, P. 1995. Visual masking at video scene cuts. Proc. SPIE Human Vision, Visual Processing and Digital Display, vol. 2411, pp. 111--119.

[8]

Hu Q., Klein S.A. and Carney T. 1993. Masking of high-spatial-frequency information after a scene cut. Society for Informational Display 93 Digest. 24:521--523.

[9]

Pastrana-Vidal R.R., Gicquel, J.-C., Colomes, C. and H. Cherifi. 2004. Temporal Masking Effect on Dropped Frames at Video Scene Cuts. Proc. SPIE Human Vision and Electronic Imaging IX, vol. 5292, pp. 194--201.

[10]

Quan H.-T., Ghanbari, M. 2008. Asymmetrical temporal masking near video scene change. 15th IEEE International Conference on Image Processing, vol., no., pp.2568--2571.

[11]

Itti, L. and Baldi, P. 2005. A principled approach to detecting surprising events in video. Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition.

Digital Library

[12]

Ha, H., Park, J., Lee S. and Bovik, A.C. 2011. Perceptually Scalable Extension of H.264. IEEE Transactions on Circuits and Systems for Video Technology, vol.21, no.11, pp.1667--1678.

Digital Library

[13]

Rensink, R. A. 2002. A model of saliency-based visual attention for rapid science analysis. ACM Proc. 2nd Int. Symp. Smart Graphics, New York, pp. 63--70.

[14]

Lee, J.-B. and Eleftheriadis, A. 2005. Spatio-temporal model-assisted very low-bit-rate coding with compatibility. IEEE Transactions on Circuits and Systems for Video Technology, vol.15, no.12, pp. 1517- 1532.

Digital Library

[15]

Rensink R. A., O' Regan J. K. and Clark, J. J. 1997. To see or not to see: The need for attention to perceive changes in scenes. Psychol. Sci., vol. 8, pp. 368--373.

[16]

Bridgeman, G., Hendry, D. and Stark, L. 1975. Failure to detect displacement of visual world during saccadic eye movements Vision Research, 15, 719--722.

[17]

Mizukoshi, K., Fabian, P. and Stahle, J. 1977. Optokinetic Test Comprising Both Acceleration and Constant Velocity Stimulation. Acta Oto-laryng., Vol. 84, No. 1--6., p155--165

[18]

Lindemann, L., Wenger, S. and Magnor, M. 2012. Evaluation of video artifact perception using event-related potentials. Proceedings of the ACM SIGGRAPH Symposium on Applied Perception in Graphics and Visualization (APGV '11), New York, NY, USA, 53--58.

Digital Library

[19]

Scholler, S., Bosse, S., Treder, M.S., Blankertz, B., Curio, G., Muller, K.-R. and Wiegand, T. 2012. Toward a Direct Measure of Video Quality Perception Using EEG. IEEE Transactions on Image Processing, vol.21, n.5, p.2619--262

Digital Library

Cited By

Kalva HChen HFernandez Escribano GAdzic VHua KRui YSteinmetz RHanjalic ANatsev AZhu W(2014)PIVP 2014Proceedings of the 22nd ACM international conference on Multimedia10.1145/2647868.2647879(1263-1264)Online publication date: 3-Nov-2014
https://dl.acm.org/doi/10.1145/2647868.2647879
Adzic VKalva HFurht B(2013)Temporal visual masking for HEVC/H.265 perceptual optimization2013 Picture Coding Symposium (PCS)10.1109/PCS.2013.6737775(430-433)Online publication date: Dec-2013
https://doi.org/10.1109/PCS.2013.6737775

Index Terms

What you see is what you should get
1. Computing methodologies
  1. Computer graphics
    1. Image compression

Recommendations

Performance analysis of hybrid coders in multi-constraints pruned environment
Abstract
Advance Video Coder (H.264/AVC) and High-Efficiency Video (H.265/HEVC) coders are fast developing video compression standards, provides high compression and quality of service as compared to previously established standards. The present work ...
A Very Low Bit Rate Video Coding Combined with Fast Adaptive Block Size Motion Estimation and Nonuniform Scalar Quantization Multiwavelet Transform

We describe a very low bit rate video coding framework in which motion correlation between successive video frames is exploited in the multiwavelet transform domain. Some complicated techniques, such as spatial prediction in intra coding, adaptive block ...
Bit-stream allocation methods for wavelet based scalable video coding
MobiMedia '06: Proceedings of the 2nd international conference on Mobile multimedia communications

In wavelet based scalable video coding quality scalability is achieved using bit-plane coding techniques that result in embedded bit-stream. In order to minimise distortion when specific bit-rate is targeted, compressed data has to be optimally ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '12: Proceedings of the 20th ACM international conference on Multimedia

October 2012

1584 pages

ISBN:9781450310895

DOI:10.1145/2393347

General Chairs:
Noboru Babaguchi
Osaka University, Japan
,
Kiyoharu Aizawa
The University of Tokyo, Japan
,
John Smith
IBM, USA
,
Program Chairs:
Shin'ichi Satoh
National Institute of Informatics, Japan
,
Thomas Plagemann
University of Oslo, Norway
,
Xian-Sheng Hua
Microsoft, USA
,
Rong Yan
Facebook, USA

Copyright © 2012 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 October 2012

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

MM '12

Sponsor:

SIGMM

MM '12: ACM Multimedia Conference

October 29 - November 2, 2012

Nara, Japan

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
147
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)1

Reflects downloads up to 13 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Kalva HChen HFernandez Escribano GAdzic VHua KRui YSteinmetz RHanjalic ANatsev AZhu W(2014)PIVP 2014Proceedings of the 22nd ACM international conference on Multimedia10.1145/2647868.2647879(1263-1264)Online publication date: 3-Nov-2014
https://dl.acm.org/doi/10.1145/2647868.2647879
Adzic VKalva HFurht B(2013)Temporal visual masking for HEVC/H.265 perceptual optimization2013 Picture Coding Symposium (PCS)10.1109/PCS.2013.6737775(430-433)Online publication date: Dec-2013
https://doi.org/10.1109/PCS.2013.6737775

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten