ABSTRACT
This paper summarizes our work on optimization of video content compression in the context of the end user. I will briefly explain motivation and novelty of the work that presents the bulk of my dissertation research. Preliminary experiments with promising results are encouraging further steps that should lead to completion of a model for video compression that uses most of available bits for the video content that is actually seen. By this we mean the content that user attends to. Everything else is coded with much less bits, leading to significant savings compared to state-of-the-art coding techniques. Our work should be regarded as extension and not replacement of the hybrid coding paradigm.
- Cisco, Inc. 2012. Visual Networking Index Services Adoption (VNI SA) Forecast, 2011--2016. Whitepaper.Google Scholar
- Bin, L., Sullivan, G. J. and Xu J. 2012. Compression Performance of High Efficiency Video Coding (HEVC) Working Draft 4. IEEE International Symposium on Circuits and Systems (ISCAS).Google Scholar
- Miller, G.A. 1956. The magical number seven, plus or minus two: some limits on our capacity for processing information. Psychological Review, Vol 63(2), 81--97.Google ScholarCross Ref
- Sherrington, C.S. 1897. On the reciprocal action in the retina as studied by means of some rotating discs. J Physiol. 21, 33--54.Google ScholarCross Ref
- McDougall, W. 1904. The sensations excited by a single momentary stimulation of the eye. Brit J Psychol 1, 78--113.Google Scholar
- Girod, B. 1989. The information theoretical significance of spatial and temporal masking in video signals. Proc. SPIE Human Vision, Visual Processing and Digital Display, vol. 1077, pp. 178--187.Google ScholarCross Ref
- Tam, W. J., Stelmach, L. B., Wang, L., Lauzon, D. and Gray, P. 1995. Visual masking at video scene cuts. Proc. SPIE Human Vision, Visual Processing and Digital Display, vol. 2411, pp. 111--119.Google Scholar
- Hu Q., Klein S.A. and Carney T. 1993. Masking of high-spatial-frequency information after a scene cut. Society for Informational Display 93 Digest. 24:521--523.Google Scholar
- Pastrana-Vidal R.R., Gicquel, J.-C., Colomes, C. and H. Cherifi. 2004. Temporal Masking Effect on Dropped Frames at Video Scene Cuts. Proc. SPIE Human Vision and Electronic Imaging IX, vol. 5292, pp. 194--201..Google Scholar
- Quan H.-T., Ghanbari, M. 2008. Asymmetrical temporal masking near video scene change. 15th IEEE International Conference on Image Processing, vol., no., pp.2568--2571.Google Scholar
- Itti, L. and Baldi, P. 2005. A principled approach to detecting surprising events in video. Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition. Google ScholarDigital Library
- Ha, H., Park, J., Lee S. and Bovik, A.C. 2011. Perceptually Scalable Extension of H.264. IEEE Transactions on Circuits and Systems for Video Technology, vol.21, no.11, pp.1667--1678. Google ScholarDigital Library
- Rensink, R. A. 2002. A model of saliency-based visual attention for rapid science analysis. ACM Proc. 2nd Int. Symp. Smart Graphics, New York, pp. 63--70.Google Scholar
- Lee, J.-B. and Eleftheriadis, A. 2005. Spatio-temporal model-assisted very low-bit-rate coding with compatibility. IEEE Transactions on Circuits and Systems for Video Technology, vol.15, no.12, pp. 1517- 1532. Google ScholarDigital Library
- Rensink R. A., O' Regan J. K. and Clark, J. J. 1997. To see or not to see: The need for attention to perceive changes in scenes. Psychol. Sci., vol. 8, pp. 368--373.Google ScholarCross Ref
- Bridgeman, G., Hendry, D. and Stark, L. 1975. Failure to detect displacement of visual world during saccadic eye movements Vision Research, 15, 719--722.Google Scholar
- Mizukoshi, K., Fabian, P. and Stahle, J. 1977. Optokinetic Test Comprising Both Acceleration and Constant Velocity Stimulation. Acta Oto-laryng., Vol. 84, No. 1--6., p155--165Google ScholarCross Ref
- Lindemann, L., Wenger, S. and Magnor, M. 2012. Evaluation of video artifact perception using event-related potentials. Proceedings of the ACM SIGGRAPH Symposium on Applied Perception in Graphics and Visualization (APGV '11), New York, NY, USA, 53--58. Google ScholarDigital Library
- Scholler, S., Bosse, S., Treder, M.S., Blankertz, B., Curio, G., Muller, K.-R. and Wiegand, T. 2012. Toward a Direct Measure of Video Quality Perception Using EEG. IEEE Transactions on Image Processing , vol.21, n.5, p.2619--262Google ScholarDigital Library
Index Terms
- What you see is what you should get
Recommendations
Performance analysis of hybrid coders in multi-constraints pruned environment
AbstractAdvance Video Coder (H.264/AVC) and High-Efficiency Video (H.265/HEVC) coders are fast developing video compression standards, provides high compression and quality of service as compared to previously established standards. The present work ...
A Very Low Bit Rate Video Coding Combined with Fast Adaptive Block Size Motion Estimation and Nonuniform Scalar Quantization Multiwavelet Transform
We describe a very low bit rate video coding framework in which motion correlation between successive video frames is exploited in the multiwavelet transform domain. Some complicated techniques, such as spatial prediction in intra coding, adaptive block ...
Bit-stream allocation methods for wavelet based scalable video coding
MobiMedia '06: Proceedings of the 2nd international conference on Mobile multimedia communicationsIn wavelet based scalable video coding quality scalability is achieved using bit-plane coding techniques that result in embedded bit-stream. In order to minimise distortion when specific bit-rate is targeted, compressed data has to be optimally ...
Comments