skip to main content
10.1145/1943552.1943581acmconferencesArticle/Chapter ViewAbstractPublication PagesmmsysConference Proceedingsconference-collections
research-article

Adaptive encoding of zoomable video streams based on user access pattern

Published: 23 February 2011 Publication History

Abstract

Zoomable video allows users to selectively zoom and pan into regions of interest within the video for viewing at higher resolutions. Such interaction requires dynamic cropping of RoIs on the source video. In this paper, we consider how the bandwidth needed to transmit the RoIs can be reduced by carefully encoding the source video. The key idea is to exploit user access patterns to the RoIs, and encode different regions of the video with different encoding parameters based on the popularity of the region. We show that our encoding method can reduce the expected bandwidth by upto 27%.

Supplementary Material

MP4 File (110225_26192_03_acm.mp4)

References

[1]
http://media.xiph.org/video/derf/.
[2]
http://www.cdvl.org/.
[3]
A. Carlier, G. Ravindra, and W. T. Ooi. Towards characterizing users' interaction with zoomable video. In Proc. of International Workshop on Social, Adaptive, and Personalized Mutlimedia Interaction and Access (SAPMIA 2010), Florence, Italy, October 2010.
[4]
L.-Q. Chen, X. Xie, X. Fan, W.-Y. Ma, H.-J. Zhang, and H.-Q. Zhou. A visual attention model for adapting images on small displays. Multimedia Systems, 9(4):353--364, 2003.
[5]
W. Feng, T. Dang, J. Kassebaum, and T. Bauman. Supporting region-of-interest cropping through constrained compression. In Proc. of ACM MM'08, pages 745--748, Vancouver, British Columbia, Canada, 2008.
[6]
S. Heymann, A. Smolic, K. Mueller, Y. Guo, J. Rurainsky, P. Eisert, and T.Wiegand. Representation, coding and interactive rendering of high-resolution panoramic images and video using MPEG-4. In Proc. Panoramic Photogrammetry Work-shop PPW'05, Feb 2005.
[7]
C.-M. Huang and C.-W. Lin. Multiple-priority region-of-interest H.264 video compression using constraint variable bitrate control for video surveillance. Optical Engineering, 48(4):047004, 2009.
[8]
M. Karczewicz and R. Kurceren. The SP- and SI-frames design for H.264/AVC. IEEE Transactions on Circuits and Systems for Video Technology, 13(7):637--644, 2003.
[9]
W. Lai, X.-D. Gu, R.-H. Wang, W.-Y. Ma, and H.-J. Zhang. A content-based bit allocation model for video streaming. In Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on, volume 2, pages 1315--1318, jun 2004.
[10]
H. Liu, X. Xie, W.-Y. Ma, and H.-J. Zhang. Automatic browsing of large pictures on mobile devices. In Proc. of ACM MM'03, pages 148--155, Berkeley, CA, USA, 2003.
[11]
L. C. Loschky and G. S. Wolverton. How late can you update gaze-contingent multiresolutional displays without detection? ACM Trans. Multimedia Comput. Commun. Appl., 3(4):1--10, 2007.
[12]
A. Mavlankar, P. Baccichet, D. Varodayan, and B. Girod. Optimal slice size for streaming regions of high resolution video with virtual pan/tilt/zoom functionality. In Proc. of EUSIPCO'07, 2007.
[13]
A. Mavlankar and B. Girod. Background extraction and long-term memory motion-compensated prediction for spatial-random-access-enabled video coding. In PCS'09: Proceedings of the 27th conference on Picture Coding Symposium, pages 61--64, Piscataway, NJ, USA, 2009. IEEE Press.
[14]
A. Mavlankar, D. Varodayan, and B. Girod. Region-of-Interest prediction for interactively streaming regions of high resolution video. In Proc. of International Packet Video Workshop, PV2007, Lausanne, Switzerland, Nov. 2007.
[15]
N. Quang Minh Khiem, G. Ravindra, A. Carlier, and W. T. Ooi. Supporting zoomable video streams with dynamic region-of-interest cropping. In Proc. of MMSYS '10, Phoenix, Arizona, USA, 2010.
[16]
E. M. Reingold and L. C. Loschky. Reduced saliency of peripheral targets in gaze-contingent multi-resolutional displays: blended versus sharp boundary windows. In ETRA '02: Proceedings of the 2002 symposium on Eye tracking research & applications, pages 89--93, New York, NY, USA, 2002. ACM.
[17]
A. Santella, M. Agrawala, D. DeCarlo, D. Salesin, and M. Cohen. Gaze-based interaction for semi-automatic photo cropping. In Proc. of ACM CHI '06, Montreal, Quebec, Canada, 2006.
[18]
P. Sivanantharasa, W. Fernando, and H. K. Arachchi. Region of interest video coding with flexible macroblock ordering. In Industrial and Information Systems, First International Conference on, pages 596--599, aug. 2006.
[19]
X. Xie, H. Liu, S. Goumaz, and W.-Y. Ma. Learning user interest for image browsing on small-form-factor devices. In Proc. of ACM CHI '05, Portland, Oregon, USA, 2005.
[20]
M. Inoue, H. Kimata, K. Fukazawa, and N. Matsuura. Interactive panoramic video streaming system over restricted bandwidth network. In Proceedings of the international conference on Multimedia, MM '10, pages 1191--1194, New York, NY, USA, 2010. ACM.

Cited By

View all
  • (2024)Deep Viewpoint Prediction in 360º Video with Data Distribution Adaptation2024 16th International Conference on Wireless Communications and Signal Processing (WCSP)10.1109/WCSP62071.2024.10826980(1318-1323)Online publication date: 24-Oct-2024
  • (2024)Networked Metaverse Systems: Foundations, Gaps, Research DirectionsIEEE Open Journal of the Communications Society10.1109/OJCOMS.2024.34260985(5488-5539)Online publication date: 2024
  • (2023)Advanced Predictive Tile Selection Using Dynamic Tiling for Prioritized 360° Video VR StreamingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/360314620:1(1-28)Online publication date: 1-Jun-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MMSys '11: Proceedings of the second annual ACM conference on Multimedia systems
February 2011
294 pages
ISBN:9781450305181
DOI:10.1145/1943552
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 February 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. bandwidth efficient
  2. encoding
  3. monolithic streaming
  4. optimal tiling
  5. region-of-interest streaming
  6. tile streaming
  7. zoomable video

Qualifiers

  • Research-article

Conference

MMSYS '11
Sponsor:
MMSYS '11: MMSYS '11 - Multimedia Systems Conference
February 23 - 25, 2011
CA, San Jose, USA

Acceptance Rates

Overall Acceptance Rate 176 of 530 submissions, 33%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 27 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Deep Viewpoint Prediction in 360º Video with Data Distribution Adaptation2024 16th International Conference on Wireless Communications and Signal Processing (WCSP)10.1109/WCSP62071.2024.10826980(1318-1323)Online publication date: 24-Oct-2024
  • (2024)Networked Metaverse Systems: Foundations, Gaps, Research DirectionsIEEE Open Journal of the Communications Society10.1109/OJCOMS.2024.34260985(5488-5539)Online publication date: 2024
  • (2023)Advanced Predictive Tile Selection Using Dynamic Tiling for Prioritized 360° Video VR StreamingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/360314620:1(1-28)Online publication date: 1-Jun-2023
  • (2020)Optimizing Fixation Prediction Using Recurrent Neural Networks for 360$^{\circ }$ Video Streaming in Head-Mounted Virtual RealityIEEE Transactions on Multimedia10.1109/TMM.2019.293180722:3(744-759)Online publication date: Mar-2020
  • (2019)Panoramic video live broadcasting system based on global distribution2019 Chinese Automation Congress (CAC)10.1109/CAC48633.2019.8996293(63-67)Online publication date: Nov-2019
  • (2018)Adaptive Streaming of HEVC Tiled Videos Using MPEG-DASHIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2017.268849128:8(1981-1992)Online publication date: Aug-2018
  • (2018)ClusTile: Toward Minimizing Bandwidth in 360-degree Video StreamingIEEE INFOCOM 2018 - IEEE Conference on Computer Communications10.1109/INFOCOM.2018.8486282(962-970)Online publication date: Apr-2018
  • (2018)Content adaptive tiling method based on user access preference for streaming panoramic video2018 IEEE International Conference on Consumer Electronics (ICCE)10.1109/ICCE.2018.8326152(1-4)Online publication date: Jan-2018
  • (2017)POI360Proceedings of the 13th International Conference on emerging Networking EXperiments and Technologies10.1145/3143361.3143381(336-349)Online publication date: 28-Nov-2017
  • (2017)Performance measurements of 360° video streaming to head-mounted displays over live 4G cellular networks2017 19th Asia-Pacific Network Operations and Management Symposium (APNOMS)10.1109/APNOMS.2017.8094203(205-210)Online publication date: Sep-2017
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media