DOI: 10.1145/3603269.3604819
research-article

ZGaming: Zero-Latency 3D Cloud Gaming by Image Prediction

Published: 01 September 2023

ABSTRACT

In cloud gaming, interactive latency is one of the most important factors in the user experience. Although it can be reduced through typical network techniques such as edge caching and congestion control, the interactive latency of current cloud-gaming platforms still falls far short of what users expect.

This paper presents ZGaming, a novel 3D cloud gaming system based on image prediction that aims to eliminate the interactive latency of traditional cloud gaming systems. To improve the quality of the predicted images, we propose (1) a quality-driven 3D-block cache to reduce "hole" artifacts, (2) a server-assisted LSTM-predicting algorithm to improve the prediction accuracy for dynamic foreground objects, and (3) a prediction-performance-driven adaptive bitrate strategy that optimizes the quality of predicted images. Experiments under real-world cloud gaming network conditions show that, compared with existing methods, ZGaming reduces the interactive latency from 23 ms to 0 ms while providing the same video quality, or improves the video quality by 5.4 dB while keeping the interactive latency at 0 ms.
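The abstract does not describe how "hole" artifacts arise. As background only, the following is a minimal sketch that assumes the predicted image is produced by warping a previously rendered frame using per-pixel depth (depth-image-based rendering). The function name `forward_warp_row`, the one-dimensional scanline, and the `shift / depth` disparity model are illustrative assumptions, not ZGaming's implementation.

```python
def forward_warp_row(colors, depths, shift):
    """Warp one scanline to a horizontally shifted viewpoint.

    Each pixel moves by round(shift / depth) columns, so near pixels move
    further than far ones. Returns the warped row; None marks a "hole",
    i.e. a target pixel that no source pixel landed on.
    """
    width = len(colors)
    warped = [None] * width
    nearest = [float("inf")] * width  # depth of the pixel currently stored per target
    for x, (color, depth) in enumerate(zip(colors, depths)):
        target = x + round(shift / depth)
        # Keep the closest surface when several source pixels land on one target.
        if 0 <= target < width and depth < nearest[target]:
            warped[target] = color
            nearest[target] = depth
    return warped


# Hypothetical scene: background "B" at depth 4, foreground object "F" at depth 1.
colors = ["B", "B", "F", "F", "B", "B", "B", "B"]
depths = [4, 4, 1, 1, 4, 4, 4, 4]

warped = forward_warp_row(colors, depths, shift=4)
holes = [x for x, c in enumerate(warped) if c is None]
print(warped)  # [None, 'B', 'B', None, None, 'B', 'F', 'F']
print(holes)   # [0, 3, 4]
```

In this toy scene, columns 3 and 4 become holes because the foreground object moved away and the background behind it was never rendered, while column 0 is empty because it lies outside the original view. A cache of previously seen scene content, such as the 3D-block cache named in the abstract, is one way such gaps could be filled.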

    • Published in

      ACM SIGCOMM '23: Proceedings of the ACM SIGCOMM 2023 Conference
      September 2023
      1217 pages
      ISBN:9798400702365
      DOI:10.1145/3603269

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States


      Acceptance Rates

      Overall Acceptance Rate: 554 of 3,547 submissions, 16%
