5 Conclusion
In this paper, we study the long warm-up time of GPU acceleration of DNN inference in Web browsers. We analyzed the reason behind the long warm-up time through a measurement study and revealed that compiling WebGL programs takes most of the warm-up time. Inspired by this finding, we proposed WPIA, an approach that suggests precompiling WebGL programs on the server side to avoid compiling them in Web browsers. WPIA tackles the challenges of precompiling by merging WebGL programs and using a record-and-replay technique. Evaluation experiment results show that WPIA can accelerate the DNN warm-up time to an order of magnitude.
References
Google. Background features in google meet, powered by Web ML. See Research. https://www.google/blog/background-features-in-google-meet-powered-by website, 2020
Smilkov D, Thorat N, Assogba Y, Yuan A, Kreeger N, Yu P, Zhang K, Cai S, Nielsen E, Soergel D, Bileschi S, Terry M, Nicholson C, Gupta S N, Sirajuddin S, Sculley D, Monga R, Corrado G, Viégas F B, Wattenberg M. TensorFlow.js: machine learning for the Web and beyond. In: Proceedings of Machine Learning and Systems 2019. 2019
Ma Y, Xiang D, Zheng S, Tian D, Liu X. Moving deep learning into Web browser: how far can we go? In: Proceedings of the World Wide Web Conference. 2019, 1234–1244
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Grant No. 62102009) and the Beijing Outstanding Young Scientist Program (No. BJJWZYJH01201910001004).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests The authors declare that they have no competing interests or financial conflicts to disclose.
Additional information
Electronic supplementary material Supplementary material is available in the online version of this article at journal.hep.com.cn and link.springer.com.
Electronic Supplementary Material
Rights and permissions
About this article
Cite this article
Tian, D., Ma, Y., Han, Y. et al. WPIA: accelerating DNN warm-up in Web browsers by precompiling WebGL programs. Front. Comput. Sci. 18, 186211 (2024). https://doi.org/10.1007/s11704-024-40066-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11704-024-40066-w