Abstract
Delta encoding and shared dictionary compression (SDC) for accelerating Web content have been studied extensively in research over the last two decades, but have only found limited adoption in the industry so far: Compression approaches that use a custom-tailored dictionary per website have all failed in practice due to lacking browser support and high overall complexity. General-purpose SDC approaches such as Brotli reduce complexity by shipping the same dictionary for all use cases, while most delta encoding approaches just consider similarities between versions of the same entity (but not between different entities). In this study, we investigate how much of the potential benefits of SDC and delta encoding are left on the table by these two simplifications. As our first contribution, we describe the idea of cross-entity delta encoding that uses cached assets from the immediate browser history for content encoding instead of a precompiled shared dictionary: This avoids the need to create a custom dictionary, but enables highly customized and efficient compression. Second, we present an experimental evaluation of compression efficiency to hold cross-entity delta encoding against state-of-the-art Web compression algorithms. We consciously compare algorithms some of which are not yet available in browsers to understand their potential value before investing resources to build them. Our results indicate that cross-entity delta encoding is over 50% more efficient for text-based resources than compression industry standards. We hope our findings motivate further research and development on this topic.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
Still, the best dictionary was almost every time of the same subtype, e.g. text/html.
- 4.
Due to limited space, we only present a few selected alternatives here and refer to https://icwe.compaz.info for an extensive overview.
References
Alakuijala, J., et al.: Brotli: a general-purpose data compressor. ACM TOI 37(1), 1–30 (2019)
Chan, M.C., Woo, T.: Cache-based compaction: a new technique for optimizing web transfer. In: IEEE INFOCOM 1999. Conference on Computer Communications (1999)
Knecht, D.O., Graham-Cumming, J., Prince, M.B.: Method and apparatus for reducing network resource transmission size using delta compression (2019)
Korn, D.G., Vo, K.P.: Engineering a differencing and compression data format. In: USENIX Annual Technical Conference, General Track, pp. 219–228 (2002)
McQuade, B., Mixter, K., Lee, W.H., Butler, J.: A proposal for shared dictionary compression over HTTP (2016)
Mogul, J.C., Douglis, F., Feldmann, A., Krishnamurthy, B.: Potential benefits of delta encoding and data compression for HTTP. SIGCOMM CCR 27, 181–194 (1997)
Shapira, O.: SDCH at LinkedIn (2015). https://engineering.linkedin.com/shared-dictionary-compression-http-linkedin. Accessed 20 Jan 2022
Wingerath, W., et al.: Speed Kit: a polyglot & GDPR-compliant approach for caching personalized content. In: ICDE, Dallas, Texas (2020)
Wollmer, B., Wingerath, W., Ferrlein, S., Gessert, F., Ritter, N.: Compaz: exploring the potentials of shared dictionary compression on the web. In: 22th International Conference on Web Engineering, ICWE (2022)
Wollmer, B., Wingerath, W., Ritter, N.: Context-aware encoding & delivery in the web. In: 20th International Conference on Web Engineering, ICWE (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Wollmer, B., Wingerath, W., Ferrlein, S., Panse, F., Gessert, F., Ritter, N. (2022). The Case for Cross-Entity Delta Encoding in Web Compression. In: Di Noia, T., Ko, IY., Schedl, M., Ardito, C. (eds) Web Engineering. ICWE 2022. Lecture Notes in Computer Science, vol 13362. Springer, Cham. https://doi.org/10.1007/978-3-031-09917-5_12
Download citation
DOI: https://doi.org/10.1007/978-3-031-09917-5_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-09916-8
Online ISBN: 978-3-031-09917-5
eBook Packages: Computer ScienceComputer Science (R0)