Characterization and Evaluation of Cache Hierarchies for Web Servers

Iyer, Ravi

doi:10.1023/B:WWWJ.0000028180.97418.53

Characterization and Evaluation of Cache Hierarchies for Web Servers

Published: September 2004

Volume 7, pages 259–280, (2004)
Cite this article

World Wide Web Aims and scope Submit manuscript

Ravi Iyer¹

88 Accesses
4 Citations
Explore all metrics

Abstract

As Internet usage continues to expand rapidly, careful attention needs to be paid to the design of Internet servers for achieving high performance and end-user satisfaction. Currently, the memory system continues to remain a significant performance bottleneck for Internet servers employing multi-GHz processors. In this paper, our aim is two-fold: (1) to characterize the cache/memory performance of web server workloads and (2) to propose and evaluate cache design alternatives for future web servers. We chose SPECweb99 as the representative web server workload and our entire characterization and evaluation methodology is based on our CASPER simulation framework. We begin by exploring the processor cache design space for single and dual-processor servers. Based on our observations, we then evaluate other cache hierarchy alternatives such as chipset caches, coherence filters and decompressed page stores. We show the sensitivity of these components to basic organization parameters such as cache size, line size and degree of associativity. We also present the performance implications of routing memory requests initiated by I/O devices through these caches. Based on detailed simulation data and its implications on system level performance, this paper shows that chipset caches have significant potential for improving future web server performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

G. Astfalk and T. Brewer, “An overview of the HP/convex exemplar hardware,” http://www. convex.com/tech_cache/ps/hw_ov.ps, accessed in December 1997.
J. L. Baer and W. H. Wang, “On the inclusion properties for multi-level cache hierarchies,” in 15th Annual Symposium on Computer Architecture, Silver Springs, 1988.
F. Dahlgren and P. Stenstrom, “Effectiveness of hardware-based stride and sequential prefetching in shared-memory multiprocessors,” in 1st Symposium on High Performance Comput. Architecture, January 1994.
Y. Hu, A. Nand et al. “Measurement, analysis and performance improvement of the Apache web server,” in IEEE International Performance, Computing & Communications Conference, February 1999.
C. Huitema, “Network vs. server issues in end-to-end performance,” in Workshop on Performance and Architecture on Web Servers (PAWS), June 2000.
R. Iyer, “Exploring the cache design space for web servers,” in Proceedings of the 15th International Parallel and Distributed Processing Symposium (IPDPS’01), May 2001, invited paper.
R. Iyer, “CASPER: Cache architecture simulation and performance exploration using refstreams,” in Proceedings of the Intel Design and Test Technology Conference (DTTC), July 2002.
D. Joseph and D. Grunwald, “Prefetching using Markov predictors,” in Proceedings of the 24th International Symposium on Computer Architecture (ISCA), 1997.
N. Jouppi, “Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers,” in Proceedings of the 17th Annual International Symposium on Computer Architecture, 1990, pp. 364–373.
K. Kant, R. Iyer, and P. Mohapatra, “Architectural impact of secure socket layer in Internet servers,” in Proceedings of the International Conference on Computer Design, 2000.
K. Kant and C. Sundaram, “A server performance model for static web workloads,” in Proceedings of the International Symposium on Performance Analysis of Systems and Software (ISPASS 2000), April 2000.
K. Kant and Y. Won, “Performance impact of uncached file accesses in SPECweb99,” in Second Annual Workshop on Workload Characterization (WWC), October 1999.
D. Lenoski, J. Laudon, K. Gharachorloo, W. Weber, A. Gupta et al., “The Stanford DASH multiprocessor,” Computer 25(3) (1992) 63–79.
Article Google Scholar
W. F. Lin, S. K. Reinhardt, and D. C. Burger, “Designing a modern memory hierarchy with hardware prefetching,” IEEE Transactions on Computers 50(11), Special Issue onMemory Systems, November 2001.
T. Lovett and R. Clapp, “STiNG: A CC-NUMA computer system for the commercial marketplace,” in Proceedings of the 23rd International Symposium on Computer Architecture, May 1996, pp. 308–317.
M. Michael and A. K. Nanda, “Design and performance of directory caches for scalable shared memory multiprocessors,” in Proceedings of the 5th International Symposium on High Performance Comput. Architecture, January 1999.
A. Moga and M. Dubois, “The effectiveness of SRAM network caches on clustered DSMs,” in Proceedings of the Fourth International Symposium on High Performance Computer Architecture, February 1998, pp. 103–112.
P. Mohapatra, H. Thanthry, and K. Kant, “Characterization of bus transactions for SPECweb96 benchmark,” in 2nd Workshop on Workload Characterization (WWC), October 1999.
B. Nayfeh, K. Olukotun et al., “The impact of shared cache clustering in small-scale shared memory multiprocessors,” in Proceedings of the International Conference on High Performance Computer Architecture (HPCA-1), February 1996.
M. Pirvu and L. Bhuyan, “Hardware spatial forwarding for widely shared data,” in Proceedings of 14th International Conference on Supercomputing (ICS), May 2000.
R. Radhakrishnan and L. John, “A performance study of modern Web applications,” in Proceedings of the EuroPar’99.
“SPECweb99 Design Document,” available online on the SPEC website at http://www.specbench. org/osg/web99/docs/whitepaper.html
“TPC-C Design Document,” available online on the TPC website at www.tpc.org/tpcc/
“TPC-W Design Document,” available online on the TPC website at www.tpc.org/tpcw/
R. B. Tremaine, P. A. Franaszek, J. T. Robinson, C. O. Schulz, T. B. Smith, M. Wazlowski, and P. M. Bland, “IBM memory expansion technology (MXT),” IBM Journal of Research and Development 45(2) (2001) 271–285.
Google Scholar
D. M. Tullsen and S. J. Eggers, “Limitations of cache prefetching on a bus-based multiprocessor,” in Proceedings of the 20th Annual International Symposium on Computer Architecture, 1993, pp. 278–288.
U. Vallamsetty, P. Mohapatra, R. Iyer, and K. Kant, “Improving the cache performance of network intensive workloads,” in Proceedings of the International Conference on Parallel Processing, 2001.
WebBench, PC Magazine benchmarks, http://www.veritest.com/benchmarks/webbench/webbench.asp
WebStone, Mindcraft benchmarks, http://www.mindcraft.com/webstone/
S. C. Woo, M. Ohara, E. Torrie, J. P. Singh, and A. Gupta, “The SPLASH-2 programs: characterization and methodological considerations,” in Proceedings of the 22nd International Symposium on Computer Architecture, June 1995, pp. 24–36.
Z. Zhang and J. Torellas, “Reducing remote conflict misses: NUMA with remote cache versus COMA,” in Proceedings of the Third International Symposium on High Performance Computer Architecture, January 1997, pp. 272–281.

Download references

Author information

Authors and Affiliations

Intel Corporation, 2111 NE 25th Avenue, M/S JF3-232, Hillsboro, OR, 97124, USA
Ravi Iyer

Authors

Ravi Iyer
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Iyer, R. Characterization and Evaluation of Cache Hierarchies for Web Servers. World Wide Web 7, 259–280 (2004). https://doi.org/10.1023/B:WWWJ.0000028180.97418.53

Download citation

Issue Date: September 2004
DOI: https://doi.org/10.1023/B:WWWJ.0000028180.97418.53

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Characterization and Evaluation of Cache Hierarchies for Web Servers

Abstract

Access this article

Similar content being viewed by others

Efficient Characterization of Hidden Processor Memory Hierarchies

Adapting Memory Hierarchies for Emerging Datacenter Interconnects

Toward an Understanding of the Tradeoffs of Adopting the MEAN Web Server Stack

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

Characterization and Evaluation of Cache Hierarchies for Web Servers

Abstract

Access this article

Similar content being viewed by others

Efficient Characterization of Hidden Processor Memory Hierarchies

Adapting Memory Hierarchies for Emerging Datacenter Interconnects

Toward an Understanding of the Tradeoffs of Adopting the MEAN Web Server Stack

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation