ABSTRACT
TCP Splicing can be used in content-aware switches to tremendously reduce overall request latency. In order to reduce the processing latency further, we propose to offload the protocol processing onto network processors (NPs). An NP consists of a multithreaded multiprocessor architecture that can provide high throughput for packet processing or forwarding. However, offloading any protocol software to an NP needs to be carefully designed due to its low-level programming and limited control memory size.In this paper, we first analyze the operation of TCP Splicing in detail and evaluate its performance through measurements on a Linux-based switch. Then various possibilities of workload allocation among different computation resources in an NP are presented, and the design tradeoffs are discussed. A content aware switch is implemented using IXP 2400 NP and evaluated for performance comparison. The measurement results demonstrate that our NP-based switch can reduce the http processing latency by an average of 83.3% for a 1K byte web page. The amount of reduction increases with larger file sizes. It is also shown that the packet throughput can be improved by up to 5.7x across a range of files by taking advantage of multithreading and multiprocessing, available in the NP.
- Apache Software Foundation, http://www.apache.orgGoogle Scholar
- G. Apostolopoulos, D. Aubespin V. Peris, P. Pradhan, D. Saha Design, Implementation and Performance of a Content-Based Switch proceedings of IEEE INFOCOM-2000Google Scholar
- Cisco Systems, Cisco Content Services Switch, http://www.cisco.com/en/US/products/hw/contnetw/ps789/prod_models_home.htmlGoogle Scholar
- A. Cohe, S. Rangarajan, H. Slye, On the Performance of TCP Splicing for URL-Aware Redirection. In Proceedings of the 2nd USENIX Symposium on Internet Technologies and Systems, Boulder, CO, Oct. 1999 Google ScholarDigital Library
- Erik J. Johnson and Aaron R. Kunze, IXP 1200 Programming The Microengine Coding Guide for the Intel IXP2400 network Processor Family, Intel Press Google ScholarDigital Library
- Foundry Systems, Foundry ServerIron XL/G, http://www.b2net.co.uk/foundry/foundry_serveriron_xlg_web_switch.htmGoogle Scholar
- Tom Halfhill, Intel Network Processor Targets Routers, Microprocessor Report, September 1999Google Scholar
- Linux Virtual Server Project, http://www.linuxvirtualserver.orgGoogle Scholar
- David Mosberger and Tai Jin, HP Research Labs A Tool for Measuring Web Server Performance, 1998Google Scholar
- David A. Maltz, Pravin Bhagwat, TCP Splicing for Application Layer Proxy Performance, IBM Research Report RC 21139, 1998 Google ScholarDigital Library
- Netfilter, http://www.netfilter.orgGoogle Scholar
- Nortel Networks, Alteon Web Switches, http://www.nortelnetworks.com/products/01/alteon/webswitch/index.htmlGoogle Scholar
- V.S. Pai, M. Aron, G. Banga, M. Svendsen, P. Druschel, W. Zwaenepoel, E. Nahum, Locality-Aware Request Distribution in Cluster-based Network Servers. In Proceedings of the 8th Conference on Architectural Support for Programming Languages and Operating Systems, San Jose, CA, Oct. 1998 Google ScholarDigital Library
- Athanasios E. Papathanasiou, Eric Van Hensbergen, KNITS: Switch-based Connection Hand-off, Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies. Proceedings. IEEE, Volume: 1, 2002Google Scholar
- RFC1624: Computation of the Internet Checksum via Incremental Update, May 1994Google Scholar
- Marcel-Catalin Rosu, Daniela Rosu, Kernel Support for Faster Web Proxies, USENIX Annual Technical Conference, June 2003Google Scholar
- Tammo Spalink, Scott Karlin, Larry Peterson, Yitzchak Gottlieb, Building a Robust Software-Based Router Using Network Processors, Proceedings of the eighteenth ACM symposium on Operating systems principles, pages 216 -- 229, 2001 Google ScholarDigital Library
- Oliver Spatscheck, et al., Optimizing TCP Forwarder Performance, IEEE/ACM Transactions on Networking, 2000 Google ScholarDigital Library
- The Linux Kernel Archives, http://www.kernel.orgGoogle Scholar
- Chu-Sing Yang and Mon-Yen Luo, Efficient Support for Content-Based Routing in Web Server Clusters. In Proceedings of the 2nd USENIX Symposium on Internet Technologies and Systems, Boulder, CO, October 1999 Google ScholarDigital Library
- Li Zhao, Yan Luo, Laxmi Bhuyan and Ravi Iyer, Design and Implementation of A Content-aware Switch using A Network Processor. In Proceedings of the 13th IEEE Symposium on High Performance Interconnects, Stanford University, CA, August 2005 Google ScholarDigital Library
Index Terms
- SpliceNP: a TCP splicer using a network processor
Recommendations
Flow Migration on Multicore Network Processors: Load Balancing While Minimizing Packet Reordering
ICPP '13: Proceedings of the 2013 42nd International Conference on Parallel ProcessingWith ever increasing network traffic rates, multicore architectures for network processors have successfully provided performance improvements through high parallelism. However, naively allocating the network traffic to multiple cores without ...
C Compiler Design for an Industrial Network Processor
OM '01: Proceedings of the 2001 ACM SIGPLAN workshop on Optimization of middleware and distributed systemsOne important problem in code generation for embedded processors is the design of efficient compilers for ASIPs with application specific architectures. This paper outlines the design of a C compiler for an industrial ASIP for telecom applications. The ...
C Compiler Design for an Industrial Network Processor
One important problem in code generation for embedded processors is the design of efficient compilers for ASIPs with application specific architectures. This paper outlines the design of a C compiler for an industrial ASIP for telecom applications. The ...
Comments