Article
DOI: 10.1145/1007912.1007928

Parallelism versus memory allocation in pipelined router forwarding engines

Published: 27 June 2004

Abstract

A crucial problem in the design of pipelined forwarding engines is the allocation of memory to the processors in the pipeline. Ideally, the processor memories should be completely separate (i.e., one-port memories) in order to minimize contention; however, this also minimizes memory sharing. At the other extreme, a single memory shared by all processors gives ideal sharing but maximizes contention. In this paper we show that the sharing achieved by a single shared memory can instead be obtained with a collection of two-port memories, as long as the number of processors is less than the number of memories. We show that the allocation problem is NP-complete in general, but that it has a fast approximation algorithm that comes within a factor of 3/2 of optimal. The proof uses a new bin packing model, which is interesting in its own right. Further, for important special cases that arise in practice, the approximation algorithm is in fact optimal. We also describe an incremental memory allocation algorithm that provides good memory utilization while allowing fast updates.
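The trade-off between contention and sharing can be made concrete with a small feasibility check. The sketch below is a toy model and is not taken from the paper: it assumes equal-capacity memories, integer demands, and that a "two-port" memory may hold data for at most two processors. The function name `check_two_port_allocation` and the example numbers are hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical toy model (an assumption, not the paper's formal model):
# every memory has the same capacity, and a memory with `ports` ports may
# hold data for at most `ports` distinct processors.
Allocation = Dict[Tuple[int, int], int]  # (processor, memory) -> units placed


def check_two_port_allocation(
    demands: List[int],
    num_memories: int,
    capacity: int,
    allocation: Allocation,
    ports: int = 2,
) -> bool:
    """Return True if every demand is met exactly, no memory exceeds its
    capacity, and no memory is shared by more than `ports` processors."""
    placed = [0] * len(demands)
    used = [0] * num_memories
    sharers = [set() for _ in range(num_memories)]
    for (proc, mem), units in allocation.items():
        if units <= 0:
            continue  # ignore empty entries
        placed[proc] += units
        used[mem] += units
        sharers[mem].add(proc)
    return (
        placed == demands
        and all(u <= capacity for u in used)
        and all(len(s) <= ports for s in sharers)
    )


# Three processors, four memories of capacity 3; total demand 12 fills
# all four memories completely.
demands = [3, 5, 4]
allocation = {
    (0, 0): 3,             # processor 0 fits in memory 0
    (1, 1): 3, (1, 2): 2,  # processor 1 spills into memory 2
    (2, 2): 1, (2, 3): 3,  # processor 2 shares memory 2 with processor 1
}
two_port_ok = check_two_port_allocation(demands, 4, 3, allocation)
one_port_ok = check_two_port_allocation(demands, 4, 3, allocation, ports=1)
```

In this example the allocation uses every unit of memory while only memory 2 is shared, and by just two processors. Under the one-port restriction the same allocation is rejected, because memory 2 would need a second port.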

Published In

SPAA '04: Proceedings of the sixteenth annual ACM symposium on Parallelism in algorithms and architectures
June 2004
332 pages
ISBN:1581138407
DOI:10.1145/1007912

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. approximation algorithm
  2. memory allocation

Qualifiers

  • Article

Conference

SPAA04

Acceptance Rates

Overall Acceptance Rate 447 of 1,461 submissions, 31%

Cited By

  • (2009) PLUG. ACM SIGCOMM Computer Communication Review 39:4 (207-218). DOI: 10.1145/1594977.1592593. Online publication date: 16-Aug-2009
  • (2009) PLUG. Proceedings of the ACM SIGCOMM 2009 conference on Data communication (207-218). DOI: 10.1145/1592568.1592593. Online publication date: 16-Aug-2009
  • (2009) High-bandwidth network memory system through virtual pipelines. IEEE/ACM Transactions on Networking (TON) 17:4 (1029-1041). DOI: 10.1109/TNET.2008.2008646. Online publication date: 1-Aug-2009
  • (2008) MPFS: A truly scalable router architecture for next generation Internet. Science in China Series F: Information Sciences 51:11 (1761-1771). DOI: 10.1007/s11432-008-0148-4. Online publication date: 16-Oct-2008
  • (2006) CAMP. Proceedings of the 2006 ACM/IEEE symposium on Architecture for networking and communications systems (51-60). DOI: 10.1145/1185347.1185355. Online publication date: 3-Dec-2006
  • (2006) Virtually Pipelined Network Memory. Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture (197-207). DOI: 10.1109/MICRO.2006.51. Online publication date: 9-Dec-2006
  • (2006) Decision support for packing in warehouses. Proceedings of the 21st international conference on Computer and Information Sciences (115-124). DOI: 10.1007/11902140_14. Online publication date: 1-Nov-2006