ABSTRACT
A crucial problem that needs to be solved is the allocation of memory to processors in a pipeline. Ideally, the processor memories should be totally separate (i.e., one-port memories) in order to minimize contention; however, this precludes memory sharing. Idealized sharing occurs by using a single shared memory for all processors, but this maximizes contention. Instead, in this paper we show that the perfect sharing of a single shared memory can be achieved with a collection of *two*-port memories, as long as the number of processors is less than the number of memories. We show that the allocation problem is NP-complete in general, but admits a fast approximation algorithm that comes within a factor of 3/2 of optimal. The proof utilizes a new bin packing model, which is interesting in its own right. Further, for important special cases that arise in practice the approximation algorithm is in fact optimal. We also describe an incremental memory allocation algorithm that provides good memory utilization while allowing fast updates.
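To make the two-port constraint concrete, here is a minimal sketch (purely illustrative, not the paper's 3/2-approximation algorithm): pipeline stages with unequal memory demands are packed greedily into equal-capacity memories, with at most two stages per memory since a two-port memory can serve at most two processors without contention. The function name `allocate`, the first-fit-decreasing order, and the assumption that a stage is not split across memories are illustrative choices, not taken from the paper.

```python
# Hypothetical greedy sketch: assign pipeline stages to two-port memories.
# Each memory serves at most two stages (one port per stage), and all
# memories have the same capacity. Illustration of the constraint only;
# not the 3/2-approximation algorithm analyzed in the paper.

def allocate(stage_sizes, num_memories, capacity):
    """Return, for each memory, the list of stages placed in it,
    or None if this greedy pass fails to fit every stage."""
    # Place larger stages first (first-fit decreasing, as in bin packing).
    stages = sorted(range(len(stage_sizes)), key=lambda s: -stage_sizes[s])
    memories = [{"free": capacity, "stages": []} for _ in range(num_memories)]
    for s in stages:
        placed = False
        for m in memories:
            # Two-port constraint: at most two stages may share one memory.
            if len(m["stages"]) < 2 and m["free"] >= stage_sizes[s]:
                m["stages"].append(s)
                m["free"] -= stage_sizes[s]
                placed = True
                break
        if not placed:
            return None
    return [m["stages"] for m in memories]

if __name__ == "__main__":
    # Four pipeline stages with unequal memory demands, three memories.
    print(allocate([5, 3, 2, 2], num_memories=3, capacity=6))
```

Because greedy packing can fail even when a feasible assignment exists, a sketch like this only illustrates the sharing constraint; the hardness and approximation results in the paper concern the underlying packing problem itself.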