Seamless Integration of Parallelism and Memory Hierarchy

Fantozzi, Carlo; Pietracaprina, Andrea; Pucci, Geppino

doi:10.1007/3-540-45465-9_73

Carlo Fantozzi⁷,
Andrea Pietracaprina⁷ &
Geppino Pucci⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2380))

Included in the following conference series:

International Colloquium on Automata, Languages, and Programming

2309 Accesses

Abstract

We prove an analogue of Brent’s lemma for BSP-like parallel machines featuring a hierarchical structure for both the interconnection and the memory. Specifically, for these machines we present a uniform scheme to simulate any computation designed for v processors on a v′-processor configuration with v′ ≤ v and the same overall memory size. For a wide class of computations the simulation exhibits optimal O (v/v′) slowdown. The simulation strategy aims at translating communication locality into temporal locality. As an important special case (v′= 1), our simulation can be employed to obtain efficient hierarchy-conscious sequential algorithms from efficient fine-grained ones.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Enumerated BSP Automata

Automatic Cost Analysis for Imperative BSP Programs

Article 21 February 2018

Multi-ML: Programming Multi-BSP Algorithms in ML

Article 02 April 2016

References

Valiant, L.G.: A bridging model for parallel computation. Communications of the ACM 33 (1990)103–111
Article Google Scholar
Dehne, F., Fabri, A., Rau-Chaplin, A.: Scalable parallel geometric algorithms for coarse grained multicomputers. International Journal on Computational Geometry 6 (1996) 379–400
Article MATH MathSciNet Google Scholar
Brent, R.P.: The parallel evaluation of general arithmetic expressions. Journal of the ACM 21 (1974)201–206
Article MATH MathSciNet Google Scholar
Vitter, J.S., Shriver, E.A.M.: Algorithms for parallel memory II: Hierarchical multilevel memories. Algorithmica 12 (1994) 148–169
Article MATH MathSciNet Google Scholar
Aggarwal, A., Alpern, B., Chandra, A.K., Snir, M.: A model for hierarchical memory. In: Proc. of the 19th ACM STOC. (1987) 305–314
Google Scholar
Alpern, B., Carter, L., Ferrante, J.: Modeling parallel computers as memory hierarchies. In Programming Models for Massively Parallel Computers. IEEE Computer Society Press (1993)116–123
Google Scholar
Hey wood, T., Ranka, S.: A practical hierarchical model of parallel computation. I. the model. Journal of Parallel and Distributed Computing 16 (1992) 212–232
Article MathSciNet Google Scholar
Bilardi, G., Preparata, F.P.: Processor-time tradeoffs under bounded-speed message propagation: Part I, upper bounds. Theory of Computing Systems 30 (1997) 523–546
Article MATH MathSciNet Google Scholar
Bilardi, G., Preparata, F.P.: Processor-time tradeoffs under bounded-speed message propagation: Part II, lower bounds. Theory of Computing Systems 32 (1999) 531–559
Article MATH MathSciNet Google Scholar
Dehne, F., Dittrich, W., Hutchinson, D.: Efficient external memory algorithms by simulating coarse-grained parallel algorithms. In: Proc. of the 9th ACM SPAA. (1997) 106–115
Google Scholar
Dehne, F., Dittrich, W., Hutchinson, D., Maheshwari, A.: Reducing I/O complexity by simulating coarse grained parallel algorithms. In: Proc. of the 13th IPPS. (1999) 14–20
Google Scholar
Sibeyn, J., Kaufmann, M.: BSP-like external-memory computation. In: Proc. of the 3rd Italian Conference on Algorithms and Complexity. LNCS 1203 (1997) 229–240
Google Scholar
De la Torre, P., Kruskal, C.P.: Submachine locality in the bulk synchronous setting. In: Proc. of EUROPAR 96. LNCS 1124 (1996) 352–358
Chapter Google Scholar
Bilardi, G., Peserico, E.: A characterization of temporal locality and its portability across memory hierarchies. In: Proc. of ICALP 2001. LNCS 2076 (2001) 128–139
Google Scholar
Bilardi, G., Pietracaprina, A., Pucci, G.: A quantitative measure of portability with application to bandwidth-latency models for parallel computing. In: Proc. of EUROPAR 99. LNCS 1685 (1999)543–551
Google Scholar
Fantozzi, C., Pietracaprina, A., Pucci, G.: Implementing shared memory on clustered machines. In: Proc. of IPDPS 2001. (2001)
Google Scholar
Bilardi, G., Fantozzi, C., Pietracaprina, A., Pucci, G.: On the effectiveness of D-BSP as a bridging model of parallel computation. In: Proc. of ICCS 2001. LNCS 2074 (2001) 579–588
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Elettronica e Informatica, Università di Padova, Via Gradenigo 6/A, I-35131, Padova, Italy
Carlo Fantozzi, Andrea Pietracaprina & Geppino Pucci

Authors

Carlo Fantozzi
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Pietracaprina
View author publications
You can also search for this author in PubMed Google Scholar
Geppino Pucci
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Theoretical Computer Science, ETH Zentrum, ETH Zürich, 8092, Zürich, Switzerland
Peter Widmayer & Stephan Eidenbenz &
Department of Languages and Sciences of the Computation E.T.S. de Ingeniería Informática, University of Málaga, Campus de Teatinos, 29071, Málaga, Spain
Francisco Triguero , Rafael Morales & Ricardo Conejo , &
School of Cognitive and Computing Sciences, University of Sussex, Falmer, Brighton, BN1 9QN, UK
Matthew Hennessy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fantozzi, C., Pietracaprina, A., Pucci, G. (2002). Seamless Integration of Parallelism and Memory Hierarchy. In: Widmayer, P., Eidenbenz, S., Triguero, F., Morales, R., Conejo, R., Hennessy, M. (eds) Automata, Languages and Programming. ICALP 2002. Lecture Notes in Computer Science, vol 2380. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45465-9_73

Download citation

DOI: https://doi.org/10.1007/3-540-45465-9_73
Published: 25 June 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43864-9
Online ISBN: 978-3-540-45465-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Seamless Integration of Parallelism and Memory Hierarchy

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Enumerated BSP Automata

Automatic Cost Analysis for Imperative BSP Programs

Multi-ML: Programming Multi-BSP Algorithms in ML

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Seamless Integration of Parallelism and Memory Hierarchy

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Enumerated BSP Automata

Automatic Cost Analysis for Imperative BSP Programs

Multi-ML: Programming Multi-BSP Algorithms in ML

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation