Impact of Cache Coherence Models on Performance of OpenMP Applications

Tao, Jie; Karl, Wolfgang

doi:10.1007/978-3-540-27866-5_19

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3149)

Included in the following conference series:

European Conference on Parallel Processing

Abstract

OpenMP is becoming an important shared-memory programming model due to its portability, scalability, and flexibility. However, as with any programming paradigm, cache access behavior significantly influences the performance of OpenMP applications. Improving cache performance by reducing misses therefore becomes a critical issue for high-performance computing. This can be achieved by optimizing the source code, but also through adequate coherence schemes.

This work studies the behavior of various cache coherence protocols, including both hardware-based mechanisms and software-based relaxed models. The goal is to examine how well individual schemes perform with different architectures and applications, in order to find general ways to support cache design in shared-memory systems. The study is based on a simulation environment capable of modeling the parallel execution of OpenMP programs. First experimental results show that relaxed models are scalable and can be used as an efficient alternative to hardware coherence mechanisms.
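To make the contrast between the two families of schemes concrete, here is a minimal toy sketch. It is not the authors' simulation environment, and every name in it (`simulate`, `false_sharing_trace`, `BLOCK_WORDS`) is hypothetical. It assumes a hardware-style write-invalidate scheme that drops remote copies on every write, and a relaxed scheme that defers invalidation to the next barrier. It counts misses only and ignores data values, so it says nothing about how a real relaxed model merges concurrent writes to one block.

```python
from collections import defaultdict

BLOCK_WORDS = 8  # assumed cache-block size in words (illustrative only)


def block_of(addr):
    """Map a word address to its cache-block number."""
    return addr // BLOCK_WORDS


def simulate(trace, num_procs, relaxed):
    """Count cache misses per processor for a toy access trace.

    trace: list of ("r" | "w", proc, addr) or ("barrier",) events.
    relaxed=False: write-invalidate, remote copies dropped on each write.
    relaxed=True:  remote copies dropped only at the next barrier.
    Caches are unbounded, so only cold and coherence misses occur.
    """
    caches = [set() for _ in range(num_procs)]  # valid blocks per processor
    misses = [0] * num_procs
    written = defaultdict(set)  # block -> processors that wrote it this epoch

    for event in trace:
        if event[0] == "barrier":
            if relaxed:
                for blk, writers in written.items():
                    for p in range(num_procs):
                        # Drop p's copy if any other processor wrote the block.
                        if writers - {p}:
                            caches[p].discard(blk)
            written.clear()
            continue

        op, proc, addr = event
        blk = block_of(addr)
        if blk not in caches[proc]:
            misses[proc] += 1
            caches[proc].add(blk)
        if op == "w":
            if relaxed:
                written[blk].add(proc)
            else:
                for p in range(num_procs):
                    if p != proc:
                        caches[p].discard(blk)
    return misses


def false_sharing_trace(iterations):
    """Two processors write neighbouring words of one block, then swap reads."""
    trace = []
    for _ in range(iterations):
        trace.append(("w", 0, 0))
        trace.append(("w", 1, 1))
    trace.append(("barrier",))
    trace.append(("r", 0, 1))
    trace.append(("r", 1, 0))
    return trace


if __name__ == "__main__":
    trace = false_sharing_trace(4)
    print("write-invalidate misses:", simulate(trace, 2, relaxed=False))
    print("relaxed (barrier) misses:", simulate(trace, 2, relaxed=True))
```

On this hand-built false-sharing trace, the write-invalidate variant misses on almost every write because the block ping-pongs between the two caches, while the barrier-based variant misses only on first touch and after the barrier. The trace is constructed to show the mechanism and is not a stand-in for the measurements reported in the paper.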



Author information

Authors and Affiliations

Institut für Rechnerentwurf und Fehlertoleranz, Universität Karlsruhe (TH), 76128, Karlsruhe, Germany
Jie Tao & Wolfgang Karl

Editor information

Editors and Affiliations

Marco Danelutto
Marco Vanneschi, Computer Science Department, University of Pisa, Largo B. Pontecorvo 3, 56127 Pisa, Italy
Domenico Laforenza, Information Science and Technologies Institute (ISTI), The Italian National Research Council (CNR), Area della Ricerca, Via Giuseppe Moruzzi 1, I-56126 Pisa, Italy

About this paper

Cite this paper

Tao, J., Karl, W. (2004). Impact of Cache Coherence Models on Performance of OpenMP Applications. In: Danelutto, M., Vanneschi, M., Laforenza, D. (eds) Euro-Par 2004 Parallel Processing. Euro-Par 2004. Lecture Notes in Computer Science, vol 3149. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27866-5_19

DOI: https://doi.org/10.1007/978-3-540-27866-5_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22924-7
Online ISBN: 978-3-540-27866-5
eBook Packages: Springer Book Archive
