Skip to main content

Implications of Memory Performance for Highly Efficient Supercomputing of Scientific Applications

  • Conference paper
Parallel and Distributed Processing and Applications (ISPA 2006)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4330))

Abstract

This paper examines the memory performance of the vector-parallel and scalar-parallel computing platforms across five applications of three scientific areas; electromagnetic analysis, CFD/heat analysis, and seismology. Our evaluation results show that the vector platforms can achieve the high computational efficiency and hence significantly outperform the scalar platforms in the areas of these applications. We did exhaustive experiments and quantitatively evaluated representative scalar and vector platforms using real applications from the viewpoint of the system designers and developers. These results demonstrate that the ratio of memory bandwidth to floating-point operation rate needs to reach 4-bytes/flop to preserve the computational performance with hiding the memory access latencies by pipelined vector operations in the vector platforms. We also confirm that the enough number of memory banks to handle stride memory accesses leads to an increase in the execution efficiency. On the scalar platforms, the cache hit rate needs to be almost 100% to achieve the high computational efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Shingu, S., et al.: A 26.58 Tflops Global Atmospheric Simulation with the Spectral Transform Method on the Earth Simulator. In: Proceedings of the ACM/IEEE SC 2002 conference (2002)

    Google Scholar 

  2. Yokokawa, M., et al.: 16.4-Tflops Direct Numerical Simulation of Turbulence by a Fourier Spectral Method on the Earth. In: Proceedings of the ACM/IEEE SC 2002 conference (2002)

    Google Scholar 

  3. Oliker, L., et al.: Evaluation of Cache-based Superscalar and Cacheless Vector Architectures for Scientific Computations. In: Proceedings of the ACM/IEEE SC 2003 conference (2003)

    Google Scholar 

  4. Oliker, L., et al.: Scientific Computations on Modern Parallel Vector System. In: Proceedings of the ACM/IEEE SC 2004 conference (2004)

    Google Scholar 

  5. Fatoohi, R.A.: Vector Performance Analysis of Three Supercomputers: Cray-2, Cray Y-MP, and ETA10-Q. In: Proceedings of Supercomputing 1989 (1989)

    Google Scholar 

  6. Fatoohi, R.A.: Vector Performance Analysis of The NEC SX-2. In: Proceedings of Supercomputing 1990 (1990)

    Google Scholar 

  7. Shan, H., et al.: Performance Characteristics of the Cray X1 and Their Implications for Application Performance Tuning. In: Proceedings of the ICS 2004 (2004)

    Google Scholar 

  8. Kitagawa, K., et al.: A Hardware Overview of SX-6 and SX-7 Supercomputer. NEC Research & Development 44, 2–7 (2003)

    Google Scholar 

  9. Senta, T., et al.: Itanium2 32-way Server System Architecture. NEC Research & Development 44, 8–12 (2003)

    Google Scholar 

  10. Kobayashi, T., et al.: FDTD simulation on array antenna SAR-GPR for land mine detection. In: Proceeding of SSR 2003: 1st International Symposium on Systems and Human Science, Osaka, Japan, November 2003, pp. 279–283 (2003)

    Google Scholar 

  11. Kunz, K.S., Luebbers, R.J.: The Finite Difference Time Domain Method for Electromagnetics. CRC Press, Boca Raton (1993)

    Google Scholar 

  12. Takagi, Y., et al.: Study of High Gain and Broadband Antipodal Fermi Anenna with Corrugation. In: 2004 International Symposium on Antennas and Propagation, vol. 1, pp. 69–72 (2004)

    Google Scholar 

  13. Tsuboi, K., Masuya, G.: Direct Numerical Simulations for Instabilities of Remixed Planar Flames. In: The Fourth Asia-Pacific Conference on Combustion, Nanjing, China (November 2003)

    Google Scholar 

  14. Nakajima, M., et al.: Numerical Simulation of Three-Dimensional Separated Flow and Heat Transfer around Staggerd Surface-Mounted Rectangular Blocks in a Channel. Numerical Heat Transfer, Part A 47, 691–708 (2005)

    Article  Google Scholar 

  15. Ariyoshi, K., et al.: Spatial variation in propagation speed of postseismic slip on the subducting plate boundary. In: 2nd Water Dynamics, vol. B-30, Sendai, Japan (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Musa, A., Takizawa, H., Okabe, K., Soga, T., Kobayashi, H. (2006). Implications of Memory Performance for Highly Efficient Supercomputing of Scientific Applications. In: Guo, M., Yang, L.T., Di Martino, B., Zima, H.P., Dongarra, J., Tang, F. (eds) Parallel and Distributed Processing and Applications. ISPA 2006. Lecture Notes in Computer Science, vol 4330. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11946441_76

Download citation

  • DOI: https://doi.org/10.1007/11946441_76

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68067-3

  • Online ISBN: 978-3-540-68070-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics