Abstract
Standard compilers cannot fully harness the enormous performance potential of Blue Gene systems. To reach the leading position in the Top500 supercomputing list, IBM had to put considerable effort into coding and tuning a limited range of low-level numerical kernel routines by hand. This paper presents the Vienna MAP compiler, which specifically targets the signal transform codes that are ubiquitous in compute-intensive scientific applications. When compiling FFTW code, MAP reaches as much as 80% of the optimum performance of Blue Gene systems. In an application code, MAP enabled a sustained performance of 60 Tflop/s on Blue Gene/L.
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kral, S., Triska, M., Ueberhuber, C.W. (2006). Compiler Technology for Blue Gene Systems. In: Nagel, W.E., Walter, W.V., Lehner, W. (eds) Euro-Par 2006 Parallel Processing. Euro-Par 2006. Lecture Notes in Computer Science, vol 4128. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11823285_29
DOI: https://doi.org/10.1007/11823285_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37783-2
Online ISBN: 978-3-540-37784-9
eBook Packages: Computer Science, Computer Science (R0)