A Case Study on Compiler Optimizations for the Intel® CoreTM 2 Duo Processor

Bik, Aart J. C.; Kreitzer, David L.; Tian, Xinmin

doi:10.1007/s10766-008-0071-8

A Case Study on Compiler Optimizations for the Intel^® Core^TM 2 Duo Processor

Published: 10 April 2008

Volume 36, pages 571–591, (2008)
Cite this article

International Journal of Parallel Programming Aims and scope Submit manuscript

Aart J. C. Bik¹,
David L. Kreitzer¹ &
Xinmin Tian¹

214 Accesses
11 Citations
3 Altmetric
Explore all metrics

Abstract

The complexity of modern processors poses increasingly more difficult challenges to software optimization. Modern optimizing compilers have become essential tools for leveraging the power of recent processors by means of high-level optimizations to exploit multi-core platforms and single-instruction-multiple-data (SIMD) instructions, as well as advanced code generation to deal with microarchitectural performance aspects. Using the Intel^® Core^TM 2 Duo processor and Intel Fortran/C++ compiler as a case study, this paper gives a detailed account of the sort of optimizations required to obtain high performance on modern processors.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

C Compilers and Code Optimization for DSPs

A high quality compiler tool for application-specific instruction-set processors with library and parallel supports

Article 14 June 2015

Benbin Chen, Chung-Ta King, … Donghui Guo

A methodology pruning the search space of six compiler transformations by addressing them together as one problem and by exploiting the hardware architecture details

Article 09 January 2017

Vasilios Kelefouras

References

Allen J.R. and Kennedy K. (1987). Automatic translation of Fortran programs to vector form. ACM T. Progr. Lang. Sys. 9: 491–542
Article MATH Google Scholar
Bik A.J.C. (2004). The Software Vectorization Handbook. Intel Press, Hillsboro, OR
Google Scholar
Bik A.J.C., Girkar M., Grey P.M. and Tian X. (1998). Automatic intra-register vectorization for the Intel architecture. Int. J. Parallel Process. 30: 65–98
Article Google Scholar
Callahan, D., Cooper, K.D., Kennedy, K., Torczon, L.: Interprocedural constant propagation. In: SIGPLAN ’86 Symposium on Compiler Construction, pp. 152–161. July 1986
Chandra, R., Dagum, L., Kohr, D., Maydan, D., McDonald, H., Menon, R.: Parallel Programming in OpenMP. Morgan Kaufmann Publishers Inc. (2001)
Eichenberger, A., Wu, P., O’Brien, K.: Vectorization for SIMD architectures with alignment constraints. In: Proceedings of the ACM SIGPLAN 2004 Conference on Programming Language Design and Implementation, pp. 82–93. Washington DC, June 2004
Hennessy J.L. and Patterson D.A. (1990). Computer Architecture: A Quantitative Approach. Morgan Kaufmann Publishers, San Mateo, Californa
Google Scholar
Intel Corporation. Intel Architecture Software Developer’s Manual, vol. 1: Basic Architecture. Intel Corporation (available at http://developer.intel.com/) (2007)
Krall A. and Lelait S. (2000). Compilation techniques for multi-media processors. Int. J. Parallel Prog. 28(4): 347–361
Article Google Scholar
Larsen, S., Amarasinghe, S.: Exploiting Superword level parallelism with multimedia instruction sets. In: Proceeding of the SIGPLAN Conference on Programming Language Design and Implementation. Vancouver, B.C., June 2000
Larsen, S., Witchel, E., Amarasinghe, S.: Increasing and detecting memory address congruence. In: Proceedings of the 11th International Conference on Parallel Architectures and Compilation Techniques. Charlottesville, VA, September 2002
McCalpin, J.D.: Memory bandwidth and machine balance in current high performance computers. IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter, December (1995)
Muchnick S. (1997). Advanced Compiler Design and Implementation. Morgan Kaufmann Publishers, San Mateo, CA
Google Scholar
Pryanishnikov, I., Krall, A., Horspool, N.: Pointer alignment analysis for processors with SIMD instructions. In: Proceedings of the 5th Workshop on Media and Streaming Processors. San Diego, CA, December 2003
Tian, X., Bik, A.J.C., Girkar, M., Grey, P.M., Saito, H., Su, E.: Intel^® OpenMP C++/Fortran compiler for hyper-threading technology: implementation and performance. Intel Technol. J. 6(1) (2002)
Tian X., Gikar M., Bik A.J.C. and Saito H. (2005). Practical compiler techniques on efficient multithreaded code generation for OpenMP programs. Comput. J. 48(5): 558–601
Article Google Scholar
Wolfe M.J. (1996). High Performance Compilers for Parallel Computing. Addison-Wesley, Redwood City, California
MATH Google Scholar
Zima H. (1990). Supercompilers for Parallel and Vector Computers. ACM Press, New York
Google Scholar

Download references

Author information

Authors and Affiliations

Intel Corporation, 2200 Mission College Blvd. SC12-301, Santa Clara, CA, 95052, USA
Aart J. C. Bik, David L. Kreitzer & Xinmin Tian

Authors

Aart J. C. Bik
View author publications
You can also search for this author in PubMed Google Scholar
David L. Kreitzer
View author publications
You can also search for this author in PubMed Google Scholar
Xinmin Tian
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Aart J. C. Bik.

Additional information

The first author was working for Intel Corp. when the paper was written, but moved to Google Inc. since.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bik, A.J.C., Kreitzer, D.L. & Tian, X. A Case Study on Compiler Optimizations for the Intel^® Core^TM 2 Duo Processor. Int J Parallel Prog 36, 571–591 (2008). https://doi.org/10.1007/s10766-008-0071-8

Download citation

Received: 11 April 2007
Accepted: 28 February 2008
Published: 10 April 2008
Issue Date: December 2008
DOI: https://doi.org/10.1007/s10766-008-0071-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

A Case Study on Compiler Optimizations for the Intel^® Core^TM 2 Duo Processor

Abstract

Access this article

Similar content being viewed by others

C Compilers and Code Optimization for DSPs

A high quality compiler tool for application-specific instruction-set processors with library and parallel supports

A methodology pruning the search space of six compiler transformations by addressing them together as one problem and by exploiting the hardware architecture details

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A Case Study on Compiler Optimizations for the Intel® CoreTM 2 Duo Processor

Abstract

Access this article

Similar content being viewed by others

C Compilers and Code Optimization for DSPs

A high quality compiler tool for application-specific instruction-set processors with library and parallel supports

A methodology pruning the search space of six compiler transformations by addressing them together as one problem and by exploiting the hardware architecture details

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

A Case Study on Compiler Optimizations for the Intel^® Core^TM 2 Duo Processor