DOI: 10.1145/1531743.1531779
research-article

Characterizing the performance penalties induced by irregular code using pointer structures and indirection arrays on the Intel Core 2 architecture

Published: 18 May 2009

ABSTRACT

Irregularity is one of the fundamental causes of performance degradation in applications. Both hardware and software have a hard time coping with irregular memory access patterns and irregular control flow. On the hardware side, execution is optimized for regular data accesses, and irregular memory access streams cannot be predicted. On the software side, compilers are not able to reason about memory locations and loop bounds, which prevents many optimizations from being applied. In this paper, we measure and characterize the impact of various facets of irregularity using SPARK00, a set of benchmarks that explicitly targets the measurement of the impact of irregularity, on one of the most commonly used architectures today, the Intel Core 2. The benchmarks consist of kernels that are based on pointers, a notorious cause of irregularity, kernels that use indirection arrays, and kernels that implement regular counterparts of some of the irregular kernels. By employing different data sets and different memory layouts, these benchmarks are used to characterize architectural features.
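The three kernel families named in the abstract can be illustrated with a minimal C sketch. This is not code from SPARK00: the `node` type and the functions `sum_regular`, `sum_indirect`, and `sum_list` are hypothetical names chosen here only to show how the address stream differs between a regular loop, a loop over an indirection array, and a pointer-chasing loop.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical node type for a pointer-based kernel (not taken from SPARK00). */
struct node {
    double value;
    struct node *next;
};

/* Regular kernel: unit-stride loads, trip count known on loop entry. */
double sum_regular(const double *a, size_t n)
{
    double s = 0.0;
    for (size_t i = 0; i < n; i++)
        s += a[i];
    return s;
}

/* Indirection-array kernel: the address of each load of a[] depends on
 * the run-time contents of idx[], so the access pattern is data dependent. */
double sum_indirect(const double *a, size_t a_len,
                    const size_t *idx, size_t n)
{
    double s = 0.0;
    for (size_t i = 0; i < n; i++) {
        assert(idx[i] < a_len); /* guard against out-of-range indices */
        s += a[idx[i]];
    }
    return s;
}

/* Pointer-based kernel: each load address comes from the previous load,
 * and the trip count is unknown until the NULL terminator is reached. */
double sum_list(const struct node *head)
{
    double s = 0.0;
    for (const struct node *p = head; p != NULL; p = p->next)
        s += p->value;
    return s;
}
```

When `idx` is a permutation of the valid indices and the list holds the same values, all three functions return the same sum, so any timing difference between them would come from the access pattern rather than from the arithmetic. Varying the order in which list nodes are placed in memory, or the contents of `idx`, is one plausible way to model the "different memory layouts" the abstract mentions.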

      • Published in

        CF '09: Proceedings of the 6th ACM conference on Computing frontiers
        May 2009
        238 pages
        ISBN:9781605584133
        DOI:10.1145/1531743

        Copyright © 2009 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States


        Acceptance Rates

        CF '09 paper acceptance rate: 26 of 113 submissions, 23%. Overall acceptance rate: 240 of 680 submissions, 35%.
