skip to main content
10.1145/372202.372462acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
Article

Exploiting non-uniform reuse for cache optimization

Authors Info & Claims
Published:01 March 2001Publication History
First page image

References

  1. 1.D. F. Bacon, S. L. Graham, and O. J. Sharp. Compiler transformations for high-performance computing. ACM Computing Surveys, 26(4):345-420, Dec. 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.R. Berrendorf and H. Ziegler. PCL: The Performance Counter Library: A Common Interface to Access Hardware Performance Counters on Microprocessors (Version 1.2), 1998/99. FZJ-ZAM-IB-9816, Available at http://www.fz-juelich.de/zam/PCL/.Google ScholarGoogle Scholar
  3. 3.S. Chatterjee and S. Sen. Cache-efficient matrix transposition. In Proceedings of the Sixth IEEE International Symposium on High-Performance Computer Architecture, pages 195-205, 2000.Google ScholarGoogle Scholar
  4. 4.S. Coleman and K. S. McKinley. Tile size selection using cache organization and data layout. ACM SIGPLAN Notices, 30(6):279-290, June 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5.D. Gannon, W. Jalby, and K. Gallivan. Strategies for cache and local memory management by global program transformation. Journal of Parallel and Distributed Computing, 5(5):587-616, Oct. 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6.S. Ghosh, M. Martonosi, and S. Malik. Cache miss equations: A compiler framework for analyzing and tuning memory behavior. ACM Transactions on Programming Languages and Systems, 21(4):703-746, Nov. 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7.M. Kandemir, J.Ramanujam, and A. Choudhary. Improving cache locality by a combination of loop and data transformations. IEEE Transactions on Computers, 48(2), 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. 8.I. Kodukula, K. Pingali, R. Cox, and D. Maydan. An experimental evaluation of tiling and shackling for memory hierarchy management. In Proceedings of the ACM Int. Conference on Supercomputing, pages 482-490, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. 9.C. Leopold. Arranging statements and data of program instances for locality. Future Generation Computer Systems, 14:293-311, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. 10.C. Leopold. Generating structured program instances with a high degree of locality. In Proceedings of the 8th Euromicro Workshop on Parallel and Distributed Processing, pages 267-274. IEEE Computer Society Press, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11.K. S. McKinley, S. Carr, and C.-W. Tseng. Improving data locality with loop transformations. ACM Transactions on Programming Languages and Systems, 18(4):424-453, July 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. 12.S. S. Muchnick. Advanced compiler design and implementation. Morgan Kaufmann Publishers, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. 13.N. Mukhopadhyay. On the Effectiveness of Feedback-Guided Parallelization. PhD thesis, University of Manchester, 1999.Google ScholarGoogle Scholar
  14. 14.G. Rivera and C.-W. Tseng. A comparison of compiler tiling algorithms. In Proceedings of the Int. Conference on Compiler Construction, pages 168-182. Springer LNCS 1575, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. 15.G. Rivera and C.-W. Tseng. Locality optimizations for multi-level caches. In SC'99, 1999. Available at http://w3.csc.ucm.es/Otros/sc99/techpap.htm. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. 16.O. Temam, E. D. Granston, and W. Jalby. To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts. In Proceedings IEEE Supercomputing'93. IEEE Computer Society Press, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. 17.M. E. Wolf and M. S. Lam. A data locality optimizing algorithm. SIGPLAN Notices, 26(6):30-44, June 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18.M. J. Wolfe. High Performance Compilers for Parallel Computing. Addison-Wesley, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Exploiting non-uniform reuse for cache optimization

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                SAC '01: Proceedings of the 2001 ACM symposium on Applied computing
                March 2001
                692 pages
                ISBN:1581132875
                DOI:10.1145/372202

                Copyright © 2001 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 1 March 2001

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • Article

                Acceptance Rates

                Overall Acceptance Rate1,650of6,669submissions,25%

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader