Skip to main content

ATLAS (Automatically Tuned Linear Algebra Software)

  • Reference work entry
Encyclopedia of Parallel Computing

Synonyms

Numerical libraries

Definition

ATLAS [20222425] is an ongoing research project that uses empirical tuning to optimize dense linear algebra software. The fruits of this research are embodied in an empirical tuning framework available as an open source/free software package (also referred to as “ATLAS”), which can be downloaded from the ATLAS homepage [23]. ATLAS generates optimized libraries which are also often collectively referred to as “ATLAS,” “ATLAS libraries,” or more precisely, “ATLAS-tuned libraries.” In particular, ATLAS provides a full implementation of the BLAS [671012] (Basic Linear Algebra Subprograms) API, and a subset of optimized LAPACK [1] (Linear Algebra PACKage) routines. Because dense linear algebra is rich in operand reuse, many routines can run tens or hundreds of times faster when tuned for the hardware than when written naively. Unfortunately, highly tuned codes are usually not performance portable (i.e., a code transformation that helps...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 1,600.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 1,799.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Bibliography

  1. Anderson E, Bai Z, Bischof C, Demmel J, Dongarra J, Du Croz J, Greenbaum A, Hammarling S, McKenney A, Ostrouchov S, Sorensen D (1999) LAPACK users’ guide, 3rd edn. SIAM, Philadelphia, PA

    Book  Google Scholar 

  2. Bilmes J, Asanović K, Chin CW, Demmel J (1997) Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology. In: Proceedings of the ACM SIGARC International Conference on SuperComputing, Vienna, Austria, July 1997

    Google Scholar 

  3. Castaldo AM, Whaley RC (2009) Minimizing startup costs for performance-critical threading. In: Proceedings of the IEEE international parallel and distributed processing symposium, Rome, Italy, May 2009

    Google Scholar 

  4. Castaldo AM, Whaley RC (2010) Scaling LAPACK panel operations using parallel cache assignment. In: Accepted for publication in 15th AMC SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, Bangalore, India, January 2010

    Google Scholar 

  5. Castaldo AM, Whaley RC, Chronopoulos AT (2008) Reducing floating point error in dot product using the superblock family of algorithms. SIAM J Sci Comput 31(2):1156–1174

    Article  MATH  MathSciNet  Google Scholar 

  6. Dongarra J, Du Croz J, Duff I, Hammarling S (1990) A set of level 3 basic linear algebra subprograms. ACM Trans Math Softw 16(1):1–17

    Article  MATH  Google Scholar 

  7. Dongarra J, Du Croz J, Hammarling S, Hanson R (1988) Algorithm 656: an extended set of basic linear algebra subprograms: model implementation and test programs. ACM Trans Math Softw 14(1):18–32

    Article  MATH  Google Scholar 

  8. Dongarra J, Du Croz J, Hammarling S, Hanson R (1988) An extended set of FORTRAN basic linear algebra subprograms. ACM Trans Math Softw 14(1):1–17

    Article  MATH  Google Scholar 

  9. Elmroth E, Gustavson F (2000) Applying recursion to serial and parallel qr factorizaton leads to better performance. IBM J Res Dev 44(4):605–624

    Article  Google Scholar 

  10. Hanson R, Krogh F, Lawson C (1973) A proposal for standard linear algebra subprograms. ACM SIGNUM Newsl 8(16):47–69

    Google Scholar 

  11. Kågström B, Ling P, van Loan C (1998) Gemm-based level 3 blas: high performance model implementations and performance evaluation benchmark. ACM Trans Math Softw 24(3):268–302

    Article  MATH  Google Scholar 

  12. Lawson C, Hanson R, Kincaid D, Krogh F (1979) Basic linear algebra subprograms for fortran usage. ACM Trans Math Softw 5(3):308–323

    Article  MATH  Google Scholar 

  13. Li Y, Dongarra J, Tomov S (2009) A note on autotuning GEMM for GPUs. Technical Report UT-CS-09-635, University of Tennessee, January 2009

    Google Scholar 

  14. Whaley TC et al. Atlas mailing lists. http://math-atlas.sourceforge.net/faq.html#lists

  15. Volkov V, Demmel J (2008) Benchmarking GPUs to tune dense linear algebra. In Supercomputing 2008. Los Alamitos, November 2008

    Google Scholar 

  16. Volkov V, Demmel J (2008). LU, QR and Cholesky factorizations using vector capabilities of GPUs. Technical report, University of California, Berkeley, CA, May 2008

    Google Scholar 

  17. Whaley RC (2009) Atlas errata file. http://math-atlas.sourceforge.net/errata.html

  18. Whaley RC (2008) Empirically tuning lapack’s blocking factor for increased performance. In: Proceedings of the International Multiconference on Computer Science and Information Technology, Wisla, Poland, October 2008

    Google Scholar 

  19. Whaley RC, Castaldo AM (2008) Achieving accurate and context-sensitive timing for code optimization. Softw Practice Exp 38(15):1621–1642

    Article  Google Scholar 

  20. Whaley RC, Dongarra J (1997) Automatically tuned linear algebra software. Technical Report UT-CS-97-366, University of Tennessee, TN, December 1997. http://www.netlib.org/lapack/lawns/lawn131.ps

  21. Whaley RC, Dongarra J (1998) Automatically tuned linear algebra software. In: SuperComputing 1998: high performance networking and computing, San Antonio, TX, USA, 1998. CD-ROM proceedings. Winner, best paper in the systems category. http://www.cs.utsa.edu/~whaley/papers/atlas_sc98.ps

  22. Whaley RC, Dongarra J (1999) Automatically tuned linear algebra software. In: Ninth SIAM conference on parallel processing for scientific computing, 1999. CD-ROM proceedings.

    Google Scholar 

  23. Whaley RC, Petitet A (2009) Atlas homepage. http://math-atlas.sourceforge.net/

  24. Whaley RC, Petitet A (2005) Minimizing development and maintenance costs in supporting persistently optimized BLAS. Softw Practice Exp 35(2):101–121, February 2005. http://www.cs.utsa.edu/~whaley/papers/spercw04.ps

  25. Whaley RC, Petitet A, Dongarra JJ (2001) Automated empirical optimization of software and the ATLAS project. Parallel Comput 27(1–2):3–35

    Article  MATH  Google Scholar 

  26. Whaley RC, Whalley DB (2005) Tuning high performance kernels through empirical compilation. In: The 2005 international conference on parallel processing, Oslo, Norway, June 2005, pp 89–98

    Google Scholar 

  27. Yi Q, Whaley RC (2007) Automated transformation for performance-criticial kernels. In: ACM SIGPLAN symposium on library-centric software design, Montreal, Canada, October 2007

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer Science+Business Media, LLC

About this entry

Cite this entry

Whaley, R.C. (2011). ATLAS (Automatically Tuned Linear Algebra Software). In: Padua, D. (eds) Encyclopedia of Parallel Computing. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-09766-4_85

Download citation

Publish with us

Policies and ethics