skip to main content
10.1145/2335755.2335802acmotherconferencesArticle/Chapter ViewAbstractPublication PagesxsedeConference Proceedingsconference-collections
research-article

Analyzing throughput and utilization on trestles

Published: 16 July 2012 Publication History

Abstract

The Trestles system is targeted to modest-scale and gateway users, and is designed to enhance users' productivity by maintaining good turnaround time as well as other user-friendly features such as long run times and user reservations. However, the goal of maintaining good throughput competes with the goal of high system utilization. This paper analyzes one year of Trestles operations to characterize the empirical relationship between utilization and throughput, with the objectives of understanding their relationship, and informing allocations and scheduling policies to optimize their tradeoff. There is considerable scatter in the correlation between utilization and throughput, as measured by expansion factor. There are periods of good throughput at both low and high utilizations, while there are other periods when throughput degrades significantly not only at high utilization but even at low utilization. However, throughput consistently degrades above ~90% utilization. User behavior clearly impacts the expansion factor metrics: the great majority of jobs with extreme expansion factors are associated with a very small fraction of users who either (1) flood the queue with many jobs or (2) request job run times far in excess of actual run times. While the former is a user workflow choice, the latter clearly demonstrates the benefit of matching requested time to actual run time. Utilization and throughput metrics derived from XDMoD are compared for Trestles with two other XSEDE systems, Ranger and Kraken, with different sizes and allocation/scheduling policies. Both Ranger and Kraken have generally higher utilization and, not surprisingly, higher expansion factors than Trestles over the analysis period. As a result of this analysis, we intend to increase the target allocation fraction from the current 70% to ~75-80%, and strongly advise users to reasonably match requested run times to actual run times.

References

[1]
Moore, R. L., Hart, D. L., Pfeiffer, W., Tatineni, M., Yoshimoto, K. Young, W. S.; "Trestles: A High-Productivity HPC System Targeted to Modest-Scale and Gateway Users," TeraGrid'11, July 2011, Salt Lake City, Utah, USA. ACM 978-1-4503-0888-5/11/07.
[2]
Yoshimoto, K. K., Choi, D. J., Moore, R. L., Majumdar, A., Hocks, E.; "Implementations of Urgent Computing on Production HPC Systems," April 2012, International Conference on Computational Science, ICCS 2012. (Accepted for publication May 2012.)
[3]
Ernemann, C.; Hamscher, V.; Yahyapour, R.;, "Benefits of global grid computing for job scheduling," Grid Computing, 2004. Proceedings. Fifth IEEE/ACM International Workshop on, vol., no., pp. 374--379, 8 Nov. 2004 URL:http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=1382854&isnumber=30134
[4]
Description of TACC Ranger system, http://www.tacc.utexas.edu/resources/hpc
[5]
Description of NICS Kraken system http://www.nics.tennessee.edu/computing-resources/kraken
[6]
Samuel, T. K.; Baer, T.; Brook, R. G.; Ezell, M.; Kovatch, P.;, "Scheduling diverse high performance computing systems with the goal of maximizing utilization," High Performance Computing (HiPC), 2011 18th International Conference on, vol., no., pp.1--6, 18--21 Dec. 2011 URL:http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6152723&isnumber=6152423
[7]
Keleher, P. J., Zotkin, D., Perkovic, D., Attacking the bottlenecks of backfilling schedulers, Cluster Computing, December 2000, 3:4, p. 245--254
[8]
http://www.sdsc.edu/catalina
[9]
http://www.clusterresources.com/product/maui

Cited By

View all
  • (2015)Storage utilization in the long tail of scienceProceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure10.1145/2792745.2792777(1-8)Online publication date: 26-Jul-2015
  • (2014)Gateways to DiscoveryProceedings of the 2014 Annual Conference on Extreme Science and Engineering Discovery Environment10.1145/2616498.2616540(1-8)Online publication date: 13-Jul-2014

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
XSEDE '12: Proceedings of the 1st Conference of the Extreme Science and Engineering Discovery Environment: Bridging from the eXtreme to the campus and beyond
July 2012
423 pages
ISBN:9781450316026
DOI:10.1145/2335755
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 July 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. allocations
  2. expansion factor
  3. scheduling policies
  4. utilization

Qualifiers

  • Research-article

Funding Sources

Conference

XSEDE12

Acceptance Rates

Overall Acceptance Rate 129 of 190 submissions, 68%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)2
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2015)Storage utilization in the long tail of scienceProceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure10.1145/2792745.2792777(1-8)Online publication date: 26-Jul-2015
  • (2014)Gateways to DiscoveryProceedings of the 2014 Annual Conference on Extreme Science and Engineering Discovery Environment10.1145/2616498.2616540(1-8)Online publication date: 13-Jul-2014

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media