skip to main content
10.1145/1274971.1275009acmconferencesArticle/Chapter ViewAbstractPublication PagesicsConference Proceedingsconference-collections
Article

Adaptive performance control for distributed scientific coupled models

Published:17 June 2007Publication History

ABSTRACT

The PerCo performance control framework is capable of managing the distributed execution of scientific coupled models using migration, for example, in response to changes in an execution environment. PerCo monitors execution times and reacts according to an adaptive performance control strategy whenever serious changes of behaviour occur. A computationally cheap technique is used per model to smooth the series of monitored execution times and to provide a short-term forecast for future execution times on currently assigned resources. Where this short-term forecast fails to be achieved, the system analyses whether migration would improve matters. For models that are candidates for migration, more accurate but computationally expensive techniques are used to form a longer-term prediction of future execution times on various candidate resources. Based on the predicted gain, a migration decision is made taking account of the expected cost of migration. Experimental results for small real scientific coupled models show that the performance control strategy behaves effectively in scenarios in which the ambient load is varied during execution.

References

  1. C. Armstrong, R. Ford, J. Gurd, M. Luján, K. R. Mayes, and G. D. Riley. Performance control of scientific coupled models in Grid environments. Concurrency and Computation: Practice and Experience, 17(2--4):259--295, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. Basu, A. Mukherjee, and S. Klivansky. Time series models for internet traffic. In Proceedings of the Fifteenth Annual Joint Conference of the IEEE Computer and Communication Societies -- INFOCOM'96, volume 2, pages 611--620, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. G. Box, G. Jenkins, and G. Reinsel. Time Series Analysis: Forecasting and Control. Prentice Hall, 3rd edition, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. R. Delgado-Buscalioni, P. V. Coveney, G. D. Riley, and R. W. Ford. Hybrid molecular-continuum fluid models: implementation within a general coupling framework. Philosophical Transactions of the Royal Society: Series A, 363(1833):1975--1986, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  5. P. A. Dinda. Design, implementation, and performance of an extensible toolkit for resource prediction in distributed systems. IEEE Transactions on Parallel and Distributed Systems, 17(2):160--173, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. R. W. Ford, G. Riley, M. K. Bane, C. W. Armstrong, and T. Freeman. Gcf: A general coupling framework. Concurrency and Computation: Practice and Experience, 18(2):163--181, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Gibbons. A historical application profiler for use by parallel schedulers. In Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing, volume 1291 of Lecture Notes in Computer Science, pages 58--77, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. F. Guim, A. Goyeneche, J. Corbalan, J. Labarta, and G. Terstyansky. Grid computing performance prediction based in historical information. In CoreGRID integration workshop. Integrated Research in Grid Computing Workshop, 2005.Google ScholarGoogle Scholar
  9. E. Huedo, R. S. Montero, and I. M. Llorente. A framework for adaptive execution in Grids. Software: Practice and Experience, 34(7):631--651, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. N. H. Kapadia, J. A. B. Fortes, and C. E. Brodley. Predictive application-performance modeling in a computational grid environment. In Proceedings of the Eighth International Symposium on High Performance Distributed Computing -- HPDC'99, pages 47--54, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. K. Kennedy et al. Toward a framework for preparing and executing adaptive grid applications. Proceedings of the 16th International Parallel and Distributed Processing Symposium -- IPDPS, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. B. J. Lafreniere and A. C. Sodan. Scopred-scalable user-directed performance prediction using complexity modelling and historical data. In Proceedings of the 11th International Workshop on Job Scheduling Strategies for Parallel Processing -- JSSPP, volume 3834 of Lecture Notes in Computer Science, pages 62--90, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. B.-D. Lee and J. M. Schopf. Run-time prediction of parallel applications on shared environments. In Proceedings of the IEEE International Conference on Cluster Computing, pages 487--491, 2003.Google ScholarGoogle Scholar
  14. S. G. Makridakis, S. C. Wheelwright, and R. J. Hyndman. Forecasting: Methods and Applications. John Wiley & Sons, 3rd edition, 1998.Google ScholarGoogle Scholar
  15. K. R. Mayes, M. Luján, G. D. Riley, J. Chin, P. V. Coveney, and J. R. Gurd. Towards performance control on the Grid. Philosophical Transactions of the Royal Society: Series A, 363(1833):1793--1805, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  16. L. J. Senger, M. J. Santana, and R. H. C. Santana. An instance-based learning approach for predicting execution times of parallel applications. In Proceedings of the 3rd International Information and Telecommunication Technologies Symposium, pages 9--15, 2005.Google ScholarGoogle Scholar
  17. W. Smith, I. T. Foster, and V. E. Taylor. Predicting application run times using historical information. In Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing -- JSSPP, volume 1459 of Lecture Notes in Computer Science, pages 122--142, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. V. Taylor, X. Wu, and R. Stevens. Prophesy: An infrastructure for performance analysis and modelling of parallel and Grid applications. ACM SIGMETRICS Performance Evaluation Review, 30(4):13--18, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. N. Tran and D. A. Reed. ARIMA time series modelling and forecasting for adaptive I/O prefetching. In Proceedings 15th International Conference on Supercomputing -- ICS'01, pages 473--485, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. S. S. Vadhiyar and J. J. Dongarra. Self adaptivity in Grid computing. Concurrency and Computation: Practice and Experience, 17(2--4):235--257, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. S. Vazhkudai and J. M. Schopf. Using regression techniques to predict large data transfers. International Journal of High Performance Computing Applications, 17(3):249--268, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. R. Wolski, N. T. Spring, and J. Hayes. The Network Weather Service: A distributed resource performance forecasting service for metacomputing. Future Generation Computing Systems, 15(5):757--768, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Adaptive performance control for distributed scientific coupled models

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              ICS '07: Proceedings of the 21st annual international conference on Supercomputing
              June 2007
              315 pages
              ISBN:9781595937681
              DOI:10.1145/1274971

              Copyright © 2007 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 17 June 2007

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • Article

              Acceptance Rates

              Overall Acceptance Rate584of2,055submissions,28%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader