Skip to main content
Log in

Application-specific feature selection and clustering approach with HPC system profiling data

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Exascale computing, the next-generation computing environment, is expected to be applied to scientific and engineering applications. Accordingly, high-performance computing (HPC) technology is also being developed to improve the performance and high-speed parallelism of many-core processors. Previous researches on improving HPC performance have developed in the form of improving the overall system performance by analyzing the state of the system occurring in the range of the knowledge of expert. However, performance events occurring in a processor in a many-core environment have a large number of indicators, and it is difficult to analyze the correlation between them. In this paper, we propose an application-specific feature selection and clustering approach with HPC system profiling data. The proposed approach performs PCA-based feature selections for efficient performance analysis methods. In addition, the application-specific characteristics from profiling data can be analyzed by unsupervised learning. In our experiments, we evaluated highly parallel supercomputers with NAS parallel benchmark and were able to cluster applications efficiently.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  1. Álvarez C-F, Javier AP, Milian RA, Ishii K, Morizawa RK, Sala Rosa MB (2019) Efficient development of high-performance data analytics in Python. Future Gener Comput Syst 111:571–580

  2. Runting RK, Phinn S, Xie Z, Venter O, JEM Watson (2020) Opportunities for big data in conservation and sustainability. Nat Commun 11(1):1–4

  3. Kim A, Lee J, Kim M (2016) Resource management model based on cloud computing environment. Int J Distrib Sens Netw 12(11):1550147716676554

  4. Tansley S, Tolle K (2009) The fourth paradigm: data-intensive scientific discovery, vol 1. Microsoft Research, Redmond

  5. Kim A, Lee J, Kim M (2016) Context-aware recommendation model based on mobile application analysis platform. Multimed Tools Appl 75(22):14783–14794

    Article  Google Scholar 

  6. Reed DA, Dongarra J (2015) Exascale computing and big data. Commun ACM 58(7):56–68

    Article  Google Scholar 

  7. Khaleghzadeh H, Manumachu RR, Lastovetsky A (2018) A novel data-partitioning algorithm for performance optimization of data-parallel applications on heterogeneous HPC platforms. IEEE Trans Parallel Distrib Syst 29(10):2176–2190

    Article  Google Scholar 

  8. Zhou L, Pan S, Wang J, Vasilakos AV (2017) Machine learning on big data: opportunities and challenges. Neurocomputing 237:350–361

    Article  Google Scholar 

  9. Cappello F, et al (2014) Toward exascale resilience: 2014 update. Supercomput Front Innov Int J 1(1):5–28

  10. Stegailov VV, Orekhov ND, Smirnov GS (2015) HPC hardware efficiency for quantum and classical molecular dynamics. In: International Conference on Parallel Computing Technologies. Springer, Cham, pp 469–473

  11. Erez M, Ahn JH, Garg A, Dally WJ, Darve E (2004) Analysis and performance results of a molecular modeling application on Merrimac. In: SC’04: Proceedings of the 2004 ACM/IEEE Conference on Supercomputing. IEEE, pp 42–42

  12. Williams S, Kalamkar DD, Singh A, Deshpande AM, Van Straalen B, Smelyanskiy M, Almgren A, Dubey P, Shalf J, Oliker L (2012) Optimization of geometric multigrid for emerging multi and manycore processors. In: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, ser SC ’12. IEEE Computer Society Press, Salt Lake City, Utah, pp 96:1–96:11

  13. Kumbhare N, Tunc C, Machovec D, Akoglu A, Hariri S, Siegel HJ (2017) Value based scheduling for oversubscribed power-constrained homogeneous HPC systems. In: International Conference on Cloud and Autonomic Computing (ICCAC), pp 120–130

  14. Richoux AN (2012) Scheduling in a high-performance computing (HPC) system. U.S. Patent No. 8,209,395

  15. Di S, Bouguerra M, Bautista-Gomez L, Cappello F (2014) Optimization of multi-level checkpoint model for large scale HPC applications. In: Proceedings of IEEE 28th international parallel distributing processing symposium, pp 1181–1190

  16. Wang Y et al (2014) Characterization and optimization of memory-resident mapreduce on HPC systems. In: 2014 IEEE 28th international parallel and distributed processing symposium. IEEE

  17. Lastovetsky A, Reddy R (2017) New model-based methods and algorithms for performance and energy optimization of data parallel applications on homogeneous multicore clusters. IEEE Trans Parallel Distrib Syst 28(4):1119–1133

    Article  Google Scholar 

  18. O’brien K et al (2017) A survey of power and energy predictive models in HPC systems and applications. ACM Comput Surv 50(3):1–38

  19. Ren G, Tune E, Moseley T, Shi Y, Rus S, Hundt R (2010) Google-wide profiling: a continuous profiling infrastructure for data centers. IEEE Micro 30(4):65–78

    Article  Google Scholar 

  20. Schlegel B, Karnagel T, Kiefer T, Lehner W (2013) Scalable frequent itemset mining on many-core processors. In: Proceedings of the ninth international workshop on data management on new hardware, ser DaMoN’13. ACM, New York, NY, pp 3:1–3:8

  21. Klinkenberg J, Terboven C, Lankes S, Müller MS (2017) Data mining-based analysis of HPC center operations. In: 2017 IEEE International Conference on Cluster Computing (CLUSTER). IEEE, pp 766–773

  22. Choi J, Park G, Nam D (2018) Efficient classification of application characteristics by using hardware performance counters with data mining. In: 2018 IEEE 3rd international workshops on foundations and applications of self* systems (FAS* W). IEEE, pp 24–29

  23. Collectl. [online]. Available http://collectl.sourceforge.net/

  24. Wieërs D. Dstat: Versatile resource statistics tool. http://dag.wieers.com/home-made/dstat/

  25. Sar [online]. Available https://linux.die.net/man/1/sar

  26. Ware H, Frdrick F (1994) Linux Man Page: Vmstat (8, Book Linux Man Page: Vmstat), vol 8

  27. Free [online]. Available https://linux.die.net/man/1/free

  28. Godard S (2015) Sysstat Utilities Home Page! [online]. Available http://sebastien.godard.pagesperso-orange.fr/index.html

  29. Blagodurov S, Fedorova A (2011) User-level scheduling on numa mul-tic ore systems under linux

  30. Lv Y, Sun B, Luo Q, Wang J, Yu Z, Qian X (2018) CounterMiner: mining big performance data from hardware counters. In: 51st Annual IEEE/ACM international symposium on microarchitecture (MICRO), Fukuoka, pp 613–626. https://doi.org/10.1109/MICRO.2018.00056

  31. Zellweger G, Lin D, Roscoe T (2016) So many performance events so little time. In: Proceedings of the ACM Asia-Pacific workshop on systems (APSys)

  32. May JM (2001) Mpx: software for multiplexing hardware performance counters in multithreaded programs. In: Proceedings of IEEE international symposium on parallel and distributed processing

  33. Mytkowicz T, Sweeney PF, Hauswirth M, Diwan A (2007) Time interpolation: so many metrics so few registers. In: Proceedings of IEEE international symposium on microarchitecture

  34. Weaver VM, McKee SA (2008) Can hardware performance counters be trusted. In: Proceedings of IEEE international symposium on workload characterization

  35. Mytkowicz T, Diwan A, Hauswirth M, Sweeney PF (2009) Producing wrong data without doing anything obviously wrong. In: Proceedings of the 14th ACM symposium on architectural support for programming languages and operating systems

  36. Yasin A, Top-Down A (2014) method for performance analysis and counters architecture. In: IEEE international symposium on performance analysis of systems and software (ISPASS), Monterey, CA, pp 35–44. https://doi.org/10.1109/ISPASS.2014.6844459

  37. Zhang J et al (2006) Application classification through monitoring and learning of resource consumption patterns. In: Proceedings of the parallel and distributed processing symposium

  38. Molka D et al (2017) Detecting memory-boundedness with hardware performance counters. In: Proceedings of the 8th ACM/SPEC International Conference on Performance Engineering, pp 27–38

  39. Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20:53–65

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by Korea Institute of Science and Technology Information (KISTI) Grant (No. K-20-L02-C08-S01).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Jongmin Lee or Mucheol Kim.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shin, M., Park, G., Park, C.Y. et al. Application-specific feature selection and clustering approach with HPC system profiling data. J Supercomput 77, 6817–6831 (2021). https://doi.org/10.1007/s11227-020-03533-2

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-020-03533-2

Keywords

Navigation