Skip to main content
Log in

Dependability evaluation and sensitivity analysis of data center cooling systems

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Over the years, data centers have been evolving to meet the new demands of cloud computing platforms, e-commerce, social networking services, and big data. These large data centers must meet various dependability requirements to guarantee the quality of service at a high level of reliability and availability, reducing interoperability time, as this is a major competitive factor for companies. The cooling system plays an important role in the availability of data centers. It is responsible for keeping the IT system at a suitable temperature to avoid hardware and software failures since operating at high temperatures negatively impacts the reliability of the electronic components. This paper investigates reliability and availability metrics and performs parametric sensitivity analysis for data center cooling systems. Our approach is based on modeling reliability block diagrams and the sensitivity analysis technique; the last is to assess how sensitive the availability is about model component failure and repair times.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Availability of data and materials

Data supporting the findings of this study are not openly available due to confidentiality reasons and are available from the corresponding author upon reasonable request. The data are arranged in the author’s Academic Drive and can be accessed through this https://docs.google.com/spreadsheets/u/1/d/1rCK6NSv1lSc4KKcFQNMGfE62p2bIYlOR/edit?usp=drive_web%20&ouid=101398600213951117518%20&rtpof=true.

References

  1. Han B, Li W, Li M, Liu L, Song J (2020) Study on libr/h2o absorption cooling system based on enhanced geothermal system for data center. Energy Rep 6:1090–1098

    Article  Google Scholar 

  2. Callou G, Maciel P, Tutsch D, Ferreira J, Araújo J, Souza R (2013) Estimating sustainability impact of high dependable data centers: a comparative study between Brazilian and US energy mixes. Computing 95(12):1137–1170

    Article  Google Scholar 

  3. Institute U (2021) Sistema Tier Classification. Disponível em. https://pt.uptimeinstitute.com/tiers. Accessed on Dec 2021

  4. Camboim K, Ferreira J, Melo C, Araujo J, Alencar F, Maciel P (2021) Dependability and sustainability evaluation of data center electrical architectures. In: 2021 IEEE International Systems Conference (SysCon). IEEE, pp 1–8

  5. Marin PS (2011) Data centers-desvendando cada passo: conceitos, projeto, infraestrutura física e eficiência energética. Érica, São Paulo

    Google Scholar 

  6. Camboim K, Souza E, Guimarães A, Araujo J, Maciel P (2021) Reliability-and-availability sensitivity analysis on convergent network infrastructures: methodology and case study. In: 2021 IEEE International Conference on Systems, Man and Cybernetics (SMC). IEEE, pp 1–8

  7. Ding T, Chen X, Cao H, He Z, Wang J, Li Z (2021) Principles of loop thermosyphon and its application in data center cooling systems: a review. Renew Sustain Energy Rev 150:111389

  8. Fu L, Wan J, Yang J, Cao D, Zhang G (2017) Dynamic thermal and it resource management strategies for data center energy minimization. J Cloud Comput 6(1):1–16

    Article  Google Scholar 

  9. Chen H, Peng Y-h, Wang Y-l (2019) Thermodynamic analysis of hybrid cooling system integrated with waste heat reusing and peak load shifting for data center. Energy Convers Manag 183:427–439

  10. Daraghmeh HM, Wang C-C (2017) A review of current status of free cooling in datacenters. Appl Therm Eng 114:1224–1239

    Article  Google Scholar 

  11. Mujawar A, Kumar S, Krishnan SS, Sawant A (2018) Iot: Green data center strategies. Int J Future Revolut Comput Sci Commun Eng 4(5):170–174

    Google Scholar 

  12. Khalaj AH, Halgamuge SK (2017) A review on efficient thermal management of air-and liquid-cooled data centers: from chip to the cooling system. Appl Energy 205:1165–1188

    Article  Google Scholar 

  13. Liu Y, Wei X, Xiao J, Liu Z, Xu Y, Tian Y (2020) Energy consumption and emission mitigation prediction based on data center traffic and pue for global data centers. Glob Energy Interconnect 3(3):272–282

    Article  Google Scholar 

  14. Zhang X, Lindberg T, Xiong N, Vyatkin V, Mousavi A (2017) Cooling energy consumption investigation of data center it room with vertical placed server. Energy Procedia 105:2047–2052

    Article  Google Scholar 

  15. Souza L, Camboim K, Alencar F (2022) A systematic literature review about integrating dependability attributes, performability and sustainability in the implantation of cooling subsystems in data center. J Supercomput 1–37

  16. Nadjahi C, Louahlia H, Lemasson S (2018) A review of thermal management and innovative cooling strategies for data center. Sustain Comput Inf Syst 19:14–28

    Google Scholar 

  17. Rathnam L (2022) CRAC and CRAH: a complete guide to cooling your data centers and computer rooms. https://techgenix.com/cooling-data-center-crac-crah/. Accessed July 2022

  18. Cronin D (2022) How To Choose Between CRAHs and CRACs. https://www.missioncriticalmagazine.com/articles/93650-how-to-choose-between-crahs-and-cracs. Accessed July 2022

  19. Jose L (2019) Data center cooling infrastructure. https://dc.mynetworkinsights.com/data-center-cooling-infrastructure/. Accessed July 2022

  20. Gomes DM, Endo PT, Gonçalves G, Rosendo D, Santos GL, Kelner J, Sadok D, Mahloo M (2017) Evaluating the cooling subsystem availability on a cloud data center. In: 2017 IEEE Symposium on Computers and Communications (ISCC). IEEE, pp 736–741

  21. Lata M, Kumar V (2016) Innovative cooling strategies for cloud computing data centers. IUP J Inf Technol 12(1)

  22. Carmo G (2021) Data Center #03 RefrigeraçÃO Chiller A ÁGUA CICLO BÁSICO DA REFRIGERAÇÃO. https://www.youtube.com/watch?v=71byYXQGNp4 &t=706s. Accessed July 2022

  23. Chu J, Huang X (2021) Research status and development trends of evaporative cooling air-conditioning technology in data centers. Energy Built Environ

  24. Hale P, Arno RG (2000) Survey of reliability and availability information for power distribution, power generation, and HVAC components for commercial, industrial, and utility installations. In: 2000 IEEE Industrial and Commercial Power Systems Technical Conference. Conference Record (Cat. No. 00CH37053). IEEE, pp 31–54

  25. Johnston C, Partners L (2021) Excool zero engineering review. Report, Cundall

  26. Kuo W, Zuo MJ (2003) Optimal reliability modeling: principles and applications. Wiley, Hoboken

    Google Scholar 

  27. Laprie JCC, Avizienis A, Kopetz H (eds) (1992) Dependability: basic concepts and terminology. Springer, Secaucus

    MATH  Google Scholar 

  28. Avizienis A, Laprie JC, Randell B, Landwehr C (2004) Basic concepts and taxonomy of dependable and secure computing. IEEE Trans Dependable Secure Comput 1(1):11–33. https://doi.org/10.1109/TDSC.2004.2

    Article  Google Scholar 

  29. Pradhan DK (1996) Fault-tolerant computer system design. Prentice-Hall Inc., Hoboken

    Google Scholar 

  30. Maciel PRM, Trivedi KS, Matias R, Kim DS (2011) Dependability modeling. Performance and dependability in service computing: concepts, techniques and research directions, pp 53–97. Information Science Reference - Imprint of: IGI Publishing, Hershey, PA

  31. Ebeling CE (1997) An introduction to reliability and maintainability engineering. McGrow-Hill Book Co, Singapura

    Google Scholar 

  32. Matos Júnior, RDS (2016) Identification of availability and performance bottlenecks in cloud computing systems: an approach based on hierarchical models and sensitivity analysis

  33. Dâmaso A, Rosa N, Maciel P (2017) Integrated evaluation of reliability and power consumption of wireless sensor networks. Sensors 17(11):2547

    Article  Google Scholar 

  34. Hamby DM (1994) A review of techniques for parameter sensitivity analysis of environmental models. Environ Monit Assess 32(2):135–154

    Article  Google Scholar 

  35. Frank PM, Eslami M (1980) Introduction to system sensitivity theory. IEEE Trans Syst Man Cybern 10(6):337–338

    Article  Google Scholar 

  36. Dantas RCDSP (2019) Modelos de desempenho, confiabilidade e disponibilidade para o planejamento de sistemas de transporte público

  37. Melo RMD (2017) Análise de sensibilidade aplicada à identificação de pontos que requerem melhoria na disponibilidade em infraestrura de cloud

  38. Jain R (1991) The art of computer systems performance analysis: techniques for experimental design, measurement, simulation, and modeling. Wiley, Hoboken

    MATH  Google Scholar 

  39. Ross SM (2017) Introductory statistics. Academic Press, Boston

    Book  MATH  Google Scholar 

  40. Matos RDS, Maciel PR, Machida F, Kim DS, Trivedi KS (2012) Sensitivity analysis of server virtualized system availability. IEEE Trans Reliab 61(4):994–1006

    Article  Google Scholar 

  41. Matos R, Araujo J, Oliveira D, Maciel P, Trivedi K (2015) Sensitivity analysis of a hierarchical model of mobile cloud computing. Simul Model Pract Theory 50:151–164

    Article  Google Scholar 

  42. Maciel P, Matos R, Silva B, Figueiredo J, Oliveira D, Fé I, Maciel R, Dantas J (2017) Mercury: performance and dependability evaluation of systems with exponential, expolynomial, and general distributions. In: 2017 IEEE 22nd Pacific Rim International Symposium on Dependable Computing (PRDC). IEEE, pp 50–57

  43. Camboim K, Araujo J, Melo C, Alencar F, Maciel P (2021) Dependability and sensitivity analysis in dense data center networks. In: 2021 16th Iberian Conference on Information Systems and Technologies (CISTI). IEEE, pp 1–6

  44. Camboim K, Ferreira J, Araujo J, Alencar F (2020) Sustainability analysis in data center dense architectures. In: 2020 IEEE 9th International Conference on Cloud Networking (CloudNet). IEEE, pp 1–6

  45. MoDCS (2023) MoDCS Research Group. Disponível em. https://www.modcs.org/. Accessed Feb 2023

  46. Wang J, Zhang Q, Yoon S, Yu Y (2019) Impact of uncertainties on the supervisory control performance of a hybrid cooling system in data center. Build Environ 148:361–371

    Article  Google Scholar 

  47. Junior RM, Guimaraes A, Camboim K, Maciel P, Trivedi K (2011) Sensitivity analysis of availability of redundancy in computer networks. In: The Fourth International Conference on Communication Theory, Reliability, and Quality of Service, Budapest, Hungary. Citeseer

  48. Gomes D, Leoni G, Sadok D, Gonçalves G, Endo P, Maciel P (2020) Temperature variation impact on estimating costs and most critical components in a cloud data centre. Int J Comput Appl Technol 62(4):361–374

    Article  Google Scholar 

  49. Lushpa I, Novikov K, Polesskiy S (2019) The reliability characteristics of the data processing centers cooling systems. In: 2019 International Seminar on Electron Devices Design and Production (SED), pp 1–4. https://doi.org/10.1109/SED.2019.8798415

  50. Jiang J, Wei X, Gao W, Kuroki S, Liu Z (2018) Reliability and maintenance prioritization analysis of combined cooling, heating and power systems. Energies. https://doi.org/10.3390/en11061519

    Article  Google Scholar 

  51. Kumar V, Gupta S, Tripathi AK (2018) Early quantification of reliability for a safety critical and control system: a case study of reactor core cooling system. Int J Eng Technol 7(2.12):248–252

    Article  Google Scholar 

  52. Veeramany A, Pandey MD (2011) Reliability analysis of nuclear component cooling water system using semi-Markov process model. Nucl Eng Des 241(5):1799–1806

    Article  Google Scholar 

  53. Callou G, Andrade E, Ferreira J (2019) Modeling and analyzing availability, cost and sustainability of it data center systems. In: 2019 IEEE International Conference on Systems, Man and Cybernetics (SMC). IEEE, pp 2127–2132

  54. Wang J, Zhang Q, Yoon S, Yu Y (2019) Reliability and availability analysis of a hybrid cooling system with water-side economizer in data center. Build Environ 148:405–416

    Article  Google Scholar 

  55. Koo S, Chung T-S, Kim S (2015) Availability analysis for a data center cooling system with (n, k)-way CRACs. Springer, Berlin

    Book  Google Scholar 

  56. Cheung H, Wang S (2019) Reliability and availability assessment and enhancement of water-cooled multi-chiller cooling systems for data centers. Reliab Eng Syst Saf 191:106573

    Article  Google Scholar 

  57. Wan J, Gui X, Kasahara S, Zhang Y, Zhang R (2018) Air flow measurement and management for improving cooling and energy efficiency in raised-floor data centers: a survey. IEEE Access 6:48867–48901

    Article  Google Scholar 

  58. Callou G, Ferreira J, Maciel P, Tutsch D, Souza R (2014) An integrated modeling approach to evaluate and optimize data center sustainability, dependability and cost. Energies 7(1):238–277

    Article  Google Scholar 

  59. Silva B, Callou G, Tavares E, Maciel P, Figueiredo J, Sousa E, Araujo C, Magnani F, Neves F (2013) Astro: an integrated environment for dependability and sustainability evaluation. Sustain Comput Inf Syst 3(1):1–17

    Google Scholar 

  60. Ferreira J, Callou G, Maciel P, Tutsch D (2020) An algorithm to optimise the energy distribution of data centre electrical infrastructures. Int J Grid Util Comput 11(3):419–433

    Article  Google Scholar 

  61. Rosendo D, Gomes D, Santos GL, Goncalves G, Moreira A, Ferreira L, Endo PT, Kelner J, Sadok D, Mehta A et al (2019) A methodology to assess the availability of next-generation data centers. J Supercomput 75(10):6361–6385

    Article  Google Scholar 

Download references

Funding

None.

Author information

Authors and Affiliations

Authors

Contributions

These authors contributed equally to this work.

Corresponding author

Correspondence to Lubnnia Souza.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Souza, L., Camboim, K., Araujo, J. et al. Dependability evaluation and sensitivity analysis of data center cooling systems. J Supercomput 79, 19607–19635 (2023). https://doi.org/10.1007/s11227-023-05419-5

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-023-05419-5

Keywords

Navigation