Abstract
Over the years, data centers have been evolving to meet the new demands of cloud computing platforms, e-commerce, social networking services, and big data. These large data centers must meet various dependability requirements to guarantee the quality of service at a high level of reliability and availability, reducing interoperability time, as this is a major competitive factor for companies. The cooling system plays an important role in the availability of data centers. It is responsible for keeping the IT system at a suitable temperature to avoid hardware and software failures since operating at high temperatures negatively impacts the reliability of the electronic components. This paper investigates reliability and availability metrics and performs parametric sensitivity analysis for data center cooling systems. Our approach is based on modeling reliability block diagrams and the sensitivity analysis technique; the last is to assess how sensitive the availability is about model component failure and repair times.









Similar content being viewed by others
Availability of data and materials
Data supporting the findings of this study are not openly available due to confidentiality reasons and are available from the corresponding author upon reasonable request. The data are arranged in the author’s Academic Drive and can be accessed through this https://docs.google.com/spreadsheets/u/1/d/1rCK6NSv1lSc4KKcFQNMGfE62p2bIYlOR/edit?usp=drive_web%20&ouid=101398600213951117518%20&rtpof=true.
References
Han B, Li W, Li M, Liu L, Song J (2020) Study on libr/h2o absorption cooling system based on enhanced geothermal system for data center. Energy Rep 6:1090–1098
Callou G, Maciel P, Tutsch D, Ferreira J, Araújo J, Souza R (2013) Estimating sustainability impact of high dependable data centers: a comparative study between Brazilian and US energy mixes. Computing 95(12):1137–1170
Institute U (2021) Sistema Tier Classification. Disponível em. https://pt.uptimeinstitute.com/tiers. Accessed on Dec 2021
Camboim K, Ferreira J, Melo C, Araujo J, Alencar F, Maciel P (2021) Dependability and sustainability evaluation of data center electrical architectures. In: 2021 IEEE International Systems Conference (SysCon). IEEE, pp 1–8
Marin PS (2011) Data centers-desvendando cada passo: conceitos, projeto, infraestrutura física e eficiência energética. Érica, São Paulo
Camboim K, Souza E, Guimarães A, Araujo J, Maciel P (2021) Reliability-and-availability sensitivity analysis on convergent network infrastructures: methodology and case study. In: 2021 IEEE International Conference on Systems, Man and Cybernetics (SMC). IEEE, pp 1–8
Ding T, Chen X, Cao H, He Z, Wang J, Li Z (2021) Principles of loop thermosyphon and its application in data center cooling systems: a review. Renew Sustain Energy Rev 150:111389
Fu L, Wan J, Yang J, Cao D, Zhang G (2017) Dynamic thermal and it resource management strategies for data center energy minimization. J Cloud Comput 6(1):1–16
Chen H, Peng Y-h, Wang Y-l (2019) Thermodynamic analysis of hybrid cooling system integrated with waste heat reusing and peak load shifting for data center. Energy Convers Manag 183:427–439
Daraghmeh HM, Wang C-C (2017) A review of current status of free cooling in datacenters. Appl Therm Eng 114:1224–1239
Mujawar A, Kumar S, Krishnan SS, Sawant A (2018) Iot: Green data center strategies. Int J Future Revolut Comput Sci Commun Eng 4(5):170–174
Khalaj AH, Halgamuge SK (2017) A review on efficient thermal management of air-and liquid-cooled data centers: from chip to the cooling system. Appl Energy 205:1165–1188
Liu Y, Wei X, Xiao J, Liu Z, Xu Y, Tian Y (2020) Energy consumption and emission mitigation prediction based on data center traffic and pue for global data centers. Glob Energy Interconnect 3(3):272–282
Zhang X, Lindberg T, Xiong N, Vyatkin V, Mousavi A (2017) Cooling energy consumption investigation of data center it room with vertical placed server. Energy Procedia 105:2047–2052
Souza L, Camboim K, Alencar F (2022) A systematic literature review about integrating dependability attributes, performability and sustainability in the implantation of cooling subsystems in data center. J Supercomput 1–37
Nadjahi C, Louahlia H, Lemasson S (2018) A review of thermal management and innovative cooling strategies for data center. Sustain Comput Inf Syst 19:14–28
Rathnam L (2022) CRAC and CRAH: a complete guide to cooling your data centers and computer rooms. https://techgenix.com/cooling-data-center-crac-crah/. Accessed July 2022
Cronin D (2022) How To Choose Between CRAHs and CRACs. https://www.missioncriticalmagazine.com/articles/93650-how-to-choose-between-crahs-and-cracs. Accessed July 2022
Jose L (2019) Data center cooling infrastructure. https://dc.mynetworkinsights.com/data-center-cooling-infrastructure/. Accessed July 2022
Gomes DM, Endo PT, Gonçalves G, Rosendo D, Santos GL, Kelner J, Sadok D, Mahloo M (2017) Evaluating the cooling subsystem availability on a cloud data center. In: 2017 IEEE Symposium on Computers and Communications (ISCC). IEEE, pp 736–741
Lata M, Kumar V (2016) Innovative cooling strategies for cloud computing data centers. IUP J Inf Technol 12(1)
Carmo G (2021) Data Center #03 RefrigeraçÃO Chiller A ÁGUA CICLO BÁSICO DA REFRIGERAÇÃO. https://www.youtube.com/watch?v=71byYXQGNp4 &t=706s. Accessed July 2022
Chu J, Huang X (2021) Research status and development trends of evaporative cooling air-conditioning technology in data centers. Energy Built Environ
Hale P, Arno RG (2000) Survey of reliability and availability information for power distribution, power generation, and HVAC components for commercial, industrial, and utility installations. In: 2000 IEEE Industrial and Commercial Power Systems Technical Conference. Conference Record (Cat. No. 00CH37053). IEEE, pp 31–54
Johnston C, Partners L (2021) Excool zero engineering review. Report, Cundall
Kuo W, Zuo MJ (2003) Optimal reliability modeling: principles and applications. Wiley, Hoboken
Laprie JCC, Avizienis A, Kopetz H (eds) (1992) Dependability: basic concepts and terminology. Springer, Secaucus
Avizienis A, Laprie JC, Randell B, Landwehr C (2004) Basic concepts and taxonomy of dependable and secure computing. IEEE Trans Dependable Secure Comput 1(1):11–33. https://doi.org/10.1109/TDSC.2004.2
Pradhan DK (1996) Fault-tolerant computer system design. Prentice-Hall Inc., Hoboken
Maciel PRM, Trivedi KS, Matias R, Kim DS (2011) Dependability modeling. Performance and dependability in service computing: concepts, techniques and research directions, pp 53–97. Information Science Reference - Imprint of: IGI Publishing, Hershey, PA
Ebeling CE (1997) An introduction to reliability and maintainability engineering. McGrow-Hill Book Co, Singapura
Matos Júnior, RDS (2016) Identification of availability and performance bottlenecks in cloud computing systems: an approach based on hierarchical models and sensitivity analysis
Dâmaso A, Rosa N, Maciel P (2017) Integrated evaluation of reliability and power consumption of wireless sensor networks. Sensors 17(11):2547
Hamby DM (1994) A review of techniques for parameter sensitivity analysis of environmental models. Environ Monit Assess 32(2):135–154
Frank PM, Eslami M (1980) Introduction to system sensitivity theory. IEEE Trans Syst Man Cybern 10(6):337–338
Dantas RCDSP (2019) Modelos de desempenho, confiabilidade e disponibilidade para o planejamento de sistemas de transporte público
Melo RMD (2017) Análise de sensibilidade aplicada à identificação de pontos que requerem melhoria na disponibilidade em infraestrura de cloud
Jain R (1991) The art of computer systems performance analysis: techniques for experimental design, measurement, simulation, and modeling. Wiley, Hoboken
Ross SM (2017) Introductory statistics. Academic Press, Boston
Matos RDS, Maciel PR, Machida F, Kim DS, Trivedi KS (2012) Sensitivity analysis of server virtualized system availability. IEEE Trans Reliab 61(4):994–1006
Matos R, Araujo J, Oliveira D, Maciel P, Trivedi K (2015) Sensitivity analysis of a hierarchical model of mobile cloud computing. Simul Model Pract Theory 50:151–164
Maciel P, Matos R, Silva B, Figueiredo J, Oliveira D, Fé I, Maciel R, Dantas J (2017) Mercury: performance and dependability evaluation of systems with exponential, expolynomial, and general distributions. In: 2017 IEEE 22nd Pacific Rim International Symposium on Dependable Computing (PRDC). IEEE, pp 50–57
Camboim K, Araujo J, Melo C, Alencar F, Maciel P (2021) Dependability and sensitivity analysis in dense data center networks. In: 2021 16th Iberian Conference on Information Systems and Technologies (CISTI). IEEE, pp 1–6
Camboim K, Ferreira J, Araujo J, Alencar F (2020) Sustainability analysis in data center dense architectures. In: 2020 IEEE 9th International Conference on Cloud Networking (CloudNet). IEEE, pp 1–6
MoDCS (2023) MoDCS Research Group. Disponível em. https://www.modcs.org/. Accessed Feb 2023
Wang J, Zhang Q, Yoon S, Yu Y (2019) Impact of uncertainties on the supervisory control performance of a hybrid cooling system in data center. Build Environ 148:361–371
Junior RM, Guimaraes A, Camboim K, Maciel P, Trivedi K (2011) Sensitivity analysis of availability of redundancy in computer networks. In: The Fourth International Conference on Communication Theory, Reliability, and Quality of Service, Budapest, Hungary. Citeseer
Gomes D, Leoni G, Sadok D, Gonçalves G, Endo P, Maciel P (2020) Temperature variation impact on estimating costs and most critical components in a cloud data centre. Int J Comput Appl Technol 62(4):361–374
Lushpa I, Novikov K, Polesskiy S (2019) The reliability characteristics of the data processing centers cooling systems. In: 2019 International Seminar on Electron Devices Design and Production (SED), pp 1–4. https://doi.org/10.1109/SED.2019.8798415
Jiang J, Wei X, Gao W, Kuroki S, Liu Z (2018) Reliability and maintenance prioritization analysis of combined cooling, heating and power systems. Energies. https://doi.org/10.3390/en11061519
Kumar V, Gupta S, Tripathi AK (2018) Early quantification of reliability for a safety critical and control system: a case study of reactor core cooling system. Int J Eng Technol 7(2.12):248–252
Veeramany A, Pandey MD (2011) Reliability analysis of nuclear component cooling water system using semi-Markov process model. Nucl Eng Des 241(5):1799–1806
Callou G, Andrade E, Ferreira J (2019) Modeling and analyzing availability, cost and sustainability of it data center systems. In: 2019 IEEE International Conference on Systems, Man and Cybernetics (SMC). IEEE, pp 2127–2132
Wang J, Zhang Q, Yoon S, Yu Y (2019) Reliability and availability analysis of a hybrid cooling system with water-side economizer in data center. Build Environ 148:405–416
Koo S, Chung T-S, Kim S (2015) Availability analysis for a data center cooling system with (n, k)-way CRACs. Springer, Berlin
Cheung H, Wang S (2019) Reliability and availability assessment and enhancement of water-cooled multi-chiller cooling systems for data centers. Reliab Eng Syst Saf 191:106573
Wan J, Gui X, Kasahara S, Zhang Y, Zhang R (2018) Air flow measurement and management for improving cooling and energy efficiency in raised-floor data centers: a survey. IEEE Access 6:48867–48901
Callou G, Ferreira J, Maciel P, Tutsch D, Souza R (2014) An integrated modeling approach to evaluate and optimize data center sustainability, dependability and cost. Energies 7(1):238–277
Silva B, Callou G, Tavares E, Maciel P, Figueiredo J, Sousa E, Araujo C, Magnani F, Neves F (2013) Astro: an integrated environment for dependability and sustainability evaluation. Sustain Comput Inf Syst 3(1):1–17
Ferreira J, Callou G, Maciel P, Tutsch D (2020) An algorithm to optimise the energy distribution of data centre electrical infrastructures. Int J Grid Util Comput 11(3):419–433
Rosendo D, Gomes D, Santos GL, Goncalves G, Moreira A, Ferreira L, Endo PT, Kelner J, Sadok D, Mehta A et al (2019) A methodology to assess the availability of next-generation data centers. J Supercomput 75(10):6361–6385
Funding
None.
Author information
Authors and Affiliations
Contributions
These authors contributed equally to this work.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Souza, L., Camboim, K., Araujo, J. et al. Dependability evaluation and sensitivity analysis of data center cooling systems. J Supercomput 79, 19607–19635 (2023). https://doi.org/10.1007/s11227-023-05419-5
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-023-05419-5