Abstract
The interactions and dependencies between software and hardware are often neglected in modeling system reliability in the past few decades due to the mathematical complexity. However, many system failures occurred from the interactions or simultaneous occurrences of software and hardware. This paper first proposes a new diagram of categorizing system-level failures and further incorporates such a diagram into the development of complex system reliability framework. System-level failures result from software subsystem, hardware subsystem, and the interactions of software and hardware subsystems. The focus of this study is on the investigation of the interactions failures generated from the interactions of software and hardware subsystems. In addition to the considerations of total hardware failures, software-induced hardware failures, and hardware-induced software failures introduced by Zhu and Pham (Mathematics 7(11):1049, 2019), we further introduce the partial hardware failures that can be respectively induced by hardware and software to explicitly demonstrate the dependencies and interactions between software and hardware. Hence, a new complex system reliability framework is developed based on such system-level failure categorization with the Markov process. Furthermore, the numerical examples are studied to illustrate the impacts on system reliability with the changes of state transition parameters that modeling the interactions of software and hardware subsystems. Finally, we have studied two maintenance policies of the proposed complex system reliability model.















Similar content being viewed by others
References
Avizienis, A., Laprie, J. C., Randell, B., & Landwehr, C. (2004). Basic concepts and taxonomy of dependable and secure computing. IEEE Transactions on Dependable and Secure Computing, 1(1), 11–33.
Bar-Yam, Y. (2019). Dynamics of complex systems. Cambridge: CRC Press.
Dessouky, G., Gens, D., Haney, P., Persyn, G., Kanuparthi, A., & Khattri, H., et al. (2019). Hardfails: insights into software-exploitable hardware bugs. In: Proceedings of 28th USENIX Security Symposium, Santa Clara, California, USA (pp. 213–230).
Evtyushkin, D., Riley, R., Abu-Ghazaleh, N. C. A. E., & Ponomarev, D. (2018). Branchscope: A new side-channel attack on directional branch predictor. ACM SIGPLAN Notices, 53(2), 693–707.
Ferraiuolo, A., Xu, R., Zhang, D., Myers, A. C., & Suh, G. E. (2017). Verification of a practical hardware security architecture through static information flow analysis. In: Proceedings of the twenty-second international conference on architectural support for programming languages and operating systems, Xi’an, China (pp. 555–568).
Friedman, M. A., Tran, P. Y., & Goddard, P. L. (1992). Reliability techniques for combined hardware and software systems. In: Proceedings of the annual reliability and maintainability symposium, Las Vegas, NV, USA.
Goel, A. L., & Okumoto, K. (1979). Time-dependent error-detection rate model for software reliability and other performance measures. IEEE Transactions on Reliability, 28(3), 206–211.
Gras, B., Razavi, K., Bos, H., & Giuffrida, C. (2018). Translation leak-aside buffer: Defeating cache side-channel protections with {TLB} attacks. In: Proceedings of 27th {USENIX} security symposium, Baltimore, Maryland, USA (pp. 955–972).
Gu, Y., Fan, C., Liang, L., & Zhang, J. (2019). Reliability calculation method based on the Copula function for mechanical systems with dependent failure. Annals of Operations Research. https://doi.org/10.1007/s10479-019-03202-5.
Hecht, H., & Hecht, M. (1986). Software reliability in the system context. IEEE Transactions on Software Engineering, 1, 51–58.
Jerraya, A., & Wolf, W. (2005). Hardware/software interface codesign for embedded systems. Computer, 38(2), 63–69.
Jing, Q., Vasilakos, A. V., Wan, J., Lu, J., & Qiu, D. (2014). Security of the internet of things: Perspectives and challenges. Wireless Networks, 20(8), 2481–2501.
Kanoun, K., & Ortalo-Borrel, M. (2000). Fault-tolerant system dependability-explicit modeling of hardware and software component-interactions. IEEE Transactions on Reliability, 49(4), 363–376.
Kapur, P. K., Pham, H., Aggarwal, A. G., & Kaur, G. (2012). Two dimensional multi-release software reliability modeling and optimal release planning. IEEE Transactions on Reliability, 61(3), 758–768.
Laprie, J. C., & Kanoun, K. (1992). X-ware reliability and availability modeling. IEEE Transactions on Software Engineering, 2, 130–147.
Laprie, J. C., Kanoun, K., Béounes, C., & Kaâniche, M. (1991). The KAT (Knowledge-Action-Transformation) approach to the modeling and evaluation of reliability and availability growth. IEEE Transactions on Software Engineering, 4, 370–382.
Lorén, S., & de Maré, J. (2015). Maintenance for reliability—A case study. Annals of Operations Research, 224(1), 111–119.
Lung, C. H., Zhang, X., & Rajeswaran, P. (2016). Improving software performance and reliability in a distributed and concurrent environment with an architecture-based self-adaptive framework. Journal of Systems and Software, 121, 311–328.
Minamino, Y., Inoue, S., & Yamada, S. (2016). NHPP-based change-point modeling for software reliability assessment and its application to software development management. Annals of Operations Research, 244(1), 85–101.
Moghimi, A., Wichelmann, J., Eisenbarth, T., & Sunar, B. (2019). Memjam: A false dependency attack against constant-time crypto implementations. International Journal of Parallel Programming, 47(4), 538–570.
Park, J., & Baik, J. (2015). Improving software reliability prediction through multi-criteria based dynamic model selection and combination. Journal of Systems and Software, 101, 236–244.
Park, J., Kim, H. J., Shin, J. H., Baik, J. (2012). An embedded software reliability model with consideration of hardware related software failures. In: Proceedings of 2012 IEEE sixth international conference on software security and reliability, Gaithersburg, MD, USA (pp. 207–214).
Pham, H., & Wang, H. (2000). Optimal (τ, T) opportunistic maintenance of a k-out-of-n: G system with imperfect PM and partial failure. Naval Research Logistics (NRL), 47(3), 223–239.
Pignal, P. I. (1988). An analysis of hardware and software availability exemplified on the IBM 3725 communication controller. IBM Journal of Research and Development, 32(2), 268–278.
Ross, S. M. (2014). Introduction to probability models. Cambridge: Academic Press.
Sinuany-Stern, Z. (1999). Reliability and maintenance in production control-an introduction. Annals of Operations Research, 91, 1–10.
Stark, G. E. (1987). Dependability evaluation of integrated hardware/software systems. IEEE Transactions on Reliability, 36(4), 440–444.
Tao, F., Zuo, Y., Da Xu, L., & Zhang, L. (2014). IoT-based intelligent perception and access of manufacturing resource toward cloud manufacturing. IEEE Transactions on Industrial Informatics, 10(2), 1547–1557.
Teng, X., Pham, H., & Jeske, D. R. (2006). Reliability modeling of hardware and software interactions, and its applications. IEEE Transactions on Reliability, 55(4), 571–577.
Van Driel, W. D., Schuld, M., Wijgers, R., & Van Kooten, W. E. J. (2014). Software reliability and its interaction with hardware reliability. In: Proceedings of 2014 15th international conference on thermal, mechanical and mulit-physics simulation and experiments in microelectronics and microsystems, Ghent, Belgium (pp. 1–8).
Wan, J., Tang, S., Shu, Z., Li, D., Wang, S., Imran, M., et al. (2016). Software-defined industrial internet of things in the context of industry 4.0. IEEE Sensors Journal, 16(20), 7373–7380.
Wang, J., Bai, G., Li, Z., & Zuo, M. J. (2020). A general discrete degradation model with fatal shocks and age-and state-dependent nonfatal shocks. Reliability Engineering and System Safety, 193, 106648.
Wang, F., & Li, H. (2018). Distribution modeling for reliability analysis: Impact of multiple dependences and probability model selection. Applied Mathematical Modelling, 59, 483–499.
Wang, H., & Pham, H. (1996). Optimal age-dependent preventive maintenance policies with imperfect maintenance. International Journal of Reliability, Quality and Safety Engineering, 3(02), 119–135.
Wang, Y., & Pham, H. (2011). Modeling the dependent competing risks with multiple degradation processes and random shock using time-varying copulas. IEEE Transactions on Reliability, 61(1), 13–22.
Wang, S., Wan, J., Li, D., & Zhang, C. (2016a). Implementing smart factory of industrie 4.0: An outlook. International Journal of Distributed Sensor Networks, 12(1), 3159805.
Wang, S., Wan, J., Zhang, D., Li, D., & Zhang, C. (2016b). Towards smart factory for industry 4.0: A self-organized multi-agent system with big data based feedback and coordination. Computer Networks, 101, 158–168.
Welke, S. R., Johnson, B. W., & Aylor, J. H. (1995). Reliability modeling of hardware/software systems. IEEE Transactions on Reliability, 44(3), 413–418.
Xu, D., He, J., & Yang, Z. (2019). Reliability prediction based on Birnbaum–Saunders model and its application to smart meter. Annals of Operations Research. https://doi.org/10.1007/s10479-019-03429-2.
Yang, G. (2007). Life cycle reliability engineering. Hoboken: Wiley.
Yarom, Y., Genkin, D., & Heninger, N. (2017). CacheBleed: A timing attack on OpenSSL constant-time RSA. Journal of Cryptographic Engineering, 7(2), 99–112.
Zeng, Y., Xing, L., Zhang, Q., & Jia, X. (2019). An analytical method for reliability analysis of hardware-software co-design system. Quality and Reliability Engineering International, 35(1), 165–178.
Zhao, X., Cai, J., Mizutani, S., & Nakagawa, T. (2020a). Preventive replacement policies with time of operations, mission durations, minimal repairs and maintenance triggering approaches. Journal of Manufacturing Systems. https://doi.org/10.1016/j.jmsy.2020.04.003.
Zhao, X., Chen, M., & Nakagawa, T. (2020b). Periodic replacement policies with shortage and excess costs. Annals of Operations Research. https://doi.org/10.1007/s10479-020-03566-z.
Zhao, X., Wu, C., Wang, X., & Sun, J. (2020c). Reliability analysis of k-out-of-n: F balanced systems with multiple functional sectors. Applied Mathematical Modelling, 82, 108–124.
Zhou, H., Wang, S., Qi, F., & Gao, S. (2019). Maintenance modeling and operation parameters optimization for complex production line under reliability constraints. Annals of Operations Research. https://doi.org/10.1007/s10479-019-03228-9.
Zhu, M., & Pham, H. (2018a). A two-phase software reliability modeling involving with software fault dependency and imperfect fault removal. Computer Languages, Systems and Structures, 53, 27–42.
Zhu, M., & Pham, H. (2018b). A software reliability model incorporating martingale process with gamma-distributed environmental factors. Annals of Operations Research. https://doi.org/10.1007/s10479-018-2951-7.
Zhu, M., & Pham, H. (2018c). A multi-release software reliability modeling for open source software incorporating dependent fault detection process. Annals of Operations Research, 269(1–2), 773–790.
Zhu, M., & Pham, H. (2019). A novel system reliability modeling of hardware, software, and interactions of hardware and software. Mathematics, 7(11), 1049.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhu, M. A new framework of complex system reliability with imperfect maintenance policy. Ann Oper Res 312, 553–579 (2022). https://doi.org/10.1007/s10479-020-03852-w
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10479-020-03852-w