Abstract
Due to the large-scale growth of data, the storage scale of data centers is getting larger and larger. Hard disk is the main storage medium, once a failure occurs, it will bring huge losses to users and enterprises. In order to improve the reliability of storage systems, many machine learning methods have been widely employed to predict hard disk failure in the past few decades. However, due to the large number of different models of hard disks in the heterogeneous disk system, traditional machine learning methods cannot build a general model. Inspired by a DANN based unsupervised domain adaptation approach for image classification, in this paper, we propose the DFPTL (Disk Failure Prediction via Transfer Learning) approach, which introduce the DANN approach to predict failure in heterogeneous disk systems by reducing the distribution differences between different models of disk datasets. This approach only needs unlabeled data (the target domain) of a specific disk model and the labeled data (the source domain) collected from a different disk model from the same manufacturer. Experimental results on real-world datasets demonstrate that DFPTL can achieve adaptation effect in the presence of domain shifts and outperform traditional machine learning algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Vishwanath, K.V., Nagappan, N.: Characterizing cloud computing hardware reliability. In: Proceedings of the 1st ACM symposium on Cloud computing, pp. 193–204 (2010)
Allen, B.: Monitoring hard disks with smart. Linux J. 117, 74–77 (2004)
Eckart, B., Chen, X., He X., et al.: Failure prediction models for proactive fault tolerance within storage systems. In: 2008 IEEE International Symposium on Modeling, Analysis and Simulation of Computers and Telecommunication Systems, pp. 1–8. IEEE (2008)
Murray, J.F., Hughes, G.F., Kreutz-Delgado, K.: Machine learning methods for predicting failures in hard drives: a multiple-instance application. J. Mach. Learn. Res. 6(May), 783–816 (2005)
Xiao, J., Xiong, Z., Wu, S., et al.: Disk failure prediction in data centers via online learning. In: Proceedings of the 47th International Conference on Parallel Processing, pp. 1–10 (2018)
Li, J., Ji, X., Jia, Y., et al.: Hard drive failure prediction using classification and regression trees. In: 2014 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, pp. 383–394. IEEE (2014)
Murray, J.F., Hughes, G.F., Kreutz-Delgado, K.: Hard drive failure prediction using non-parametric statistical methods. In: Proceedings of ICANN/ICONIP (2003)
Yang, W., Hu, D., Liu, Y., et al.: Hard drive failure prediction using big data. In: 2015 IEEE 34th Symposium on Reliable Distributed Systems Workshop (SRDSW), pp. 13–18. IEEE (2015)
Zhao, Y., Liu, X., Gan, S., et al.: Predicting disk failures with HMM-and HSMM-based approaches. In: Industrial Conference on Data Mining, pp. 390–404. Springer, Berlin, Heidelberg (2010). https://doi.org/10.1007/978-3-642-14400-4_30
Botezatu, M.M., Giurgiu, I., Bogojeska, J., et al.: Predicting disk replacement towards reliable data centers. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 39–48 (2016)
Zhu, B., Wang, G., Liu, X., et al.: Proactive drive failure prediction for large scale storage systems. In: 2013 IEEE 29th symposium on mass storage systems and technologies (MSST), pp. 1–5. IEEE (2013)
Xu, Y., Sui, K., Yao, R., et al.: Improving service availability of cloud systems by predicting disk error. In: 2018 {USENIX} Annual Technical Conference ({USENIX}{ATC} 18), pp. 481–494 (2018)
Pereira, F.L.F., dos Santos Lima, F D., de Moura Leite, L G., et al.: Transfer learning for Bayesian networks with application on hard disk drives failure prediction. In: 2017 Brazilian Conference on Intelligent Systems (BRACIS), pp. 228–233. IEEE (2017)
Jiang, W., Hu, C., Zhou, Y., et al.: Are disks the dominant contributor for storage failures? a comprehensive study of storage subsystem failure characteristics. ACM Trans. Storage (TOS). 4(3), 1–25 (2008)
Ganin, Y., Lempitsky, V.: Unsupervised domain adaptation by backpropagation. In: International conference on machine learning, pp. 1180–1189 (2015)
Hughes, G.F., Murray, J.F., Kreutz-Delgado, K., et al.: Improved disk-drive failure warnings. IEEE Trans. Reliab. 51(3), 350–357 (2002)
Hamerly, G., Elkan, C.: Bayesian approaches to failure prediction for disk drives. ICML 1, 202–209 (2001)
Wang, Y., Ma, E.W.M., Chow, T.W.S., et al.: A two-step parametric method for failure prediction in hard disk drives. IEEE Trans. Ind. Inf. 10(1), 419–430 (2013)
Xu, C., Wang, G., Liu, X., et al.: Health status assessment and failure prediction for hard drives with recurrent neural networks. IEEE Trans. Comput. 65(11), 3502–3508 (2016)
Zhu, Y., Zhuang, F., Wang, J., et al.: Multi-representation adaptation network for cross-domain image classification. Neural Netw. 119, 214–221 (2019)
Prettenhofer, P., Stein, B.: Cross-language text classification using structural correspondence learning. In: Proceedings of the 48th annual meeting of the association for computational linguistics, pp. 1118–1127 (2010)
Wang, J., Zheng, V.W., Chen, Y., et al.: Deep transfer learning for cross-domain activity recognition. In: proceedings of the 3rd International Conference on Crowd Science and Engineering, pp. 1–8 (2018)
Kermany, D.S., Goldbaum, M., Cai, W., et al.: Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell 172(5), 1122–1131 (2018)
Zhang, J., Zhou, K., Huang, P., et al.: Transfer learning based failure prediction for minority disks in large data centers of heterogeneous disk systems. In: Proceedings of the 48th International Conference on Parallel Processing, pp.1–10 (2019)
Jiang, T., Zeng, J., Zhou, K., et al.: Lifelong disk failure prediction via GAN-Based anomaly detection. In: 2019 IEEE 37th International Conference on Computer Design (ICCD), pp. 199–207. IEEE (2019)
Zhang, X., Kim, J., Lin, Q., et al.: Cross-dataset time series anomaly detection for cloud systems. In: 2019 {USENIX} Annual Technical Conference ({USENIX}{ATC} 19), pp. 1063–1076 (2019)
Shimodaira, H.: Improving predictive inference under covariate shift by weighting the log-likelihood function. J. Stat. Plan. Infer. 90(2), 227–244 (2000)
Yang, Y., Loog, M.: Active learning using uncertainty information. In: 2016 23rd International Conference on Pattern Recognition (ICPR), pp. 2646–2651. IEEE (2016)
Powers, D.M.: Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation (2011)
He, H., Garcia, E.A.: Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 21(9), 1263–1284 (2009)
Liu, F.T., Ting, K.M., Zhou, Z.H.: Isolation forest. In: 2008 Eighth IEEE International Conference on Data Mining, pp. 413–422. IEEE (2008)
Breunig, M.M., Kriegel, H.P., Ng, R.T., et al.: LOF: identifying density-based local outliers. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp. 93–104 (2000)
dos Santos Lima, F.D., Amaral, G.M.R., de Moura Leite, L.G., et al.: Predicting failures in hard drives with lstm networks. In: 2017 Brazilian Conference on Intelligent Systems (BRACIS), pp. 222–227. IEEE (2017)
Acknowledgements
This research is supported by the key research and development program of Jiangsu province( BE2019012).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhao, R. et al. (2021). Hard Disk Failure Prediction via Transfer Learning. In: Tian, Y., Ma, T., Khan, M.K. (eds) Big Data and Security. ICBDS 2020. Communications in Computer and Information Science, vol 1415. Springer, Singapore. https://doi.org/10.1007/978-981-16-3150-4_43
Download citation
DOI: https://doi.org/10.1007/978-981-16-3150-4_43
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-3149-8
Online ISBN: 978-981-16-3150-4
eBook Packages: Computer ScienceComputer Science (R0)