Abstract
The advent of cloud computing and autonomous data centers operating fully without human supervision has highlighted the need for fault-tolerant architectures and intelligent software tools for system parameter optimization. Demands on computational throughput have to be balanced with environmental concerns, such as energy consumption and waste heat. Using multivariate time series data collected from an experimental data center, we build a state model using clustering, then estimate the states represented by the clusters using both a hidden Markov model and a long-short term memory neural net. Knowledge of future states of the system can be used to solve tasks such as reduced energy consumption and optimized resource allocation in the data center.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ahuja, N., et al.: Power variation trend prediction in modern datacenter’. In: 2017 16th IEEE Intersociety Conference on Thermal and Thermo-Mechanical Phenomena in Electronic Systems (ITherm) (2017)
Aibin, M., et al.: Traffic prediction for inter-data center cross-stratum optimization problems. In: Proceedings of the 2018 International Conference on Computing, Networking and Communications (ICNC): Optical and Grid Computing, p. 6 (2018)
Bauer, A., et al.: Time Series forecasting for self-aware systems. Proc. IEEE 108(7) , 1068–1093, July 2020
Boser, B.E., Guyon, I.M., Vapnik, M.N.: A training algorithm for optimal margin classifiers. In: Proceedings of the 5th Annual ACM Workshop on Computational Learning Theory, pp. 144–152. ACM Press, Pittsburgh (1992)
Brännvall, R., et al.: EDGE: Microgrid Data Center with mixed energy storage. In: Proceedings of the Eleventh ACM International Conference on Future Energy Systems, E-Energy 2020: The Eleventh ACM International Conference on Future Energy Systems. Virtual Event Australia, pp. 466–473. ACM, June 12, 2020
Davies, D.L., Bouldin, D.W.: A cluster separation measure. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-1.2, pp. 224–227, April 1979
Gurumdimma, N., Jhumka, A.: Detection of recovery patterns in cluster systems using resource usage data. In: 2017 IEEE 22nd Pacific Rim International Symposium on Dependable Computing (PRDC) (2017)
Gurumdimma, N., et al.: CRUDE: combining resource usage data and error logs for accurate error detection in large-scale distributed systems. In: 2016 IEEE 35th Symposium on Reliable Distributed Systems (SRDS), pp. 51–60. IEEE, Budapest, September 2016
Hochreiter, S., Schmidhuber, J.: Long short-term memory. In: Neural Comput. 9(8), 1735–1780 (1997)
Hochreiter, S., et al.: Gradient flow in recurrent nets: the difficulty of learning long-term dependencies. In: Kolen, J.F., Kremer, S.C. (eds.) A Field Guide to Dynamical Recurrent Networks, IEEE (2001)
ITEA3/AutoDC. About AutoDC. https://autodc.tech/about/. Accepted 8 Aug 2021
Jolliffe, I.T.: Principal component analysis. In: Lovric, M. (ed.) International Encyclopedia of Statistical Science. Springer, Berlin (2002). https://doi.org/10.1007/978-3-642-04898-2_455
Kaiser, H.F.: The application of electronic computers to factor analysis. In: Educational and Psychological Measurement, vol. 20. pp. 141–151, April 1960
Kephart, J.O., Chess, D.M.: The vision of autonomic computing. Computer 36(1), 41–50 (2003)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: 3rd International Conference on Learning Representations (ICLR). San Diego, CA, USA (2015). arXiv: 1412.6980
Kocsis, L., Szepesvári, C.: Bandit based Monte-Carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282–293. Springer, Heidelberg (2006). https://doi.org/10.1007/11871842_29
Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theory 28(2), 129–137 (1982)
Makanju, A., Zincir-Heywood, A.N., Milios, E.E.: System state discovery via information content clustering of system logs. In: 2011 Sixth International Conference on Availability, Reliability and Security (ARES), pp. 301–306. IEEE, Vienna, August 2011
Rabiner, L.R.: A tutorial on hidden markov models and selected applications in speech recognition. Proc. IEEE 77(2), 257–286 (1989)
Rajachandrasekar, R., Besseron, X., Panda, D.K.: Monitoring and predicting hardware failures in HPC clusters with FTBIPMI. In: 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum, pp. 1136–1143. IEEE, Shanghai, China, May 2012
Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987)
Schwarz, G.: Estimating the dimension of a model. Ann. Stat. 6(2), 461–464 (1978)
Shi, J., He, G., Liu, X.: Anomaly detection for key performance indicators through machine learning. In: 2018 International Conference on Network Infrastructure and Digital Content (IC-NIDC), pp. 1–5. IEEE, Guiyang, Aug. 2018
Steinbach, M., Ertöz, L., Kumar, V.: The challenges of clustering high dimensional data. In: Wille, L.T. (ed.) New Directions in Statistical Physics, pp. 273–309. Springer, Berlin, Heidelberg (2004). https://doi.org/10.1007/978-3-662-08968-2_16
Toivola, J., Prada, M.A., Hollmén, J.: Novelty detection in projected spaces for structural health monitoring. In: Cohen, P.R., Adams, N.M., Berthold, M.R. (eds.) IDA 2010. LNCS, vol. 6065, pp. 208–219. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-13062-5_20
Yang, W., et al.: Hard drive failure prediction using big data. In: 2015 IEEE 34th Symposium on Reliable Distributed Systems Workshop (SRDSW), pp. 13–18. IEEE, Montreal, QC, September 2015
Acknowledgments
We acknowledge the computational resources provided by the Aalto Science-IT project. We thank Rickard Brännvall and Jonas Gustafsson of RISE ICE Datacenter for their help with the data set.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Rinta-Koski, OP., Sirola, M., Nguyen, L.N., Hollmén, J. (2021). State Discovery and Prediction from Multivariate Sensor Data. In: Lemaire, V., Malinowski, S., Bagnall, A., Guyet, T., Tavenard, R., Ifrim, G. (eds) Advanced Analytics and Learning on Temporal Data. AALTD 2021. Lecture Notes in Computer Science(), vol 13114. Springer, Cham. https://doi.org/10.1007/978-3-030-91445-5_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-91445-5_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-91444-8
Online ISBN: 978-3-030-91445-5
eBook Packages: Computer ScienceComputer Science (R0)