Abstract
Time series data has a crucial role in business. It reveals temporal trends and patterns, making it possible for decision-makers to make informed decisions and mitigate problems even before they happen. The existence of missing values in time series can bring difficulties in the analysis and lead to inaccurate conclusions. Thus, there is the need to solve this issue by performing missing data imputation on time series.
In this work, we propose a Focalize KNN that takes advantage of time series properties to perform missing data imputation. The approach is tested with different methods, combinations of parameters and features. The results of the proposed approach, with overlap and disjoint missing patterns, show Focalize KNN is very beneficial in scenarios with disjoint missing patterns.
This work is supported by FEDER, through POR LISBOA 2020 and COMPETE 2020 of the Portugal 2020 Project CityCatalyst POCI-01-0247-FEDER-046119. Ana Almeida acknowledges the Doctoral Grant from Fundação para a Ciência e Tecnologia (2021.06222.BD). Susana Brás is funded by national funds, European Regional Development Fund, FSE, through COMPETE2020 and FCT, in the scope of the framework contract foreseen in the numbers 4, 5 and 6 of the article 23, of the Decree-Law 57/2016, of August 29, changed by Law 57/2017, of July 19. We thank OpenWeather for providing the datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Almeida, A., Brás, S., Oliveira, I., Sargento, S.: Vehicular traffic flow prediction using deployed traffic counters in a city. Futur. Gener. Comput. Syst. 128, 429–442 (2022). https://doi.org/10.1016/j.future.2021.10.022
Cao, W., Wang, D., Li, J., Zhou, H., Li, L., Li, Y.: Brits: bidirectional recurrent imputation for time series. In: Bengio, S., Wallach, H., Larochelle, H., Grauman, K., Cesa-Bianchi, N., Garnett, R. (eds.) Advances in Neural Information Processing Systems, vol. 31. Curran Associates, Inc. (2018)
Che, Z., Purushotham, S., Cho, K., Sontag, D., Liu, Y.: Recurrent neural networks for multivariate time series with missing values. Sci. Rep. 8 (2018). https://doi.org/10.1038/s41598-018-24271-9
Chicco, D., Warrens, M.J., Jurman, G.: The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ Comput. Sci. 7 (2021). https://doi.org/10.7717/peerj-cs.623
Grus, J.: Data Science from Scratch: First Principles with Python. O’Reilly Media, Inc. (2019)
Khayati, M., Lerner, A., Tymchenko, Z., Cudre-Mauroux, P.: Mind the gap: an experimental evaluation of imputation of missing values techniques in time series. Proc. VLDB Endow. 13, 768–782 (2020). https://doi.org/10.14778/3377369.3377383
Kuppannagari, S.R., Fu, Y., Chueng, C.M., Prasanna, V.K.: Spatio-temporal missing data imputation for smart power grids. In: Proceedings of the Twelfth ACM International Conference on Future Energy Systems, e-Energy 2021, pp. 458–465. Association for Computing Machinery, New York (2021). https://doi.org/10.1145/3447555.3466586
Li, L., Zhang, J., Wang, Y., Ran, B.: Missing value imputation for traffic-related time series data based on a multi-view learning method. IEEE Trans. Intell. Transp. Syst. 20(8), 2933–2943 (2019). https://doi.org/10.1109/TITS.2018.2869768
Luo, Y., Zhang, Y., Cai, X., Yuan, X.: E\(^2\)GAN: end-to-end generative adversarial network for multivariate time series imputation. In: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, pp. 3094–3100. International Joint Conferences on Artificial Intelligence Organization (2019). https://doi.org/10.24963/ijcai.2019/429
Oehmcke, S., Zielinski, O., Kramer, O.: KNN ensembles with penalized DTW for multivariate time series imputation, pp. 2774–2781 (2016). https://doi.org/10.1109/IJCNN.2016.7727549
Shamsi, J.A.: Resilience in smart city applications: faults, failures, and solutions. IT Prof. 22(6), 74–81 (2020). https://doi.org/10.1109/MITP.2020.3016728
Sun, B., Ma, L., Cheng, W., Wen, W., Goswami, P., Bai, G.: An improved k-nearest neighbours method for traffic time series imputation. In: 2017 Chinese Automation Congress (CAC), pp. 7346–7351 (2017). https://doi.org/10.1109/CAC.2017.8244105
Wettschereck, D., Dietterich, T.: Locally adaptive nearest neighbor algorithms. In: Cowan, J., Tesauro, G., Alspector, J. (eds.) Advances in Neural Information Processing Systems, vol. 6. Morgan-Kaufmann (1993)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Almeida, A., Brás, S., Sargento, S., Pinto, F.C. (2023). Time Series Imputation in Faulty Systems. In: Pertusa, A., Gallego, A.J., Sánchez, J.A., Domingues, I. (eds) Pattern Recognition and Image Analysis. IbPRIA 2023. Lecture Notes in Computer Science, vol 14062. Springer, Cham. https://doi.org/10.1007/978-3-031-36616-1_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-36616-1_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-36615-4
Online ISBN: 978-3-031-36616-1
eBook Packages: Computer ScienceComputer Science (R0)