Abstract
In recent years, power grid data missing which caused by manual operation error and equipment failure often occurs, bringing difficulties to power grid big data analysis. In order to solve the problem that the accuracy of grid data filling is insufficient, a method of grid missing data filling based on historical data mining assisted multi-dimensional scenario analysis is proposed. Firstly, the highly correlated attribute data are selected as the reference basis for missing attribute data filling through the Fluctuation cross-correlation Analysis (FCCA), and the correlation degree is further quantified by combining weights. Secondly, based on load scenario analysis, the similarity between data sources is measured by Dynamic Time Warping (DTW). Finally, combined with DTW and combined weight, the date which has the most similar data was found, and use the same time period data of it to fill the missing data. The simulation results show that the proposed data filling method has higher accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zhang, N.: Methodolgical progress note: handling missing data in clinical research. J. Hosp. Med. 14(4), 237–239 (2020)
Zhao, Y.: Statistical inference for missing data mechanisms. Stat. Med. 39(1), 4325–4333 (2020)
Venugopalan, J., Chanani, N., et al.: Novel data imputation for multiple types of missing data in intensive care units. J. Biomed. Health Inf. 23(3), 1243–1250 (2019)
Mahmud, M.S., Huang, J.Z., et al.: A survey of data partitioning and sampling methods to support big data analysis. Big Data Min. Anal. 3(2), 85–101 (2020)
Markovsky, I.: A missing data approach to data-driven filtering and control. IEEE Trans. Autom. Control 62(4), 1972–1978 (2017)
Panda, B.S., Adhikari, R.K.: A method for classification of missing values using data mining techniques. In: International Conference on Computer Science, Engineering and Applications (ICCSEA), pp. 1–5. IEEE (2020)
Xiao, J.L.: SVM and KNN ensemble learning for traffic incident detection. Physica A 517, 29–35 (2019)
Zhang, S., Li, X., Zong, M., Zhu, X., Wang, R.: Efficient kNN classification with different numbers of nearest neighbors. IEEE Trans. Neural Netw. Learn. Syst. 29(5), 1774–1785 (2018)
Marchang, N., Tripathi, R.: KNN-ST: exploiting spatio-temporal correlation for missing data inference in environmental crowd sensing. IEEE Sens. J. 21(4), 3429–3436 (2021)
Purwar, A., Singh, S.K.: Hybrid prediction model with missing value imputation for medical data. Expert Syst. Appl. 42(13), 5621–5631 (2015)
Sun, C., Chen, Y., et al.: Imputation of missing data from offshore wind farms using spatio-temporal correlation and feature correlation. Energy 229(1), 161–173 (2021)
Huawei, H., Ning, S., et al.: Alarm root-cause identification for petrochemical process system based on fluctuation correlation analysis. In: Chinese Control and Decision Conference (CCDC), pp. 373–376 (2019)
Lu, C., Li, L., et al.: Application of combination weighting method to weight calculation in performance evaluation of ICT. In: 15th International Conference on Advanced Learning Technologies, pp. 258–259. IEEE (2015)
Wang, X., Jiao, Y., et al.: Estimation of clusters number and initial centers of k-means algorithm using watershed method. In: 14th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES), pp. 505–508 (2015)
Hong, J.Y., Park, S.H., et al.: Segmented dynamic time warping based signal pattern classification. In: International Conference on Computational Science and Engineering (CSE) and International Conference on Embedded and Ubiquitous Computing (EUC), pp. 263–265. IEEE (2019)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Chen, G., Zhu, Z., Yang, L., Lin, G., Yun, Y., Jiang, P. (2022). Power Grid Missing Data Filling Method Based on Historical Data Mining Assisted Multi-dimensional Scenario Analysis. In: Tian, Y., Ma, T., Khan, M.K., Sheng, V.S., Pan, Z. (eds) Big Data and Security. ICBDS 2021. Communications in Computer and Information Science, vol 1563. Springer, Singapore. https://doi.org/10.1007/978-981-19-0852-1_27
Download citation
DOI: https://doi.org/10.1007/978-981-19-0852-1_27
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-0851-4
Online ISBN: 978-981-19-0852-1
eBook Packages: Computer ScienceComputer Science (R0)