Abstract
Data collection is the key task of wireless sensing nodes. Efficient data collection is very important for precise decision making. The decision taken can be very crucial in the context of environment, human health, military applications, etc. With the advances in technology, the sensing nodes have become more reliable. But still, there is a possibility that the data might get missed or corrupted. This may be due to various reasons like node failure, communication failure or may be due to a regular maintenance schedule. Missing data may lead to inappropriate decision making which may be fatal and thus might lead to harsh consequences. Thus, there arises a need to handle the missed data efficiently. Here in this paper, a framework for data prediction and forecasting is proposed using Auto ARIMA in R. The framework is implemented first by imputing missing data and then by without imputing missing data. It is observed that the proposed model with imputation of missing data demonstrates a better fit than without imputation of missing data. Hence, proper imputation leads to better forecasting which is important for taking appropriate decisions in various applications. In addition, the proposed framework has also been compared with three state-of-art models, and the results demonstrate that the proposed model exhibits a better fit.
Similar content being viewed by others
Data Availability
The dataset named “Air Quality Data Set” analysed during the current study is available in the UCI Machine Learning Repository. https://archive.ics.uci.edu/ml/datasets/Air+quality.
Code Availability
Auto ARIMA in R is used to conduct this study.
References
Akyildiz, I. F., Su, W., Sankarasubramaniam, Y., & Cayirci, E. (2002). Wireless sensor networks: A survey. Computer Networks, 38(4), 393–422.
Yick, J., Mukherjee, B., & Ghosal, D. (2008). Wireless sensor network survey. Computer Networks, 52(12), 2292–2330.
Gowrishankar, S., Basavaraju, T.G., Manjaiah, D.H., & Sarkar, Subir K. (2008). “Issues in wireless sensor networks”. Proceedings of the World Congress on Engineering, London UK. pp. 176–187.
Graham, J. W. (2009). Missing data analysis: Making it work in the real world. Annual Review of Psychology, 60, 549–576.
Bhandari, S., Bergmann, N., Jurdak, R., & Kusy, B. (2017). Time series data analysis of wireless sensor network measurements of temperature. Sensors, 17(6), 1221.
Brockwell, P. J., & Davis, R. A. (2016). Introduction to time series and forecasting. Springer.
Box, G. E. P., Jenkins, G. M., Reinsel, G. C., & Ljung, G. M. (2016). Time series analysis: Forecasting and control. John Willey and Sons.
Hyndman, R. J., & Athanasopoulos, G. (2018). Forecasting: Principles and practice. OTexts.
Tyralis, H., & Papacharalampous, G. (2017). Variable selection in time series forecasting using random forests. Algorithms, 10(4), 114.
Tyralis, H., & Papacharalampous, G. A. (2018). Large scale assessment of Prophet for multi-step ahead forecasting of monthly streamflow. Advances in Geosciences, 45, 147–153.
Papacharalampous, G., Tyralis, H., & Koutsoyiannis, D. (2018). Predictability of monthly temperature and precipitation using automatic time series forecasting methods. Acta Geophysica, 66(4), 807–831.
Fekade, B., Maksymyuk, T., Kyryk, M., & Jo, M. (2017). Probabilistic recovery of incomplete sensed data in IoT. IEEE Internet of Things Journal, 5(4), 2282–2292.
Alsheikh, M. A., Lin, S., Niyato, D., & Tan, H. P. (2014). Machine learning in wireless sensor networks: Algorithms, strategies, and applications. IEEE Communications Surveys and Tutorials, 16(4), 1996–2018.
Jaques, N., Taylor, S., Sano, A., & Picard, R. (2017). Multimodal autoencoder: A deep learning approach to filling in missing sensor data and enabling better mood prediction. In 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII), pp. 202–208. IEEE.
Das, M., & Ghosh, S. K. (2017). A deep-learning-based forecasting ensemble to predict missing data for remote sensing analysis. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 10(12), 5228–5236.
Zhang, Y. F., Thorburn, P. J., Xiang, W., & Fitch, P. (2019). SSIM–A deep learning approach for recovering missing time series sensor data. IEEE Internet of Things Journal, 6(4), 6618–6628.
Yuan, H., Xu, G., Yao, Z., Jia, J., & Zhang, Y. (2018). Imputation of missing data in time series for air pollutants using long short-term memory recurrent neural networks. In Proceedings of the 2018 ACM International Joint Conference and 2018 International Symposium on Pervasive and Ubiquitous Computing and Wearable Computers, pp. 1293–1300.
Li, L., Zhang, J., Wang, Y., & Ran, B. (2018). Missing value imputation for traffic-related time series data based on a multi-view learning method. IEEE Transactions on Intelligent Transportation Systems, 20(8), 2933–2943.
Banaezadeh, F., & Haghighat, A. T. (2015). Evaluation ARIMA modeling-based target tracking scheme in wireless sensor networks using statistical tests. Wireless Personal Communications, 84, 2913–2925.
Saverio De Vito (2008). UCI Machine Learning Repository-Air Quality Data Set. https://archive.ics.uci.edu/ml/datasets/Air+quality.
De Vito, S., Massera, E., Piga, M., Martinotto, L., & Di Francia, G. (2008). On field calibration of an electronic nose for benzene estimation in an urban pollution monitoring scenario. Sensors and Actuators B: Chemical, 129(2), 750–757.
Funding
This research received no external funding support.
Author information
Authors and Affiliations
Contributions
Conceptualization: Ankur Choudhary and Santosh Kumar; Formal analysis: Ankur Choudhary; Investigation: Ankur Choudhary and Manish Sharma; Methodology: Ankur Choudhary and Santosh Kumar; Project administration: Santosh Kumar; Supervision: Santosh Kumar; Validation: Ankur Choudhary, Manish Sharma and KP Sharma; Visualization: Ankur Choudhary; Writing-original draft: Ankur Choudhary; Writing-review & editing: Santosh Kumar and KP Sharma.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Consent for Publication
All authors have read and agreed to the published version of the manuscript.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Choudhary, A., Kumar, S., Sharma, M. et al. A Framework for Data Prediction and Forecasting in WSN with Auto ARIMA. Wireless Pers Commun 123, 2245–2259 (2022). https://doi.org/10.1007/s11277-021-09237-x
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11277-021-09237-x