Skip to main content

Critical Comparison of Data Imputation Techniques at IoT Edge

  • Conference paper
  • First Online:
Intelligent Distributed Computing XIV (IDC 2021)

Part of the book series: Studies in Computational Intelligence ((SCI,volume 1026))

Included in the following conference series:

Abstract

The advances within the Internet of Things and sensor systems put the focus on the improvement of the data reliability as close to the edge as possible. This work investigates how well-established techniques can be used for the imputation of contaminated data. We look at the performance of four algorithms for different contamination rates and error bursts of variable length. Furthermore, the algorithms are also evaluated on a constrained environment to showcase the behavior of data imputation methods at the edge of an IoT-based system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://iot.ee.surrey.ac.uk:8080/datasets/pollution/readme.txt.

References

  1. Tariq, U.U., Ali, H., Liu, L., Hardy, J., Kazim, M., Ahmed, W.: Energy-aware scheduling of streaming applications on edge-devices in IoT based healthcare. IEEE Trans. Green Commun. Netw. (in press) (2021)

    Google Scholar 

  2. Ferrara, E., Fragale, L., Fortino, G., Song, W., Perra, C., Di Mauro, M., Liotta, A.: An AI approach to collecting and analyzing human interactions with urban environments. IEEE Access 7, 141476–141486 (2019)

    Article  Google Scholar 

  3. Pace, P., Aloi, G., Gravina, R., Caliciuri, G., Fortino, G., Liotta, A.: An edge-based architecture to support efficient applications for healthcare industry 4.0. IEEE Trans. Ind. Inform. 15(1), 481–489 (2019)

    Google Scholar 

  4. Xiang, X., Gui, J., Xiong, N.N.: An integral data gathering framework for supervisory control and data acquisition systems in green IoT. IEEE Trans. Green Commun. Netw. (in press) (2021)

    Google Scholar 

  5. Erhan, L., Ndubuaku, M., Di Mauro, M., Song, W., Chen, M., Fortino, G., Bagdasar, O., Liotta, A.: Smart anomaly detection in sensor systems: a multi-perspective review. Inf. Fusion 67, 64–79 (2021)

    Article  Google Scholar 

  6. González-Vidal, A., Rathore, P., Rao, A.S., Mendoza-Bernal, J., Palaniswami, M., Skarmeta-Gómez, A.F.: Missing data imputation with Bayesian maximum entropy for Internet of Things applications. IEEE Internet Things J. (in press) (2020)

    Google Scholar 

  7. Fekade, B., Maksymyuk, T., Kyryk, M., Jo, M.: Probabilistic recovery of incomplete sensed data in IoT. IEEE Internet Things J. 5(4), 2282–2292 (2018)

    Article  Google Scholar 

  8. Yan, X., Xiong, W., Hu, L., Wang, F., Zhao, K.: Missing value imputation based on gaussian mixture model for the internet of things. Math. Probl. Eng. 15 (2015)

    Google Scholar 

  9. Peixoto, M.L.M., Souza, I., Barbosa, M., Lecomte, G., Batista, B.G., Kuehne, B.T., Filho, D.M.L.: Data missing problem in smart surveillance environment. In: Proceedings of the IEEE HPCS, pp. 962–969 (2018)

    Google Scholar 

  10. Roman Tkachenko, Ivan Izonin, Natalia Kryvinska, Ivanna Dronyuk, and Khrystyna Zub. An approach towards increasing prediction accuracy for the recovery of missing IoT data based on the GRNN-SGTM ensemble. Sensors, 20(2625), 2020

    Google Scholar 

  11. Liu, Y., Dillon, T., Yu, W., Rahayu, W., Mostafa, F.: Missing value imputation for industrial IoT sensor data with large gaps. IEEE Internet Things J. 7(8), 6855–6867 (2020)

    Article  Google Scholar 

  12. Kong, L., Xia, M., Liu, X., Wu, M., Liu, X.: Data loss and reconstruction in sensor networks. In: Proceedings of the IEEE INFOCOM, pp. 1654–1662 (2013)

    Google Scholar 

  13. P. Fountas and K. Kolomvatsos. Ensemble based data imputation at the Edge. In proc. IEEE ICTAI, pages 961–968, 2020

    Google Scholar 

  14. Kolomvatsos, K., Papadopoulou, P., Anagnostopoulos, C., Hadjiefthymiades, S.: A spatio-temporal data imputation model for supporting analytics at the Edge. In: Lecture Notes in Computer Science, vol. 11701, pp. 138–150. Springer, Cham (2019)

    Google Scholar 

  15. Pan, L., Li, J.: K-Nearest neighbor based missing data estimation algorithm in wireless sensor networks. Wirel. Sens. Netw. 2, 115–122 (2010)

    Article  Google Scholar 

  16. Ali, M.I., Gao, F., Mileo, A.: Citybench: a configurable benchmark to evaluate RSP engines using smart city datasets. In: Proceedings of the ISWC, pp. 374–389 (2015)

    Google Scholar 

  17. Azur, Melissa J., Stuart, Elizabeth A., Frangakis, Constantine, Leaf, Philip J.: Multiple imputation by chained equations: what is it and how does it work?: multiple imputation by chained equations. Int. J. Methods Psychiatr. Res. 20(1), 40–49 (2011)

    Article  Google Scholar 

  18. Raghunathan, T.W., Lepkowksi, J.M., Van Hoewyk, J., Solenbeger, P.: A multivariate technique for multiply imputing missing values using a sequence of regression models. Surv. Pract. 27, 85–95 (2001)

    Google Scholar 

  19. Stekhoven, Daniel J., Bühlmann, Peter: MissForest–non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1), 112–118 (2012)

    Article  Google Scholar 

  20. Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T., Tibshirani, R., Botstein, D., Altman, R.B.: Missing value estimation methods for DNA microarrays. Bioinformatics 17(6), 520–525 (2001). June

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Laura Erhan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Erhan, L., Di Mauro, M., Bagdasar, O., Liotta, A. (2022). Critical Comparison of Data Imputation Techniques at IoT Edge. In: Camacho, D., Rosaci, D., Sarné, G.M.L., Versaci, M. (eds) Intelligent Distributed Computing XIV. IDC 2021. Studies in Computational Intelligence, vol 1026. Springer, Cham. https://doi.org/10.1007/978-3-030-96627-0_4

Download citation

Publish with us

Policies and ethics