Skip to main content

Assessing Data Quality: An Approach for the Spread of COVID-19

  • Conference paper
  • First Online:
Recent Challenges in Intelligent Information and Database Systems (ACIIDS 2023)

Abstract

The work aims to develop a method for assessing the quality of publicly available data collections on the spread of the COVID-19 pandemic with daily infection statistics, recoveries and deaths. The World Health Organization, European Center for Disease Prevention and Control, Johns Hopkins University and Ministry of Health of the Republic of Poland provide this data as proof of concept. Metrics have been proposed that describe the most important quality features for this type of data collection - accuracy, completeness and consistency. Additional measures have also been defined based on anomaly detection, credibility and correlation between sets. A quality assessment method has been developed that uses specific metrics. The effectiveness of measures was tested on original and modified data. The findings showed that the measures were defined correctly. The method assigns lower-quality categories to datasets containing irregularities and higher for data with fewer errors.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Behkamal, B., Kahani, M., Bagheri, E., Jeremic, Z.: A metrics-driven approach for quality assessment of linked open data. J. Theor. Appl. Electron. Commer. Res. 9(2), 11–12 (2014)

    Article  Google Scholar 

  2. Benford, F.: The law of anomalous numbers. Proc. Am. Philos. Soc. 78(4), 551–572 (1938)

    MATH  Google Scholar 

  3. Chen, H., Hailey, D., Wang, N., Yu, P.: A review of data quality assessment methods for public health information systems. Int. J. Environ. Res. Public Health 11(5), 5170–5207 (2014)

    Article  Google Scholar 

  4. Farhadi, N.: Can we rely on COVID-19 data? An assessment of data from over 200 countries worldwide. Sci. Progr. 104(2), 1–19 (2021)

    Google Scholar 

  5. Farhadi, N., Lahooti, H.: Forensic analysis of COVID-19 data from 198 countries two years after the pandemic outbreak. COVID 2(4), 472–484 (2022)

    Article  Google Scholar 

  6. Kolias, P.: Applying Benford’s law to COVID-19 data: the case of the European Union. J. Public Health 44, e221–e226 (2022)

    Article  Google Scholar 

  7. Pucher, S., Król, D.: A Quality Assessment Tool for Koblenz Datasets Using Metrics-Driven Approach. In: Fujita, H., Fournier-Viger, P., Ali, M., Sasaki, J. (eds.) IEA/AIE 2020. LNCS (LNAI), vol. 12144, pp. 747–758. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-55789-8_64

    Chapter  Google Scholar 

  8. Wang, G., et al.: Comparing and integrating us COVID-19 data from multiple sources with anomaly detection and repairing. J. Appl. Stat. 50(11–12), 2408–2434 (2023)

    Google Scholar 

Download references

Acknowledgments

Part of the work presented in this paper received financial support from the statutory funds at the Wrocław University of Science and Technology.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dariusz Król .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Król, D., Bodek, A. (2023). Assessing Data Quality: An Approach for the Spread of COVID-19. In: Nguyen, N.T., et al. Recent Challenges in Intelligent Information and Database Systems. ACIIDS 2023. Communications in Computer and Information Science, vol 1863. Springer, Cham. https://doi.org/10.1007/978-3-031-42430-4_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-42430-4_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-42429-8

  • Online ISBN: 978-3-031-42430-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics