Skip to main content

Design and Evaluation of a Data Collector and Analyzer to Monitor the COVID-19 and Other Epidemic Outbreaks

  • Conference paper
  • First Online:
Information Technology and Systems (ICITS 2021)

Abstract

Pandemic situations require analysis, rapid decision making by managers and constant monitoring of the effectiveness of collective health-related approaches. These works can be more efficient with the help of clearer and more representative views of the data, as well as with the application of other measures and projections of epidemiological nature to the information. However, performing such aggregations of data can become a major challenge in contexts with little or no integration between databases, or even when there is no technological core mature enough to feed and integrate technological advances in the workflow of health professionals. This paper aims to present the results of the meeting of project approaches such as the OSEMN framework, a software architecture based on Microservices and Data Science technologies, all tools aligned to make the environment of descriptive and predictive analysis of epidemic data (still dominated by manual processes) evolve towards a context of automation, reliability and application of machine learning, aiming at the organization and addition of value to the results of the data structuring. The project’s validation objects were the documents of the situation of the Covid-19 disease pandemic in the region of the city of Brasília, Federal District, Capital of Brazil.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Administrative regions are government divisions of Brazil’s Federal District. For the sake of simplicity, an administrative region can be defined as a district.

References

  1. Apache Tika - a content analysis toolkit. https://tika.apache.org/

  2. Apache Tika - Class PDFParser. http://tika.apache.org/1.24.1/api/org/apache/tika/parser/pdf/PDFParser

  3. Bash Reference Manual. https://www.gnu.org/savannah-checkouts/gnu/bash/manual/bash.html

  4. Coronavirus 10-day forecast. https://covid19forecast.science.unimelb.edu.au/

  5. Cron(8)—linux manual page. https://man7.org/linux/man-pages/man8/cron.8.html

  6. Flask - Web Development. https://flask.palletsprojects.com/en/1.1.x/

  7. MongoDB Database. https://www.mongodb.com/

  8. Python Programming Language. https://www.python.org/

  9. Regular Expressions Reference. https://www.regular-expressions.info/reference.html

  10. Scikit-Learn. https://scikit-learn.org/stable/

  11. Sqlite database. https://www.sqlite.org/index.html

  12. Tika-Python. https://github.com/chrismattmann/tika-python

  13. de Carvalho Victorino, M., Shiessl, M., Oliveira, E.C., Ishikawa, E., de Holanda, M.T., de Lima Hokama, M.: Uma proposta de ecossistema de big data para a análise de dados abertos governamentais concetados. Informação & sociedade 27(1) (2017)

    Google Scholar 

  14. Dineva, K., Atanasova, T.: OSEMN process for working over data acquired by iot devices mounted in beehives. Current Trends Nat. Sci. 7(13), 47–53 (2018)

    Google Scholar 

  15. Diouf, R., Sarr, E.N., Sall, O., Birregah, B., Bousso, M., Mbaye, S.N.: Web scraping: state-of-the-art and areas of application. In: 2019 IEEE International Conference on Big Data (Big Data), pp. 6040–6042 (2019)

    Google Scholar 

  16. Lima, C.M.A.D.O.: Information about the new coronavirus disease (COVID-19). Radiologia Brasileira 53, V–VI, April 2020. https://doi.org/10.1590/0100-3984.2020.53.2e1

  17. Mogk, M.: Automatic data processing in analyses of epidemics. In: Epidemics of Plant Diseases: Mathematical Analysis and Modeling, pp. 55–77. Springer (1974). https://doi.org/10.1007/978-3-642-96220-2_3

  18. Santos, A.D.F.D., Fonseca Sobrinho, D., Araujo, L.L., Procópio, C.d.S.D., Lopes, E.A.S., Lima, A.M.d.L.D.d., Reis, C.M.R.D., Abreu, D.M.X.D., Jorge, A.O., Matta-Machado, A.T.: Incorporação de Tecnologias de Informação e Comunicação e qualidade na atenção básica em saúde no Brasil 33 (2017). https://doi.org/10.1590/0102-311x00172815

  19. Santos, S.R., Paula, A.F.A., Lima, J.P.: O enfermeiro e sua percepção sobre o sistema manual de registro no prontuário. Revista Latino-Americana de Enfermagem 11, 80–87 (2003). https://doi.org/10.1590/S0104-11692003000100012

    Article  Google Scholar 

  20. Vargiu, E., Urru, M.: Exploiting web scraping in a collaborative filtering-based approach to web advertising. Artif. Intell. Research 2(1), 44–54 (2013)

    Google Scholar 

  21. Wang, C.J., Ng, C.Y., Brook, R.H.: Response to COVID-19 in Taiwan: big data analytics, new technology, and proactive testing. Jama 323(14), 1341–1342 (2020)

    Article  Google Scholar 

  22. Wimberly, M.C., Chuang, T.W., Henebry, G.M., Liu, Y., Midekisa, A., Semuniguse, P., Senay, G.: A computer system for forecasting malaria epidemic risk using remotely-sensed environmental data (2012)

    Google Scholar 

Download references

Acknowledgment

The authors would like to thank the support of the Brazilian research, development and innovation agencies CNPq (Projects INCT SegCiber 465741/2014-2, PQ-2 312180/2019-5 and LargEWiN BRICS2017-591), CAPES (Projects FORTE 23038.007604/2014-69 and PROBRAL 88887.144009/2017-00) and FAPDF (UIoT Projects 0193.001366/2016 and SSDDC 0193. 001365/2016), as well as the support of the LATITUDE/UnB Laboratory (SDN Project 23106. 099441/2016-43), cooperation with the Ministry of the Economy (TEDs DIPLA 005/2016 and ENAP 083/2016), the Office of Institutional Security of the Presidency of the Republic (TED 002/2017), the Attorney General’s Office (TED 697,935/2019) and the Administrative Council for Economic Defense (TED 08700.000047/2019-14).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lucas C. de Almeida .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Almeida, L.C.d., Filho, F.L.d.C., Marques, N.A., Prado, D.S.d., Mendonça, F.L.L.d., Sousa Jr., R.T.d. (2021). Design and Evaluation of a Data Collector and Analyzer to Monitor the COVID-19 and Other Epidemic Outbreaks. In: Rocha, Á., Ferrás, C., López-López, P.C., Guarda, T. (eds) Information Technology and Systems. ICITS 2021. Advances in Intelligent Systems and Computing, vol 1330. Springer, Cham. https://doi.org/10.1007/978-3-030-68285-9_3

Download citation

Publish with us

Policies and ethics