Skip to main content

Toward an Effective Identification of Tweet Related to Meningitis Based on Supervised Machine Learning

  • Conference paper
  • First Online:
Book cover Fourth International Congress on Information and Communication Technology

Abstract

Epidemic surveillance requires a rapid collection and integration of data and events related to the disease. Adequate measures, including education and awareness, must be rapidly taken to reduce the disastrous consequences of the disease. However, developing countries, especially those in West Africa, face a lack of real-time data collection and analysis system. This situation delays the analysis of risk and decision making. The aim of this research is to contribute to the surveillance of the meningitis epidemic based on Twitter datasets. The approach, we adopted in this research is divided into two parts. The first part consisted of investigating different methods to convert the tweet data into numerical data that will be used in machine-learning algorithms for the classification tasks. The second step is to evaluate these approaches using different algorithms and compare their performance in term of training time, accuracy, F1-score, and recall. As a result, we found that the SVM machine algorithm performed good with 0.98 of accuracy using the TF-IDF embedding approach while the ANN algorithm performed good with accuracy of 0.95 using the skip-gram embedding model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Mowery, J.: Twitter influenza surveillance: quantifying seasonal misdiagnosis patterns and their impact on surveillance estimates. Online J. Public Health Inform. (2016)

    Google Scholar 

  2. Paul, M., Dredze, M., Broniatowski, D., Generous, N.: Worldwide influenza surveillance through twitter. In: AAAI (2015)

    Google Scholar 

  3. Culotta, A: Towards detecting influenza epidemics by analyzing twitter messages. In: KDD Workshop on Social Media Analytics (2010)

    Google Scholar 

  4. Broniatowski, D.A., Michael J.P., Dredze, M.: National and local influenza surveillance through twitter: an analysis of the 2012–2013 influenza epidemic. PLoS ONE (2013)

    Google Scholar 

  5. United nations department of economic and social affairs/population division, world population prospects, p. 17 (2017)

    Google Scholar 

  6. Logan, S.A.E., MacMahon E.: Viral meningitis. BMJ (2008)

    Google Scholar 

  7. Savory, E.C., Cuevas, L.E., Yassin, M.A., Hart, C.A., Molesworth, A.M.: Thomson MC Evaluation of the meningitis epidemics risk model in Africa. Epidemiol. Infect. 134, 1047–1051 (2006)

    Article  Google Scholar 

  8. Kaburi, B.B., Kubio, C., Kenu, E., Ameme, D.K. et al.: Evaluation of bacterial meningitis surveillance data of the northern region. Ghana, 2010–2015, Pan Afr. Med. J. (2017)

    Google Scholar 

  9. Lingani, C., Bergeron-Caron, C., Stuart, J.M., et al.: Meningococcal meningitis surveillance in the African meningitis belt, 2004–2013. Clin. Infect. Dis. (2015)

    Google Scholar 

  10. Lamb, A., Paul, M.J., Dredze, M.: Separating fact from fear: tracking flu infections on twitter. In: North American Chapter of the Association for Computational Linguistics (NAACL) (2013)

    Google Scholar 

  11. Salton, G., Wong, A., Yang, C.: A vector space model for automatic indexing, communications of the ACM 18 (1975)

    Google Scholar 

  12. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)

  13. Bengio, Y., Schwenk, H., Senecal, J.-S., Morin, F., Gauvain, J.-L,: Neural Probabilistic Language Models, Innovations in Machine Learning. Springer (2006)

    Google Scholar 

  14. Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Proceedings of NIPS 2013 (2013)

    Google Scholar 

  15. Béré, W.R.C., Camara, G., Malo, S., Lo, M., Ouaro, S.: Towards meningitis ontology for the annotation of text corpora. In: Kebe, M.F., Gueye, C., Ndiaye, A. (Eds.) Innovation and interdisciplinary solutions for underserved areas. CNRIA 2017, InterSol 2017. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol. 204. Springer, Cham (2018)

    Google Scholar 

Download references

Acknowledgements

This research would have been impossible without the support of the Japanese International Cooperation Agency in short JICA that allowed me to pursue a Master thesis at Miyagi University. I would like to thank JICA for their financial support. I would also like to thank the CEA-MITIC, which is a research institution in ICT-based in Sénegal that supported me financially to participate in this conference.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Thierry Roger Bayala .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bayala, T.R., Malo, S., Togashi, A. (2020). Toward an Effective Identification of Tweet Related to Meningitis Based on Supervised Machine Learning. In: Yang, XS., Sherratt, S., Dey, N., Joshi, A. (eds) Fourth International Congress on Information and Communication Technology. Advances in Intelligent Systems and Computing, vol 1027. Springer, Singapore. https://doi.org/10.1007/978-981-32-9343-4_23

Download citation

  • DOI: https://doi.org/10.1007/978-981-32-9343-4_23

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-32-9342-7

  • Online ISBN: 978-981-32-9343-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics