Abstract
During the COVID-19 pandemic, Internet and SN technologies are an effective resource for disease surveillance and a good way to communicate to prevent disease outbreaks. In December 2019, the frequency of the words COVID-19, SARS-CoV-2, and pandemic was very low in online environment, being only few posts informing that, “the mysterious coronavirus in China could spread.” After March 1, 2020, there have been numerous research projects that analyze the flows of messages in social networks in order to perform real-time analyses, to follow the trends of the pandemic evolution, to identify new disease outbreaks, and to elaborate better predictions. In this context, this study analyzes the posts collected during [August–September 2020], on the Twitter network, that contain the word “COVID-19,” written both in Romanian and English. For the Romanian language posts, we obtained a dictionary of the words used, for which it was calculated their occurrence frequency in the multitude of tweets collected and pre-processed. The frequency of words for non-noisy messages was identified from the multitude of words in the obtained dictionary. For the equivalent of these words in English, we obtained the probability density of words in the extracted and pre-processed posts written in English on Twitter. This study also identifies the percentage of similarity between tweets that contain words with a high frequency of apparition. The similarity for the collected and pre-processed tweets that have “ro.” in the filed called Language has been computed making use of Levenshtein algorithm. These calculations are intended to quickly help find the relevant posts related to the situation generated by the COVID-19 pandemic. It is well known that the costs of analyzing data from social networks are very low compared to the costs involved in analyzing data from the centers of government agencies; therefore, the proposed method may be useful.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Avci, C., Tekinerdogan, B., Athanasiadis, I.N.: Software architectures for big data: a systematic literature review. Big Data Anal. 5(1), 1–53 (2020). https://doi.org/10.1186/s41044-020-00045-1
Guo, H.D., Zhang, L., Zhu, L.W.: Earth observation big data for climate change research. Adv. Clim. Chang. Res. 6(2), 108–117 (2015)
Zhao, P., Hu, H.: Geographical patterns of traffic congestion in growing megacities: big data analytics from Beijing. Cities 92, 164–174 (2019)
Tan, C., Sun, L., Liu, K.: Big data architecture for pervasive healthcare: a literature review. In: Proceedings of the Twenty-Third European Conference on Information Systems, pp. 26–29. Münster, Germany (2015)
Fitzgerald, R.C.: Big data is crucial to the early detection of cancer. Nat. Med. 26(1), 19–20 (2020)
Moustafa, K.: Make good use of big data: a home for everyone, Elsevier public health emergency collection. Cities 107, (2020)
Kramer, A., Guillory, J., Hancock, J.: Experimental evidence of massive scale emotional contagion through social networks. PNAS 111(24), 8788–8790 (2014)
Banerjee, S., Jenamani, M., Pratihar, D.K.: A survey on influence maximization in a social network. Knowl. Inf. Syst. 62, 3417–3455 (2020)
Yue, Y.: Scale adaptation of text sentiment analysis algorithm in big data environment: Twitter as data source. In: Atiquzzaman, M., Yen, N., Xu, Z. (eds.) Big Data Analytics for Cyber-Physical System in Smart City. BDCPS 2019. Advances in Intelligent Systems and Computing, vol. 1117, pp. 629–634. Springer, Singapore (2019)
Badaoui, F., Amar, A., Ait Hassou, L., et al.: Dimensionality reduction and class prediction algorithm with application to microarray big data. J. Big Data 4, 32 (2017)
Teodorescu, H.N.L., Pirnau, M.: In: Muhammad Nazrul Islam (ed.) Cap 6: ICT for Early Assessing the Disaster Amplitude, for Relief Planning, and for Resilience Improvement (2020). e-ISBN: 9781785619977
Shan, S., Zhao, F.R., Wei, Y., Liu, M.: Disaster management 2.0: a real-time disaster damage assessment model based on mobile social media data—A case study of Weibo (Chinese Twitter). Saf. Sci. 115, 393–413 (2019)
Teodorescu, H.N.L.: Using analytics and social media for monitoring and mitigation of social disasters. Procedia Eng. 107C, 325–334 (2015)
Pirnau, M.: Tool for monitoring web sites for emergency-related posts and post analysis. In: Proceedings of the 8th Speech Technology and Human-Computer Dialogue (SpeD), pp. 1–6. Bucharest, Romania, 14–17 Oct (2015).
Wang, B., Zhuang, J.: Crisis information distribution on Twitter: a content analysis of tweets during hurricane sandy. Nat. Hazards 89(1), 161–181 (2017)
Eriksson, M., Olsson, E.K.: Facebook and Twitter in crisis communication: a comparative study of crisis communication professionals and citizens. J. Contingencies Crisis Manage. 24(4), 198–208 (2016)
Laylavi, F., Rajabifard, A., Kalantari, M.: Event relatedness assessment of Twitter messages for emergency response. Inf. Process. Manage. 53(1), 266–280 (2017)
Banujan, K., Banage Kumara, T.G.S., Paik, I.: Twitter and online news analytics for enhancing post-natural disaster management activities. In: Proceedings of the 9th International Conference on Awareness Science and Technology (iCAST), pp. 302–307. Fukuoka (2018)
Takahashi, B., Tandoc, E.C., Carmichael, C.: Communicating on Twitter during a disaster: an analysis of tweets during typhoon Haiyan in the Philippines. Comput. Hum. Behav. 50, 392–398 (2015)
Teodorescu, H.N.L., Pirnau, M.: Analysis of requirements for SN monitoring applications in disasters—a case study. In: Proceedings of the 8th International Conference on Electronics, Computers and Artificial Intelligence (ECAI), pp. 1–6. Ploiesti, Romania (2016)
Ahmed, W., Bath, P.A., Sbaffi, L., Demartini, G.: Novel insights into views towards H1N1 during the 2009 pandemic: a thematic analysis of Twitter data. Health Inf. Libr. J. 36, 60–72 (2019)
Asadzadeh, A., Kötter, T., Salehi, P., Birkmann, J.: Operationalizing a concept: the systematic review of composite indicator building for measuring community disaster resilience. Int. J. Disaster Risk Reduction 25, 147–162 (2017)
Teodorescu, H.N.L., Saharia, N.: A semantic analyzer for detecting attitudes on SNs. In: Proceedings of the International Conference on Communications (COMM), pp. 47–50. Bucharest, Romania (2016)
Teodorescu, H.N.L.: On the responses of social networks’ to external events. In: Proceedings of the 7th International Conference on Electronics, Computers and Artificial Intelligence, pp. 13–18. Bucharest, Romania (2015)
Gottfried, J., Shearer, E.: News use across social media platforms 2016. White Paper, 26. Pew Research Center (2016)
Gupta, A., Lamba, H., Kumaraguru, P., Joshi, A.: Faking sandy: characterizing and identifying fake images on twitter during hurricane sandy. In WWW’13 Proceedings of the 22nd International Conference on World Wide Web, pp. 729–736 (2013)
Allcott, H., Gentzkow, M.: Social media and fake news in the 2016 election. J. Econ. Perspect. 31(2), 211–236 (2017)
Lyu, H., Chen, L., Wang, Y., Luo, J.: Sense and sensibility: characterizing social media users regarding the use of controversial terms for COVID-19. IEEE Trans. Big Data (2020)
Teodorescu, H.N.L., Bolea, S.C.: On the algorithmic role of synonyms and keywords in analytics for catastrophic events. In: Proceedings of the 8th International Conference on Electronics, Computers and Artificial Intelligence, ECAI, pp. 1–6. Ploiesti, Romania (2016)
Teodorescu, H.N.L.: Emergency-related, social network time series: description and analysis. In: Rojas, I., Pomares, H. (eds.) Time Series Analysis and Forecasting. Contributions to Statistics, pp. 205–215. Springer, Cham (2016)
Bolea, S.C.: Vocabulary, synonyms and sentiments of hazard-related posts on social networks. In: Proceedings of the 8th Conference Speech Technology and Human-Computer Dialogue (SpeD), pp. 1–6. Bucharest, Romania (2015)
Bolea, S.C.: Language processes and related statistics in the posts associated to disasters on social networks. Int. J. Comput. Commun. Control 11(5), 602–612 (2016)
Teodorescu, H.N.L.: Survey of IC&T in disaster mitigation and disaster situation management, Chapter 1. In: Teodorescu, H.-N., Kirschenbaum, A., Cojocaru, S., Bruderlein, C. (eds.), Improving Disaster Resilience and Mitigation—IT Means and Tools. NATO Science for Peace and Security Series—C, pp. 3–22. Springer, Dordrecht (2014)
Kanis, J., Skorkovská, L.: Comparison of different lemmatization approaches through the means of information retrieval performance. In: Proceedings of the 13th International Conference on Text, Speech and Dialogue TSD'10, pp. 93–100 (2010)
Ferrucci, D., Lally, A.: UIMA: an architectural approach to unstructured information processing in the corporate research environment. Nat. Lang. Eng. 10(3–4), 327–348 (2004)
Jacobs, P.S.: Joining statistics with NLP for text categorization. In: Proceedings of the Third Conference on Applied Natural Language Processing, pp. 178–185 (1992)
Jivani, A.G.: A comparative study of stemming algorithms. Int. J Comp Tech. Appl 2, 1930–1938 (2011)
Ingason, A.K., Helgadóttir, S., Loftsson, H., Rögnvaldsson, E.: A mixed method lemmatization algorithm using a hierarchy of linguistic identities (HOLI). In: Raante, A., Nordström, B. (eds.), Advances in Natural Language Processing. Lecture Notes in Computer Science, vol. 5221, pp. 205–216. Springer, Berlin (2008)
Krouska, A., Troussas, C., Virvou, M.: The effect of preprocessing techniques on Twitter sentiment analysis. In: Proceedings of the International Conference on Information, Intelligence, Systems & Applications, pp. 13–15. Chalkidiki, Greece (2016)
Babanejad, N., Agrawal, A., An, A., Papagelis, M.: A comprehensive analysis of preprocessing for word representation learning in affective tasks. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 5799–5810 (2020)
Camacho-Collados, J., Pilehvar, M.T.: On the role of text preprocessing in neural network architectures: an evaluation study on text categorization and sentiment analysis. In: Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, pp. 40–46. Association for Computational Linguistics (2018)
Davis, C.A., Varol, O., Ferrara, E., Flammini, A., Menczer, F.: BotOrNot: a system to evaluate social bots, a system to evaluate social bots. In: Proceedings of the 25th International Conference Companion on World Wide Web, pp. 273–274 (2016)
Ferrara, E.: COVID-19 on Twitter: Bots, Conspiracies, and Social Media Activism. arXiv preprint arXiv:2004.09531 (2020)
Metaxas, P., Finn, S.T.: The infamous#Pizzagate conspiracy theory: Insight from a Twitter Trails investigation. Wellesley College Faculty Research and Scholarship (2017)
Teodorescu, H.N.L.: Social signals and the ENR index—noise of searches on SN with keyword-based logic conditions. In: Proceedings of the International Symposium on Signals, Circuits and Systems. Iasi, Romania (2015)
Aouragh, S.I.: Adaptating Levenshtein distance to contextual spelling correction. Int. J. Comput. Sci. Appl. 12(1), 127–133 (2015)
Kobzdej, P.: Parallel application of Levenshtein’s distance to establish similarity between strings. Front. Artif. Intell. Appl. 12(4) (2003)
Rani, S.; Singh, J.: Enhancing Levenshtein’s edit distance algorithm for evaluating document similarity. In: Communications in Computer and Information Science, pp. 72–80. Springer, Singapore (2018)
Acknowledgements
I thank Prof. H.N. Teodorescu for the suggestions on this research and for correcting several preliminary versions of this chapter.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Pirnau, M. (2022). An Analysis of the Content in Social Networks During COVID-19 Pandemic. In: Ben Ahmed, M., Teodorescu, HN.L., Mazri, T., Subashini, P., Boudhir, A.A. (eds) Networking, Intelligent Systems and Security. Smart Innovation, Systems and Technologies, vol 237. Springer, Singapore. https://doi.org/10.1007/978-981-16-3637-0_62
Download citation
DOI: https://doi.org/10.1007/978-981-16-3637-0_62
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-3636-3
Online ISBN: 978-981-16-3637-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)