Abstract
Natural language processing (NLP) plays a significant role in tools for the COVID-19 pandemic response, from detecting misinformation on social media to helping to provide accurate clinical information or summarizing scientific research. However, the approaches developed thus far have not benefited all populations, regions or languages equally. We discuss ways in which current and future NLP approaches can be made more inclusive by covering low-resource languages, including alternative modalities, leveraging out-of-the-box tools and forming meaningful partnerships. We suggest several future directions for researchers interested in maximizing the positive societal impacts of NLP.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Avery, E.: Public information officers’ social media monitoring during the Zika virus crisis, a global health threat surrounded by public uncertainty. Public Relations Rev. 43 (2017). https://doi.org/10.1016/j.pubrev.2017.02.018
Batrinca, B., Treleaven, P.: Social media analytics: a survey of techniques, tools and platforms. AI and Soc. 30, 89–116 (2014). https://doi.org/10.1007/s00146-014-0549-4
Baym, N.: Data not seen: the uses and shortcomings of social media metrics. First Monday 18 (2013). https://doi.org/10.5210/fm.v18i10.4873
Beaunoyer, E., Dupéré, S., Guitton, M.J.: COVID-19 and digital inequalities: reciprocal impacts and mitigation strategies. Comput. Human Behav., 106424 (2020)
Boukes, M., Velde, B., Araujo, T., Vliegenthart, R.: What’s the tone? Easy doesn’t do it: analyzing performance and agreement between off-the-shelf sentiment analysis tools. Commun. Methods Meas. 14, 1–22 (10 2019). https://doi.org/10.1080/19312458.2019.1671966
Bullock, J., Luccioni, A., Pham, K.H., Lam, C.S.N., Luengo-Oroz, M., et al.: Mapping the landscape of Artificial Intelligence applications against COVID-19. J. Artif. Intell. Res. 69, 807–845 (2020)
Bullock, J., Luengo-Oroz, M.: Automated speech generation from UN General assembly statements: Mapping risks in AI generated texts. In: International Conference on Machine Learning AI for Social Good Workshop (2019)
Conneau, A., Baevski, A., Collobert, R., Mohamed, A., Auli, M.: Unsupervised cross-lingual representation learning for speech recognition. arXiv preprint arXiv:2006.13979 (2020)
Cresci, S., Di Pietro, R., Petrocchi, M., Spognardi, A., Tesconi, M.: Fame for sale: edetection of fake Twitter followers. Decision Support Syst. 80, 56–71 (2015) https://doi.org/10.1016/j.dss.2015.09.003
Cruz, J.C.B., Tan, J.A., Cheng, C.: Localization of fake news detection via multitask transfer learning. arXiv preprint arXiv:1910.09295 (2019)
Deiner, M.S., et al.: Facebook and Twitter vaccine sentiment in response to measles outbreaks. Health Inf. J. 25(3), 1116–1132 (2019) https://doi.org/10.1177/1460458217740723, pMID: 29148313
DSEG: A framework for the ethical use of advanced data science methods in the humanitarian sector (2020)
Eberhard, D.M., Simons, G.F., Fennig, C.D.E.: Ethnologue: lof the world. twenty-third edition, online version (2020). www.ethnologue.com
Eysenbach, G.: How to fight an infodemic: the four pillars of infodemic management. J. Med. Internet Res. 22, e21820 (2020). https://doi.org/10.2196/21820
Fadaee, M., Bisazza, A., Monz, C.: Data augmentation for low-resource neural machine translation. arXiv preprint arXiv:1705.00440 (2017)
Hidalgo-Sanchis, P.: Using speech-to-text technology to support response to the COVID-19 pandemic. https://www.unglobalpulse.org/2020/05/using-speech-to-text-technology-to-support-response-to-the-covid-19-pandemic/ (May 2020)
Hossain, T., Logan , R.L., Ugarte, A., Matsubara, Y., Young, S., Singh, S.: COVIDLIES: detecting COVID-19 misinformation on social media. In: Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020 (2020)
Hossain, Z., Rahman, A., Islam, S., Kar, S.: BanFakeNews: A dataset for detecting fake news in Bangla. arXiv preprint arXiv:2004.08789 (2020)
Johnson, M., et al.: Google’s multilingual neural machine translation system: enabling zero-shot translation. Trans. Assoc. Comput. Linguist. 5, 339–351 (2017)
Lee, I.: Social media analytics for enterprises: typology, methods, and processes. Business Horizon. 61 (2017). https://doi.org/10.1016/j.bushor.2017.11.002
Levy, S., Wang, W.Y.: Cross-lingual transfer learning for COVID-19 outbreak alignment. arXiv preprint arXiv:2006.03202 (2020)
Lindsay, B.R.: Social media and disasters: current uses, future options, and policy considerations (2011)
Luengo-Oroz, M., et al.: Artificial intelligence cooperation to support the global response to COVID-19. Nat. Mach. Intell. 2(6) (2020)
Magueresse, A., Carles, V., Heetderks, E.: Low-resource languages: a review of past work and future challenges. arXiv preprint arXiv:2006.07264 (2020)
Munro, R.: Crowdsourcing and the crisis-affected community. Inf. Retrieval 16(2), 210–266 (2013). https://doi.org/10.1007/s10791-012-9203-2
Nakamura, K., Levy, S., Wang, W.Y.: r/fakeddit: a new multimodal benchmark dataset for fine-grained fake news detection. arXiv preprint arXiv:1911.03854 (2019)
Newman, L., Hutchinson, P., Meekers, D.: Key findings of the 3-2-1 service COVID-19 surveys: Information (Wave 1) (2020). viamo.io/wp-content/uploads/2020/09/Updated/3/2/1/Service/COVID-19/Survey/Information/1.pdf
Nielsen: The steady reach of radio: winning consumer attention (2019). www.nielsen.com/us/en/insights/article/2019/the-steady-reach-of-radio-winning-consumers-attention/
Orife, I., et al.: Masakhane - Machine translation for Africa. arXiv preprint arXiv:2003.11529 (2020)
Rai, A.: Explainable AI: from black box to glass box. J. Acad. Market. Sci. 48 (2019). https://doi.org/10.1007/s11747-019-00710-5
Rappaport, S.: Listening solutions: a marketer’s guide to software and services. J. Advert. Res. - JAR 50 (2010). https://doi.org/10.2501/S002184991009135X
Ruggiero, A., Vos, M.: Social media monitoring for crisis communication: process, methods and trends in the scientific literature. Online J. Commun. Media Technol.4, 103–130 (2014). https://doi.org/10.29333/ojcmt/2457
Scannell, K.P.: The Crúbadán Project: corpus building for under-resourced languages. In: Building and Exploring Web Corpora: Proceedings of the 3rd Web as Corpus Workshop. vol. 4, pp. 5–15 (2007)
Sheppard, B.: Mitigating terror and avoidance behavior through the risk perception matrix to augment resilience. J. Homeland Secur. Emergency Manage. 8 (2011). https://doi.org/10.2202/1547-7355.1840
Spangher, A., Peng, N., May, J., Ferrara, E.: Enabling low-resource transfer learning across COVID-19 corpora by combining event-extraction and co-training. In: Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020 (2020)
Stavrakantonakis, I., Gagiu, A.E., Kasper, H., Toma, I., Thalhammer, A.: An approach for evaluation of social media monitoring tools (2012)
UNDP, UN Global Pulse: A guide to data innovation for development: from idea to proof of concept (2016)
UNESCO: Statistics on Radio (2013). www.unesco.org/new/en/unesco/events/prizes-and-celebrations/celebrations/international-days/world-radio-day-2013/statistics-on-radio/
United Nations: 17 goals to transform our world. www.un.org/sustainabledevelopment/
Vinuesa, R., et al.: The role of Artificial Intelligence in achieving the sustainable development goals. Nat. Commun. 11(1) (2020)
Young, L., Soroka, S.: Affective news: the automated coding of sentiment in political texts. Political Commun. 29, 205–231 (2012). https://doi.org/10.1080/10584609.2012.671234
Zheng, X., Liu, Y., Gunceler, D., Willett, D.: Using synthetic audio to improve the recognition of out-of-vocabulary words in end-to-end ASR systems (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Luccioni, A.S., Pham, K.H., Lam, C.S.N., Aylett-Bullock, J., Luengo-Oroz, M. (2021). Ensuring the Inclusive Use of NLP in the Global Response to COVID-19. In: Kamp, M., et al. Machine Learning and Principles and Practice of Knowledge Discovery in Databases. ECML PKDD 2021. Communications in Computer and Information Science, vol 1525. Springer, Cham. https://doi.org/10.1007/978-3-030-93733-1_18
Download citation
DOI: https://doi.org/10.1007/978-3-030-93733-1_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-93732-4
Online ISBN: 978-3-030-93733-1
eBook Packages: Computer ScienceComputer Science (R0)