Abstract
Tourism is one of the sectors of the economy (global, regional, local) that was particularly heavily affected by Covid-19. As registered in many tourism departments from many countries, their financial incomes decreased since some of them were directly depended on the tourism sector. Sentiment analysis has emerged as a widely used method to understand the emotions and sentiments expressed by humans in text, to enhance the overall experience in a specific situation. The objective of this paper is to use advanced data analytics methods and natural language processing (NLP) techniques to extract comments from visitors about a tourist area, particularly those posted on Twitter during different periods of time. To identify what they considered as interesting sites in a tourist area, sentiment analysis is applied. The research focuses on tweets posted within a 3 km. radius from the center of Central Park in New York. This valuable information about the tourist’s opinions and interests can be used to gain insight information that can be used by the area management to take advantage of this to implement strategies that attract people's attention. The proposed methodology extracts tweets related to the study area and processes them by applying Natural Language Processing, then the texts are classified by a logistic regression algorithm. The relevance of the approach is the implementation of NLP and machine learning to include technology in the touristic area's management through the data analysis, and that the methodology could be replicated in different areas as long as the data exists.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Khalid, U., Okafor, L.E., Sanusi, O.I.: Exploring diverse sources of linguistic influence on international tourism flows. J. Travel Res., Online First. (2021). https://doi.org/10.1177/0047287520988909
Gopalan, S., Khalid, U.: How does financial inclusion influence tourism demand? empirical evidence from emerging markets and developing economies. Tour. Recreation Res. (2022). https://doi.org/10.1080/02508281.2022.2028084
Finance & Development: Finance & development, 57(4), 1 (2020). https://doi.org/10.5089/9781513544625.022
Troisi, O., Visvizi, A., Grimaldi, M.: Digitalizing business models in hospitality ecosystems: toward data-driven innovation. Eur. J. Innov. Manag. 26(7), 242–277 (2023). https://doi.org/10.1108/EJIM-09-2022-0540
Troisi, O., Grimaldi, M., Visvizi, A.: Digital transformation in tourism ecosystems: what impact on sustainability and innovation?. In: Visvizi, A., Troisi, O., Grimaldi, M. (eds.) Research and Innovation Forum 2022. RIIFORUM 2022. Springer Proceedings in Complexity. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-19560-0_5
Beysolow II, T.: What is natural language processing? Appl. Nat. Lang. Process. with Python, 1–12 (2018). https://doi.org/10.1007/978-1-4842-3733-5_1
Gregoriades, A., Pampaka, M., Herodotou, H., et al.: Explaining tourist revisit intention using natural language processing and classification techniques. J. Big Data 10, 60 (2023)
Li, Y., Hu, C., Huang, C., Duan, L.: The concept of smart tourism in the context of tourism information services. Tour. Manage. 58, 293–300 (2017)
Perano, M., Del Regno, C., Pellicano, M., Casali, G.L.: COVID-19 and smart city in the context of tourism: a bibliometric analysis using VOSviewer software. In: Visvizi, A., Troisi, O., Grimaldi, M. (eds.) Research and Innovation Forum 2022. RIIFORUM 2022. Springer Proceedings in Complexity. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-19560-0_22
Yue, L., Chen, W., Li, X., Zuo, W., Yin, M.: A survey of sentiment analysis in social media. Knowl. Inf. Syst. 60(2), 617–663 (2019)
Rrmoku, K., Selimi, B., Ahmedi, L.: Application of trust in recommender systems—utilizing naive Bayes classifier. Comput. 10(1), 6 (2022)
Afzaal, M., Usman, M., Fong, A.C.M.: Predictive aspect-based sentiment classification of online tourist reviews. J. Inf. Sci. 45(3), 341–363 (2018). https://doi.org/10.1177/0165551518789872
Mishra, R.K., Raj, H., Urolagin, S., Jothi, J., Nawaz, N.: Cluster-based knowledge graph and entity-relation representation on tourism economical sentiments. Appl. Sci. 12(16), 8105 (2022). https://doi.org/10.3390/app12168105
Zhang, Y., Lingyi, M., Pei-Xue, L., Lu, Y., Zhang, J.: COVID-19’s impact on tourism: will compensatory travel intention appear? Asia Pacific J. Tour. Res. 26(7), 732–747 (2021). https://doi.org/10.1080/10941665.2021.1908383
Kalchbrenner, N., Grefenstette, E., Blunsom, P.: A convolutional neural network for modelling sentences (2014). https://arxiv.org/abs/1404.2188
Wang X, Liu Y, Sun, C.J., Wang, B.: Predicting polarities of tweets by composing word embeddings with long shortterm memory. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 1343–1353, China (2015)
Wei, S., Song, S.: Sentiment classification of tourism reviews based on visual and textual multifeature fusion. Wireless Comm. Mobile Comp. Article ID 9940817, 10 p (2022). https://doi.org/10.1155/2022/9940817
Alvarez-Carmona, M.A., Aranda, R., Rodriguez-Gonzalez, A., Fajardo-Delgado, D., Sanchez, M.G., Perez-Espinosa, H., Martinez-Miranda, J., Guerrero-Rodriguez, R., Bustio-Martinez, L., Pacheco, A.D.: Natural language processing applied to tourism research: a systematic review and future research directions. J. King Saud Univ.—Comp. Info. Sci. 34(10), 10125–10144 (2022). https://doi.org/10.1016/j.jksuci.2022.10.010
Menner, T., Höpken, W., Fuchs, M., Lexhagen, M.: Topic detection: identifying relevant topics in tourism reviews. Info. Comm. Tech. Tourism 2016, 411–423 (2016). https://doi.org/10.1007/978-3-319-28231-2_30
Thomas, A.: Natural language processing with spark NLP: learning to understand text at scale. Van Duuren Media (2020)
Álvarez-Carmona, M.A., Aranda, R., Arce-Cardenas, S., Fajardo-Delgado, D., Guerrero-Rodríguez, R., López-Monroy, A.P., Martínez-Miranda, J., Pérez-Espinosa, H., Rodríguez-González, A.Y.: Overview of rest-mex at IberLEF 2021: Recommendation system for text Mexican tourism. Procesamiento Del Lenguaje Natural 67(67), 163–172 (2021). https://doi.org/10.26342/2021-67-14
Shah, K., Patel, H., Sanghvi, D., Shah, M.: A comparative analysis of logistic regression, random forest and KNN models for the text classification. Augment. Human Res. 5(1). https://doi.org/10.1007/s41133-020-00032-0
Chang, I.C., Horng, J.S., Liu, C.H., Chou, S.F., Yu, T.Y.: Exploration of topic classification in the tourism field with text mining technology—a case study of the academic journal papers. Sustain. 14(7), 4053 (2022). https://doi.org/10.3390/su14074053
Kowsari, K., Meimandi, K.J., Heidarysafa, M., Mendu, S., Barnes, L.E., Brown, D.E.: Text classification algorithms: a survey. Info. 10(4), 150 (2019). https://doi.org/10.3390/info10040150
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Saldana-Perez, M., Moreno-Ibarra, M., Palma-Preciado, C., Guzman, G., Contreras-Jimenez, Y. (2024). Reinforcing Tourism Post-pandemic Through a Natural Language Processing Data Analysis. In: Visvizi, A., Troisi, O., Corvello, V. (eds) Research and Innovation Forum 2023. RIIFORUM 2023. Springer Proceedings in Complexity. Springer, Cham. https://doi.org/10.1007/978-3-031-44721-1_44
Download citation
DOI: https://doi.org/10.1007/978-3-031-44721-1_44
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-44720-4
Online ISBN: 978-3-031-44721-1
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)