Abstract
The tourism industry stimulates business revenues and economic activities across the globe. Effective analysis of enormous tourism reviews boosts both service quality and growth of industries. Aspect-based sentiment analysis (ABSA) has shown a prominent role in the aspect segmentation and sentiment ratings that obtains overall feedbacks and individual aspect feedback. In this regard, researchers are using Artificial Neural Network (ANN) for ABSA model learning. In addition to ANN, the state-of-the-art sentiment rating models adopted arithmetic mean (AM) of word embedding vectors and considered equal weightage to all aspects and reviews. But in real-world circumstances, these aspects and aspect reviews do not exhibit equal importance. They may vary from user to user and cannot be given equal weights. This is the first sentiment aggregation research that considers overall sentiment rating is consensus value from sentiment of its aspects and each aspect sentiment is the majority’s opinion associated sentences and their words. The proposed multi-layer knowledge representation architecture addresses this concept by using Word2Vec and extended families of the Ordered Weighted Average (OWA) operators. The novel approach signifies the weighted degree of importance for opinions and aspects using majority additive OWA (MAOWA), selective majority additive OWA (SMAOWA), and weighted selective aggregated majority OWA (WSAMOWA) operators. In addition to this, the proposed model also considers explicit and implicit aspect segmentation for review files, incorporates the meaning of slang internet words, and location-based geospatial rating analysis. Experimentation and evaluation conducted on TripAdvisor, Booking.com, Datafiniti tourism datasets show improvement in RMSE 14.68%, 59.03% and 12.97% and in Pearson correlation 30.63%, 23.34% and 32.61% respectively.
Similar content being viewed by others
Data availability
As per request
Code availability
As per request
References
Abbasimehr H, Shabani M (2019) A sentiment aggregation system based on an OWA operator. In 2019 5th international conference on web research (ICWR) (pp. 1-5). IEEE. https://doi.org/10.1109/ICWR.2019.8765285
Aftab H, Shuja J, Alasmary W, Alanazi E (2021) Hybrid DBSCAN based community detection for edge caching in social media applications. In 2021 international wireless communications and Mobile computing (IWCMC) (pp. 2038-2043). IEEE. https://doi.org/10.1109/IWCMC51323.2021.9498609
Amari SI (1993) Backpropagation and stochastic gradient descent method. Neurocomputing 5(4–5):185–196. https://doi.org/10.1016/0925-2312(93)90006-O
Anselin L, Syabri I, Kho Y (2010) GeoDa: an introduction to spatial data analysis. In: In handbook of applied spatial analysis (pp. 73–89). Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03647-7_5
Cambria E, Das D, Bandyopadhyay S, Feraco A (2017). Affective computing and sentiment analysis. In a practical guide to sentiment analysis (pp. 1–10). Springer, Cham. https://doi.org/10.1007/978-3-319-55394-8_1
Chang YC, Ku CH, Chen CH (2019) Social media analytics: extracting and visualizing Hilton hotel ratings and reviews from TripAdvisor. Int J Inf Manag 48:263–279. https://doi.org/10.1016/j.ijinfomgt.2017.11.001
Chang YC, Ku CH, Chen CH (2020) Using deep learning and visual analytics to explore hotel reviews and responses. Tour Manag 80:104129. https://doi.org/10.1016/j.tourman.2020.104129
Chang V, Liu L, Xu Q, Li T, Hsu C H (2020) An improved model for sentiment analysis on luxury hotel review Expert Systems, e12580. https://doi.org/10.1111/exsy.12580
Cheng Z, Ding Y, Zhu L, Kankanhalli M (2018) Aspect-aware latent factor model: rating prediction with ratings and reviews. In proceedings of the 2018 world wide web conference (pp. 639-648). https://doi.org/10.1145/3178876.3186145
cloud5 [electronic resource] (2022) 5 Hospitality Trends from Q1 2022, Available: https://cloud5.com/blog/5-hospitality-trends-q1-2022/?msclkid=dc2bd270bcc911ec826f59e201cabcc6, Accessed 20th April, 2022
Cotter A, Shamir O, Srebro N, Sridharan K (2011) Better mini-batch algorithms via accelerated gradient methods. arXiv preprint arXiv:1106.4574. https://doi.org/10.48550/arXiv.1106.4574
Ganganwar V, Rajalakshmi R (2019) Implicit aspect extraction for sentiment analysis: a survey of recent approaches. Procedia Comput Sci 165:485–491. https://doi.org/10.1016/j.procs.2020.01.010
Gaxiola F, Melin P, Valdez F, Castillo O (2015) Generalized type-2 fuzzy weight adjustment for backpropagation neural networks in time series prediction. Inf Sci 325:159–174. https://doi.org/10.1016/j.ins.2015.07.020
Ghosal S, Jain A, Sharma S, Tayal DK (2021) ARMLOWA: aspect rating analysis with multi-layer approach. Progress Artif Intell 10:1–12. https://doi.org/10.1007/s13748-021-00252-4
Godil DI, Sharif A, Rafique S, Jermsittiparsert K (2020) The asymmetric effect of tourism, financial development, and globalization on ecological footprint in Turkey. Environ Sci Pollut Res 27(32):40109–40120. https://doi.org/10.1007/s11356-020-09937-0
Gupta A, Taneja SB, Malik G, Vij S, Tayal DK, Jain A (2019) SLANGZY: a fuzzy logic-based algorithm for English slang meaning selection. Progress Artif Intell 8(1):111–121. https://doi.org/10.1007/s13748-018-0159-3
Hai Z, Chang K, Kim JJ (2011) Implicit feature identification via co-occurrence association rule mining. In international conference on intelligent text processing and computational linguistics (pp. 393-404). Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19400-9_31
Hermann K M, Blunsom P (2014) Multilingual models for compositional distributed semantics. arXiv preprint arXiv:1404.4641. https://doi.org/10.48550/arXiv.1404.4641
Hu M, Liu B (2004) Mining and summarizing customer reviews. In proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining (pp. 168-177). https://doi.org/10.1145/1014052.1014073
Jermsittiparsert K (2019). Behavior of tourism industry under the situation of environmental threats and carbon emission: Time series analysis from Thailand. 670216917. https://doi.org/10.32479/ijeep.8365
Jerripothula KR, Rai A, Garg K, Rautela YS (2020) Feature-level rating system using customer reviews and review votes. IEEE Trans Comput Soc Syst 7(5):1210–1219. https://doi.org/10.1109/TCSS.2020.3010807
Jin L, Li Z, Pan Y, Tang J (2020) Weakly-supervised image hashing through masked visual-semantic graph-based reasoning. In proceedings of the 28th ACM international conference on multimedia (pp. 916-924). https://doi.org/10.1145/3394171.3414022
Karanik M, Peláez JI, Bernal R (2016) Selective majority additive ordered weighting averaging operator. Eur J Oper Res 250(3):816–826. https://doi.org/10.1016/j.ejor.2015.10.011
Lee K H (2004). First course on fuzzy theory and applications (Vol. 27). Springer Science & Business Media. https://doi.org/10.1007/3-540-32366-X
Li Z, Tang J, Mei T (2018) Deep collaborative embedding for social image understanding. IEEE Trans Pattern Anal Mach Intell 41(9):2070–2083. https://doi.org/10.1109/TPAMI.2018.2852750
Luo Y, Tang RL (2019) Understanding hidden dimensions in textual reviews on Airbnb: an application of modified latent aspect rating analysis (LARA). Int J Hosp Manag 80:144–154. https://doi.org/10.1016/j.ijhm.2019.02.008
Mariani MM, Borghi M (2018) Effects of the booking. Com rating system: bringing hotel class into the picture. Tour Manag 66:47–52. https://doi.org/10.1016/j.tourman.2017.11.006
Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781. https://doi.org/10.48550/arXiv.1301.3781
Nazir A, Rao Y, Wu L, Sun L (2020). Issues and challenges of aspect-based sentiment analysis: a comprehensive survey. IEEE Trans Affect Comput https://doi.org/10.1109/TAFFC.2020.2970399
Ozyurt B, Akcayol MA (2021) A new topic modeling based approach for aspect extraction in aspect based sentiment analysis: SS-LDA. Expert Syst Appl 168:114231. https://doi.org/10.1016/j.eswa.2020.114231
Pang B, Lee L, Vaithyanathan, S (2002) Thumbs up? Sentiment classification using machine learning techniques. arXiv preprint cs/0205070. https://doi.org/10.48550/arXiv.cs/0205070
Peláez JI, Doña JM (2003) Majority additive–ordered weighting averaging: a new neat ordered weighting averaging operator based on the majority process. Int J Intell Syst 18(4):469–481. https://doi.org/10.1002/int.10096
Peláez JI, Bernal R, Karanik M (2016) Majority OWA operator for opinion rating in social media. Soft Comput 20(3):1047–1055. https://doi.org/10.1007/s00500-014-1564-6
Peng Z, Li Z, Zhang J, Li Y, Qi G J, Tang J (2019) Few-shot image recognition with knowledge transfer. In proceedings of the IEEE/CVF international conference on computer vision (pp. 441-449). https://doi.org/10.1109/ICCV.2019.00053
Pham DH, Le AC (2018) Learning multiple layers of knowledge representation for aspect based sentiment analysis. Data Knowl Eng 114:26–39. https://doi.org/10.1016/j.datak.2017.06.001
Pham D H, Le A C, Nguyen T T T (2016) Determing aspect ratings and aspect weights from textual reviews by using neural network with paragraph vector model. In international conference on computational social networks (pp. 309-320). Springer, Cham. https://doi.org/10.1007/978-3-319-42345-6_27
Poria S, Cambria E, Gelbukh A (2016) Aspect extraction for opinion mining with a deep convolutional neural network. Knowl-Based Syst 108:42–49. https://doi.org/10.1016/j.knosys.2016.06.009
Prasojo R E, Kacimi M, Nutt W (2015) Entity and aspect extraction for organizing news comments. In proceedings of the 24th ACM international on conference on information and knowledge management (pp. 233-242). https://doi.org/10.1145/2806416.2806576
Rana TA, Cheah YN (2016) Aspect extraction in sentiment analysis: comparative analysis and survey. Artif Intell Rev 46(4):459–483. https://doi.org/10.1007/s10462-016-9472-z
Razzaq A, Sharif A, Ahmad P, Jermsittiparsert K (2021) Asymmetric role of tourism development and technology innovation on carbon dioxide emission reduction in the Chinese economy: fresh insights from QARDL approach. Sustain Dev 29(1):176–193. https://doi.org/10.1002/sd.2139
Sarlis S, Maglogiannis I (2020) On the reusability of sentiment analysis datasets in applications with dissimilar contexts. In IFIP international conference on artificial intelligence applications and innovations (pp. 409-418). Springer, Cham. https://doi.org/10.1007/978-3-030-49161-1_34
Serrano-Guerrero J, Olivas J A, Romero F P (2019) Computing sentiment analysis through aspect-based fuzzy aggregations. In EUSFLAT Conf https://doi.org/10.2991/eusflat-19.2019.63
Serrano-Guerrero J, Chiclana F, Olivas JA, Romero FP, Homapour E (2020) A T1OWA fuzzy linguistic aggregation methodology for searching feature-based opinions. Knowl-Based Syst 189:105131. https://doi.org/10.1016/j.knosys.2019.105131
Serrano-Guerrero J, Romero F P, Olivas J A (2020) An OWA and aspect-based approach applied to rating prediction. In 2020 IEEE international conference on fuzzy systems (FUZZ-IEEE) (pp. 1-8). IEEE. https://doi.org/10.1109/FUZZ48607.2020.9177614
Shuja J, Humayun MA, Alasmary W, Sinky H, Alanazi E, Khan MK (2021) Resource efficient geo-textual hierarchical clustering framework for social iot applications. IEEE Sensors J 21(22):25114–25122. https://doi.org/10.1109/JSEN.2021.3060953
Thelwall M (2019) Sentiment analysis for tourism. Big Data Innov Tour Travel Hosp:87–104. https://doi.org/10.1007/978-981-13-6339-9_6
Toutanova K, Klein D, Manning C D, Singer Y (2003) Feature-rich part-of-speech tagging with a cyclic dependency network. In proceedings of the 2003 human language technology conference of the north American chapter of the Association for Computational Linguistics (pp. 252-259). https://doi.org/10.3115/1073445.1073478
UNWTO [electronic resource] (2021) World Tourism Organization, Available: https://www.unwto.org/global-and-regional-tourism-performance, Accessed 20th September, 2021
UNWTO [electronic resource] (2022) UNWTO World Tourism Barometer January 2022 Available: https://webunwto.s3.eu-west-1.amazonaws.com/s3fs-public/2022-01/220118-Barometersmall.pdf?msclkid=ef151eb8b9a111ec9e61db79cf28a24d, Accessed 20th April, 2022
Wang H, Lu Y, Zhai C (2010) Latent aspect rating analysis on review text data: a rating regression approach. In proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining (pp. 783-792). https://doi.org/10.1145/1835804.1835903
Wang H, Lu Y, Zhai C (2011) Latent aspect rating analysis without aspect keyword supervision. In proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining (pp. 618-626). https://doi.org/10.1145/2020408.2020505
Xing S, Wang Q, Zhao X, Li T (2019) A hierarchical attention model for rating prediction by leveraging user and product reviews. Neurocomputing 332:417–427. https://doi.org/10.1016/j.neucom.2018.12.027
Yadav ML, Roychoudhury B (2019) Effect of trip mode on opinion about hotel aspects: a social media analysis approach. Int J Hosp Manag 80:155–165. https://doi.org/10.1016/j.ijhm.2019.02.002
Yager RR (1988) On ordered weighted averaging aggregation operators in multicriteria decisionmaking. IEEE Trans Syst Man Cybern 18(1):183–190. https://doi.org/10.1109/21.87068
Ye Q, Zhang Z, Law R (2009) Sentiment classification of online reviews to travel destinations by supervised machine learning approaches. Expert Syst Appl 36(3):6527–6535. https://doi.org/10.1016/j.eswa.2008.07.035
Yen J, Langari R, Zadeh L A (1995) Industrial applications of fuzzy logic and intelligent systems. IEEE press https://doi.org/10.5555/545838
Yu L, Bai X (2021) Implicit aspect extraction from online clothing reviews with fine-tuning BERT algorithm. J Phys Conf Series 1995(1):012040). IOP publishing. https://doi.org/10.1088/1742-6596/1995/1/012040
Yusoff B, Merigó JM, Ceballos D, Peláez JI (2018) Weighted-selective aggregated majority-OWA operator and its application in linguistic group decision making. Int J Intell Syst 33(9):1929–1948. https://doi.org/10.1002/int.22004
Zhang W, Xu H, Wan W (2012) Weakness finder: find product weakness from Chinese reviews by using aspects based sentiment analysis. Expert Syst Appl 39(11):10283–10291. https://doi.org/10.1016/j.eswa.2012.02.166
Funding
No funding was received for conducting this study.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Financial interests
The authors have no financial or proprietary interests in any material discussed in this article.
Conflicts of interest/competing interests
Not Applicable
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Ghosal, S., Jain, A. Weighted aspect based sentiment analysis using extended OWA operators and Word2Vec for tourism. Multimed Tools Appl 82, 18353–18380 (2023). https://doi.org/10.1007/s11042-022-13800-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-13800-4