Analysing User Reviews in Tourism with Topic Models

  • Conference paper
Information and Communication Technologies in Tourism 2015

Abstract

User-generated content in general, and textual reviews in particular, constitute a vast source of information for the decision-making of both tourists and management, and are therefore a key component of e-tourism. This paper explores different application scenarios in which the topic model method is used to process these textual reviews, in order to provide accurate decision support and recommendations and to build a basis for further analytics. Besides contributing a new model based on the topic model method, the paper reports empirical evidence from experiments on user reviews from the YELP dataset and from TripAdvisor.

Notes

  1. http://www.yelp.com/dataset_challenge

Acknowledgements

The first author wishes to acknowledge the financial support provided by the Australian Government Department of Education through the 2014 Endeavour Research Fellowship awarded for the visiting period at the Advanced Analytics Institute, University of Technology, Sydney, Australia, under the supervision of Prof. Longbing Cao.

Furthermore, the authors acknowledge the financial support from the European Union (EU), the European Regional Development Fund (ERDF), the Austrian Federal Government and the State of Carinthia in the Interreg IV Italien-Österreich programme (project acronym O-STAR).

Corresponding author

Correspondence to Marco Rossetti.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Rossetti, M., Stella, F., Cao, L., Zanker, M. (2015). Analysing User Reviews in Tourism with Topic Models. In: Tussyadiah, I., Inversini, A. (eds) Information and Communication Technologies in Tourism 2015. Springer, Cham. https://doi.org/10.1007/978-3-319-14343-9_4
