Skip to main content

OpeNER: Open Tools to Perform Natural Language Processing on Accommodation Reviews

  • Conference paper
  • First Online:
Information and Communication Technologies in Tourism 2015

Abstract

Opinion mining is crucial for hoteliers and other tourism industries in order to improve their service from the analysis of services failures and recovery. The extensive use of the Internet and social networks has shifted the way tourism information is shared and spread. Travel agencies, hotels, restaurants, tourist destinations and other actors require the aid of new technologies to get an insight of the vast amount of customer generated reviews. Develop and integrate text analysis technologies is usually difficult and expensive, because it involves the use of Natural Language Processing techniques. This paper introduces the OpeNER European project, a set of free Open Source and ready-to-use text analysis tools to perform text processing tasks like Named Entity Recognition and Opinion detection. The paper also provides an example of a possible application of the OpeNER results in the geolocation of hotel reviews.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.opener-project.eu

  2. 2.

    KAF documents are XML files too verbose to be represented in this paper. More information about KAF format and examples can be found at the OpeNER website.

  3. 3.

    http://www.zoover.com

References

  • Bagga, A., & Baldwin, B. (1999). Cross-document event coreference: Annotations, experiments, and observations. In Proceedings of the workshop on coreference and its applications.

    Google Scholar 

  • Bosma, W., Vossen, P., & Soroa, A. (2009). KAF: A generic semantic annotation format. In Proceedings of the GL2009 workshop on semantic annotation.

    Google Scholar 

  • Brants, T. (2000). TnT: A statistical part-of-speech tagger. In Proceedings of the sixth conference on applied natural language processing. Seattle, WA.

    Google Scholar 

  • Brereton, R. G., & Lloyd, G. R. (2010). Support vector machines for classification and regression. The Analyst, 135, 230–267.

    Article  Google Scholar 

  • Collins, M. (2002). Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms. In Proceedings of the ACL-02 conference on empirical methods in natural language processing (pp. 1–8). Philadelphia, PA: Association for Computational Linguistics.

    Google Scholar 

  • Ghose, A., Ipeirotis, P., & Li, B. (2009). The economic impact of user-generated content on the Internet: Combining text mining with demand estimation in the hotel industry. In Proceedings of the 20th workshop on information systems and economics (WISE).

    Google Scholar 

  • Giesbrecht, E., & Evert, S. (2009). Is part-of-speech tagging a solved task? An evaluation of POS taggers for the German web as corpus. In Web as corpus workshop WAC5 (p. 27).

    Google Scholar 

  • Gräbner, D., Zanker, M., Fliedl, G., & Fuchs, M. (2012). Classification of customer reviews based on sentiment analysis. In Proceedings of the 19th conference on information and communication technologies in tourism (ENTER) (pp. 460–470). Helsingborg, Sweden: Springer.

    Google Scholar 

  • Hu, M., & Liu, B. (2004). Mining opinion features in customer reviews. Association for the Advancement of Artificial Intelligence, 4(4), 755–760.

    Google Scholar 

  • Kasper, W., & Vela, M. (2011). Sentiment analysis for hotel reviews. Computational Linguistics-Applications Conference, 231527, 45–52.

    Google Scholar 

  • Lau, K., Lee, K., & Ho, Y. (2005). Text mining for the hotel industry. Cornell Hotel and Restaurant Administration Quarterly, 46(3), 344–362.

    Article  Google Scholar 

  • Lee, H., Peirsman, Y., Chang, A., Chambers, N., Surdeanu, M., & Jurafsky, D. (2011a). Stanford’ s multi-pass sieve coreference resolution system at the CoNLL-2011 shared task. In Proceedings of the fifteenth conference on computational natural language learning: Shared task (pp. 28–34). Portland, OR: Association for Computational Linguistics.

    Google Scholar 

  • Lee, M. J., Singh, N., & Chan, E. S. W. (2011b). Service failures and recovery actions in the hotel industry: A text-mining approach. Journal of Vacation Marketing, 17(3), 197–207.

    Article  Google Scholar 

  • Liu, B. (2010). Sentiment analysis and subjectivity. In N. Indurkhya & F. J. Damerau (Eds.), Handbook of natural language processing (pp. 1–38). New York, NY: ACM press.

    Google Scholar 

  • Liu, S., Law, R., Rong, J., Li, G., & Hall, J. (2013). Analyzing changes in hotel customers’ expectations by trip mode. International Journal of Hospitality Management, 34, 359–371.

    Article  Google Scholar 

  • Marchetti, A., Tesconi, M., Abbate, S., Lo Duca, A., D’Errico, A., Frontini F., & Monachini, M. (2013). Tour-pedia: A web application for the analysis and visualization of opinions for tourism domain. In Z. Vetulani & H. Uszkoreit (Eds.), The 6th Language & Technology Conference on Human Language Technology (pp. 594–595). Poznan, Poland.

    Google Scholar 

  • Nadeau, D., & Sekine, S. (2007). A survey of named entity recognition and classification. Lingvisticae Investigationes, 30(1), 3–26.

    Article  Google Scholar 

  • Pang, B., & Lee, L. (2008). Opinion mining and sentiment analysis. Foundations and Trends® in Information Retrieval, 2(1–2), 1–135.

    Article  Google Scholar 

  • Popescu, A.-M., & Etzioni, O. (2007). Extracting product features and opinions from reviews. In A. Kao & S. R. Poteet (Eds.), Natural language processing and text mining (pp. 9–28). London: Springer.

    Chapter  Google Scholar 

  • Rao, D., McNamee, P., & Dredze, M. (2013). Entity linking: Finding extracted entities in a knowledge base. In Multi-source, multilingual information extraction and summarization (pp. 93–115). Berlin: Springer.

    Google Scholar 

  • Řehůřek, R., & Kolkus, M. (2009). Language identification on the web: Extending the dictionary method. Lecture Notes in Computer Science, 5449, 357–368.

    Article  Google Scholar 

  • Sil, A., Cronin, E., Nie, P., Yang, Y., Popescu, A.-M., & Yates, A. (2012). Linking named entities to any database. In EMNLP-CoNLL 2012 (pp. 116–127). Stroudsburg, PA: Association for Computational Linguistics.

    Google Scholar 

  • Sutton, C., & McCallum, A. (2012). An introduction to conditional random fields. Foundations and Trends in Machine Learning, 4, 267–373.

    Article  Google Scholar 

  • Webster, J. J., & Kit, C. (1992). Tokenization as the initial phase in NLP. In Proceedings of COLING-92 (pp. 1106–1110). Nantes, France: International Committee on Computational Linguistics.

    Google Scholar 

  • Xiang, Z., & Gretzel, U. (2010). Role of social media in online travel information search. Tourism Management, 31(2), 179–188. Elsevier.

    Google Scholar 

  • Ye, Q., Zhang, Z., & Law, R. (2009). Sentiment classification of online reviews to travel destinations by supervised machine learning approaches. Expert Systems with Applications, 36(3), 6527–6535. Elsevier.

    Google Scholar 

Download references

Acknowledgements

OpeNER has been funded by the European Commission under the FP7-ICT-2011-SME- DCL-296451.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Aitor García-Pablos .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

García-Pablos, A., Cuadros, M., Linaza, M.T. (2015). OpeNER: Open Tools to Perform Natural Language Processing on Accommodation Reviews. In: Tussyadiah, I., Inversini, A. (eds) Information and Communication Technologies in Tourism 2015. Springer, Cham. https://doi.org/10.1007/978-3-319-14343-9_10

Download citation

Publish with us

Policies and ethics