Abstract
Data mining techniques have been recognized as powerful tools for predictive modeling tourist decision-making process. However, two practical yet important problems have not been resolved by the data miners in empirical tourism research. Firstly, comprehensibility-the role of the data mining should not only generate accurate predictions, but also provide insights why certain prediction is made. But most widely used data mining methods that can generalize well are black-box in nature and can provide little information on the tourist decision-making facts. Secondly, the lack of training samples-it is usually rather difficult to collect enough training samples through surveying the tourist on site, especially for surveying the tourist’s decision-making facts. Many data mining methods may not achieve satisfactory performance if learned on small data set. In this paper, we show that these two problems can be addressed simultaneously using a twice-learning framework on the travel preference data. The results indicate that by addressing these two problems properly, we can predict tourist preferences accurately as well as extracting meaningful insights which would be useful for tourism marketing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Au, N., Law, R.: Categorical classification of tourism dining. Annals of Tourism Research 29, 819–833 (2002)
Gibson, H., Yiannakis, A.: Tourist roles: Needs and the lifecourse. Annals of Tourism Research 29, 358–383 (2002)
Hansen, L.K., Salamon, P.: Neural network ensembles. IEEE Trans. Pattern Analysis and Machine Intelligence 12, 993–1001 (1990)
Jiang, Y., Li, M., Zhou, Z.-H.: Generation of Comprehensible Hypotheses from Gene Expression Data. In: Li, J., Yang, Q., Tan, A.-H. (eds.) BioDM 2006. LNCS (LNBI), vol. 3916, pp. 116–123. Springer, Heidelberg (2006)
Kim, H., Gu, Z.: Financial features of dividend-paying firms in the hospitality industry: A logistic regression analysis. International Journal of Hospitality Management 28, 359–366 (2009)
Kon, S.C., Turner, W.L.: Neural network forecasting of tourism demand. Tourism Economics 11, 301–328 (2005)
Law, R., Au, N.: A neural network model to forecast Japanese demand for travel to Hong Kong. Tourism Management 20, 89–97 (1999)
Pai, P.F., Hong, W.C., Chang, P.T., Chen, C.T.: The application of support vector machines to forecast tourist arrivals in Barbados: An empirical study. International Journal of Management 23, 375–385 (2006)
Palmer, A., Jose Montano, J.J., Sese, A.: Designing an artificial neural network for forecasting tourism time-series. Tourism Management 27, 781–790 (2006)
Quinlan, J.R.: C4.5: Programs for machine learning. Morgan Kaufmann, San Mateo (1993)
Song, H., Li, G.: Tourism demand modeling and forecasting A review of recent research. Tourism Management 29, 203–220 (2008)
Cao, L.: In-depth Behavior Understanding and Use: the Behavior Informatics Approach. Information Science 180(17), 3067–3085 (2010)
Witten, I., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (2000)
Wong, J.-Y., Yeh, C.: Tourism hesitation in destination decision making. Annals of Tourism Research 36, 6–23 (2009)
Wong, K.K.F., Song, H., Chon, K.S.: Bayesian models for tourism demand forecasting. Tourism Management 27, 773–780 (2006)
World Tourism Organization. Chinese outbound tourism, Madrid (2003)
Zhou, Z.-H., Jiang, Y.: Medical diagnosis with C4.5 rule preceded by artificial neural network ensemble. IEEE Transactions on Information Technology in Biomedicine 7, 37–42 (2003)
Zhou, Z.-H., Jiang, Y.: NeC4.5: neural ensemble based C4.5. IEEE Transactionson Knowledge and Data Engineering 16, 770–773 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, C., Zhang, J. (2012). Mining Tourist Preferences with Twice-Learning. In: Cao, L., Huang, J.Z., Bailey, J., Koh, Y.S., Luo, J. (eds) New Frontiers in Applied Data Mining. PAKDD 2011. Lecture Notes in Computer Science(), vol 7104. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28320-8_41
Download citation
DOI: https://doi.org/10.1007/978-3-642-28320-8_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28319-2
Online ISBN: 978-3-642-28320-8
eBook Packages: Computer ScienceComputer Science (R0)