Abstract
We present a new clustering algorithm for handling complexities encountered in analysing data sets of hotel ratings and analyse its performance in a clustering case study. In the setting we address, business constraints and coordinates (among other individual attributes of objects) are unknown and only distances between objects are available to the clustering algorithm, a situation that arises in a wide range of clustering applications. Our algorithm constitutes an application of meta-analytics, in which we tailor a metaheuristic procedure to address a challenging problem at the intersection of predictive and prescriptive analytics. Our work builds on and extends the ideas of our clustering algorithm introduced in previous work which employs the Tabu Search metaheuristic to assure clusters exhibit a property we call cohesiveness. The special characteristics of the present hotel classification problem are handled by integrating our previous method with a new form of hierarchical clustering. Our computational analysis discloses that our algorithm obtains clusters that exhibit greater cohesiveness than those produced by the classical K-means method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
The terms “intensification” and “diversification”, now widely used in many metaheuristic algorithms, were originally introduced in Tabu Search.
- 3.
All data sets can be downloaded from http://times.cs.uiuc.edu/~wang296/Data.
References
Ajith Abraham, Swagatam Das, and Sandip Roy. Swarm intelligence algorithms for data clustering. In Soft computing for knowledge discovery and data mining, pages 279–313. Springer, 2008.
Jay April, Marco Better, Fred Glover, James P Kelly, and Gary Kochenberger. Strategic workforce optimization: Ensuring workforce readiness with OptForceTM. Report P-20, OptTek Systems, Inc., 2014.
Michael J Brusco, Douglas Steinley, J Dennis Cradit, and Renu Singh. Emergent clustering methods for empirical OM research. Journal of Operations Management, 30 (6): 454–466, 2012.
Buyang Cao and Fred Glover. Creating balanced and connected clusters to improve service delivery routes in logistics planning. Journal of Systems Science and Systems Engineering, 19 (4): 453–480, 2010.
Buyang Cao, Fred Glover, and Cesar Rego. A tabu search algorithm for cohesive clustering problems. Journal of Heuristics, 21 (4): 457–477, 2015.
Michael B Cohen, Sam Elder, Cameron Musco, Christopher Musco, and Madalina Persu. Dimensionality reduction for k-means clustering and low rank approximation. In Proceedings of the Forty-Seventh Annual ACM on Symposium on Theory of Computing, pages 163–172. ACM, 2015.
Pawel Foszner and Andrzej Polański. Aspect analyzer distributed system for bi-clustering analysis. In Man–Machine Interactions 4, pages 411–420. Springer, 2016.
Fred Glover. A template for scatter search and path relinking. Lecture notes in computer science, 1363: 13–54, 1998.
Fred Glover and Manuel Laguna. Tabu Search. Springer-Verlag, 1997.
Fred Glover, Manuel Laguna, and Rafael Martí. Fundamentals of scatter search and path relinking. Control and cybernetics, 29 (3): 653–684, 2000.
Fred Glover, Manuel Laguna, and Rafael Martí. New ideas and applications of scatter search and path relinking. In New optimization techniques in engineering, pages 367–383. Springer, 2004.
Johannes Grabmeier and Andreas Rudolph. Techniques of cluster algorithms in data mining. Data Mining and Knowledge Discovery, 6 (4): 303–360, 2002.
Tülin İnkaya, Sinan Kayalıgil, and Nur Evin Özdemirel. Swarm intelligence-based clustering algorithms: A survey. In Unsupervised Learning Algorithms, pages 303–341. Springer, 2016.
Carmela Iorio, Gianluca Frasso, Antonio D´Ambrosio, and Roberta Siciliano. Parsimonious time series clustering using p-splines. Expert Systems with Applications, 52: 26–38, 2016.
Anil K Jain, M Narasimha Murty, and Patrick J Flynn. Data clustering: a review. ACM computing surveys (CSUR), 31 (3): 264–323, 1999.
Gordon S Linoff and Michael JA Berry. Data mining techniques: for marketing, sales, and customer relationship management. John Wiley & Sons, 2011.
Amin Mohebi, Saeed Aghabozorgi, Teh Ying Wah, Tutut Herawan, and Ramin Yahyapour. Iterative big data clustering algorithms: a review. Software: Practice and Experience, 46 (1): 107–129, 2016.
Celal Ozturk, Emrah Hancer, and Dervis Karaboga. Dynamic clustering with improved binary artificial bee colony algorithm. Applied Soft Computing, 28: 69–80, 2015.
Mauricio GC Resende, Celso C Ribeiro, Fred Glover, and Rafael Martí. Scatter search and path-relinking: Fundamentals, advances, and applications. In Handbook of metaheuristics, pages 87–107. Springer, 2010.
Alexander Strehl and Joydeep Ghosh. Relationship-based clustering and visualization for high-dimensional data mining. INFORMS Journal on Computing, 15 (2): 208–230, 2003.
Hongning Wang, Yue Lu, and Chengxiang Zhai. Latent aspect rating analysis on review text data: a rating regression approach. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 783–792. ACM, 2010.
Hongning Wang, Yue Lu, and ChengXiang Zhai. Latent aspect rating analysis without aspect keyword supervision. In Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 618–626. ACM, 2011.
Hongchen Wu, Xinjun Wang, Zhaohui Peng, and Qingzhong Li. Div-clustering: exploring active users for social collaborative recommendation. Journal of Network and Computer Applications, 36 (6): 1642–1650, 2013.
Acknowledgements
We are indebted to a reviewer for insightful critical observations and suggestions that have helped to improve our chapter. The authors would like to thank our student team including Zheng Xu and Fang Yu for their efforts in implementing the algorithm, data preparation, and data analysis. This work was partially supported by the China Intelligent Urbanization Co-Creation Center [grant number CIUC20150011] and was also supported in part by the Key Laboratory of International Education Cooperation of Guangdong University of Technology.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Cao, B., Rego, C., Glover, F. (2019). Hotel Classification Using Meta-Analytics: A Case Study with Cohesive Clustering. In: Moscato, P., de Vries, N. (eds) Business and Consumer Analytics: New Ideas. Springer, Cham. https://doi.org/10.1007/978-3-030-06222-4_21
Download citation
DOI: https://doi.org/10.1007/978-3-030-06222-4_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-06221-7
Online ISBN: 978-3-030-06222-4
eBook Packages: Computer ScienceComputer Science (R0)