Abstract
An intelligent clustering algorithm named DBSCAN-M is proposed for the purpose of data mining, which improves the recognition rate of noise under the circumstance of high noise density forming new clusters. The proposed algorithm is a synthesis of density clustering theory from DBSCAN (Density-Based Spatial Clustering of Applications with Noise) and mutual reinforcement from HITS (Hypertext Induced Topic Search) within search engine technology. The core points and clusters in the data set are mutually reinforced, thereby the capability of accurate identification of the noise is enhanced beneath high noise density. An algorithmic model was established, and simulations are taken by the WEKA software with real data sets from the University of California. Results showed that the proposed algorithm can obtain a more accurate recognition of the noises contrasting with the usual DBSCAN algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Liu, Z.H., Jiang, Z.Q., Zuo, R.S.: Study of fussy clustering of engineering geological environment with GIS. J. China Univ. Min. Technol. 13, 196–200 (2004)
Liu, J., Yan, Z., Yang, L.T.: Fusion–an aide to data mining in internet of things. Inf. Fusion. 23, 1–2 (2015)
Chen, M.S., Han, J., Yu, P.S.: Data mining: an overview from a database perspective. Knowl. Data Eng. 8, 866–883 (1996)
Knorr, E.M., Ng, R.T.: A unified notion of outliers: properties and computation. In: Proceedings of the KDD 1997, pp. 219–222. AAAI Press, Phoenix (1997)
Zhang, Y., Liu, J., Li, H.: An outlier detection method based on probability. Comput. Eng. 3, 47–55 (2013)
Liang, B.: A hierarchical clustering based global outlier detection method. In: 5th IEEE International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA), pp. 1213–1215. IEEE Press, New York (2010)
Feng, P.J., Ge, L.D.: Adaptive DBSCAN-based algorithm for constellation reconstruction and modulation identification. In: Radio Science Conference, pp. 177–180. IEEE Press, New York (2004)
Birant, D., Kut, A.: ST-DBSCAN: an algorithm for clustering spatial–temporal data. Data Knowl. Eng. 60, 208–221 (2007)
Koteshwariah, C.B., Kisore, N.R. Ravi, V.: A fuzzy version of generalized DBSCAN clustering algorithm. In: Proceedings of the Second ACM IKDD Conference on Data Sciences, pp. 128–129. ACM Press (2015)
Smiti, A., Eloudi, Z.: Soft DBSCAN: Improving DBSCAN clustering method using fuzzy set theory. In: 6th International Conference on Human System Interaction (HSI), pp. 380–385. IEEE Press, New York (2013)
Chen, F., Deng, P., Wan, J., et al: Data mining for the internet of things: literature review and challenges. Int. J. Distrib. Sens. Netw. vol. 2015, p. 14, 501, Article id. 431047 (2015)
Shah, M.: Big Data and the Internet of Things. arXiv preprint (2015). arXiv:1503.07092
Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the KDD 1997, pp. 226–231, AAAI Press, Phoenix (1996)
Han, J., Kamber, M., Pei, J., Kamber, M., Pei, J.: Data mining: concepts and techniques, 3rd ed. San Francisco. 29, S103–S109 (2001)
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM (JACM) 46, 604–632 (1999)
Ding, C., He, X., Husbands, P., Zha, H., Simon, H.D.: Pagerank, hits and a unified framework for link analysis. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 353–354. ACM Press (2002)
Flake, G.W., Lawrence, S., Giles, L.C., Coetzee, F.M.: Self-organization and identification of web communities. Comput. 35, 66–70 (2002)
Chakrabarti, S., Dom, B.E., Kumar, S.R., Raghavan, P., Rajagopalan, S., Tomkins, A., et al.: Mining the web’s link structure. Comput. 32, 60–67 (1999)
Mitra, S., Nandy, J.: KDDClus: a simple method for multi-density clustering. In: SKAD 2011-Soft Computing Applications and Knowledge Discovery, pp. 72–76 (2011)
Parimala, M., Lopez, D., Senthilkumar, N.C.: A survey on density based clustering algorithms for mining large spatial databases. Int. J. Adv. Sci. Technol. 31, 59–66 (2011)
Acknowledgements
This work was supported by the Hunan Provincial Natural Science Foundation of China under Grant 14JJ5009, and the Post Doctoral Foundation of Central South University, China.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Li, Y., Guo, C., Shi, R., Liu, X., Mei, Y. (2015). DBSCAN-M: An Intelligent Clustering Algorithm Based on Mutual Reinforcement. In: Wang, G., Zomaya, A., Martinez, G., Li, K. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2015. Lecture Notes in Computer Science(), vol 9529. Springer, Cham. https://doi.org/10.1007/978-3-319-27122-4_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-27122-4_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27121-7
Online ISBN: 978-3-319-27122-4
eBook Packages: Computer ScienceComputer Science (R0)