skip to main content
10.1145/3474124.3474204acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesic3Conference Proceedingsconference-collections
research-article

Restricted Randomness DBSCAN : A faster DBSCAN Algorithm

Published:04 November 2021Publication History

ABSTRACT

Data Mining is the process of extracting useful and accurate information or patterns from large databases using different algorithms and methods of machine learning. To analyze the data, Clustering is one of the methods in which similar data is grouped together and DBSCAN clustering algorithm is the one, which is broadly used in numerous practical applications. This paper presents a more efficient density based clustering algorithm, which has the ability to discover cluster faster than the existing DBSCAN algorithm. The efficiency is achieved by restricting the randomness of choosing points from the dataset. Our proposed algorithm named Restricted Randomness DBSCAN (RR DBSCAN) is compared with conventional DBSCAN algorithm over 9 datasets on the basis of Silhouette Coefficient, Time taken in formation of clusters and accuracy. The results show that RR DBSCAN performs better than traditional DBSCAN in terms of accuracy and time taken to form clusters.

References

  1. [1] Phyu, Thair Nu, ”Survey of classification techniques in data mining.” Proceedings of the International MultiConference of Engineers and Computer Scientists. Vol. 1. 2009.Google ScholarGoogle Scholar
  2. [2] Kotsiantis, Sotiris, and Dimitris Kanellopoulos, ”Association rules mining: A recent overview.” GESTS International Transactions on Computer Science and Engineering 32.1 (2006): 71-82.Google ScholarGoogle Scholar
  3. [3] Berkhin, Pavel. ”A survey of clustering data mining techniques.” Grouping multidimensional data. Springer, Berlin, Heidelberg, 2006. 25-71.Google ScholarGoogle ScholarCross RefCross Ref
  4. [4] McCarty, John A., and Manoj Hastak. ”Segmentation approaches in data-mining: A comparison of RFM, CHAID, and logistic regression.” Journal of business research 60.6 (2007): 656-662.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Kamalpreet K. Jassar, Kanwalvir S. Dhindsa, ”Comparative study of spatial data mining techniques” International Journal of computer applications, 2015.Google ScholarGoogle Scholar
  6. [6] X.Y. Wang, J.M. Garibaldi, ”A comparison of fuzzy and non-fuzzy clustering techniques in cancer diagnosis”The University of Nottingham.Google ScholarGoogle Scholar
  7. [7] Ting Liu, Charles Rosenberg, henry A. Rowley, ”Clustering billions of images with large scale nearest neighbor search”, IEEE workshop on applications ofComputer Vision, 2007.Google ScholarGoogle Scholar
  8. [8] Kemal Akkaya, Fatih Senel, Brian McLaughlan, “Clustering of Wireless Sensor and actor networks based on sensor distribution and inter-actor connectivity”. Journal of Parallel and Distributed Computing 69.6 (2009): 573-587.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. [9] Koutsoukas, Alexios, et al. ”From in silico target prediction to multi-target drug design: current databases, methods and applications.” Journal of proteomics 74.12 (2011): 2554-2574.Google ScholarGoogle ScholarCross RefCross Ref
  10. [10] Johnson, Stephen C. ”Hierarchical clustering schemes.” Psychometrika 32.3 (1967): 241-254.Google ScholarGoogle ScholarCross RefCross Ref
  11. [11] Sousa, Lúcia & Gama, João. The Application of Hierarchical Clustering Algorithms for Recognition Using Biometrics of the Hand. International Journal of Advanced Engineering Research and Science (IJAERS). ISSN: 2349-6495.Google ScholarGoogle Scholar
  12. [12] Martin Ester, Hans P. Kriegel, J. Sander, Xiaowei Xu, ”A density based algorithm for discovering clusters in large spatial database with noise”, KDD-96.Google ScholarGoogle Scholar
  13. [13] Fu X., Wang Y., Ge Y., Chen P., Teng S. Research and Application of DBSCAN Algorithm Based on Hadoop Platform. In: Zu Q., Vargas-Vera M., Hu B. (eds) Pervasive Computing and the Networked World. ICPCA/SWS 2013. Lecture Notes in Computer Science, vol 8351. Springer, Cham.Google ScholarGoogle Scholar
  14. [14] Pappas, Thrasyvoulos N. ”An adaptive clustering algorithm for image segmentation.” IEEE Transactions on signal processing40.4 (1992): 901-914.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. [15] Agrawal, R., Gehrke, J., Gunopulos, D., & Raghavan, P. ”Automatic subspace clustering of high dimensional data for data mining applications” (Vol. 27, No. 2, pp. 94-105). ACM.Google ScholarGoogle Scholar
  16. [16] Hartigan, John A., and Manchek A. Wong. ”Algorithm AS 136: A k-means clustering algorithm.” Journal of the Royal Statistical Society. Series C (Applied Statistics) 28.1 (1979): 100-108.Google ScholarGoogle Scholar
  17. [17] Ray, Siddheswar, and Rose H. Turi. ”Determination of number of clusters in k-means clustering and application in colour image segmentation.” Proceedings of the 4th international conference on advances in pattern recognition and digital techniques. 1999.Google ScholarGoogle Scholar
  18. [18] J.Sander, “Density Based Clustering ”,in Enclyclopedia of machine learning, Springer, 2011, pp.270-273.Google ScholarGoogle Scholar
  19. [19] Ankita, Thakur Manish K. ”Modified DBSCAN Using Particle Swarm Optimization for Spatial Hotspot Identification.” 2018 Eleventh International Conference on Contemporary Computing (IC3). IEEE, 2018.Google ScholarGoogle Scholar
  20. [20] Martino, F. D., and Sessa, S., “A fuzzy particle swarm optimization algorithm and its application to hotspot events in spatial analysis,” J Ambient Intell Human Comput, Springer, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Smiti, Abir, and Zied Eloudi. ”Soft dbscan: Improving dbscan clustering method using fuzzy set theory.” 2013 6th International Conference on Human System Interactions (HSI). IEEE, 2013.Google ScholarGoogle Scholar
  22. [22] Ienco, Dino, and Gloria Bordogna. ”Fuzzy extensions of the DBScan clustering algorithm.” Soft Computing 22.5 (2018): 1719-1730.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. [23] Viswanath, P., and Rajwala Pinkesh. ”l-dbscan: A fast hybrid density based clustering method.” 18th International Conference on Pattern Recognition (ICPR’06). Vol. 1. IEEE, 2006.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. [24] Liu, Bing. ”A fast density-based clustering algorithm for large databases.” 2006 International Conference on Machine Learning and Cybernetics. IEEE, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  25. [25] Aranganayagi, S., and K. Thangavel. ”Clustering categorical data using silhouette coefficient as a relocating measure.” International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007). Vol. 2. IEEE, 2007.Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    IC3-2021: Proceedings of the 2021 Thirteenth International Conference on Contemporary Computing
    August 2021
    483 pages
    ISBN:9781450389204
    DOI:10.1145/3474124

    Copyright © 2021 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 4 November 2021

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited
  • Article Metrics

    • Downloads (Last 12 months)14
    • Downloads (Last 6 weeks)3

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format