Abstract
We have developed a clustering algorithm called CLIMIS to demonstrate the advantages of implementing a data mining algorithm in a database management system (DBMS). CLIMIS clusters data held in a DBMS, stores the resulting clusters in the DBMS and executes inside the DBMS. By tightly coupling CLIMIS with the database environment the algorithm scales better to large databases. This is achieved through an index-like structure that uses the database to overcome memory limitations. We further improve the performance of the algorithm by using a technique called adaptive clustering, which controls the size of the clusters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Fisher D. H., 1987, Knowledge Acquisition Via Incremental Conceptual Clustering, Machine Learning (2), pp. 139–172.
Netz, A., Chaudhuri, S., Bernhardt, J., Fayyad, U., 2000, Integration of Data Mining and Relational Databases, in Proceedings of the 26th International Conference on Very Large Databases, Cairo, Egypt, pp. 285–296.
Oracle Relational Database Management System, 2002, http://www.oracle.com/.
Witten I. H., Frank E., 2000, Data Mining, Morgan Kaufmann Publishers.
Zhang T., Ramakrishnan R., Livny M., 1996, BIRCH: An Efficient Data Clustering Method for Very Large Databases, in Proceedings-ACM-SIGMOD International Conference on Management of Data, Montreal, Canada, pp. 103–114.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lepinioti, K., McKearney, S. (2002). Implementing Data Mining in a DBMS. In: Eaglestone, B., North, S., Poulovassilis, A. (eds) Advances in Databases. BNCOD 2002. Lecture Notes in Computer Science, vol 2405. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45495-0_11
Download citation
DOI: https://doi.org/10.1007/3-540-45495-0_11
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43905-9
Online ISBN: 978-3-540-45495-3
eBook Packages: Springer Book Archive