Skip to main content

Implementing Data Mining in a DBMS

  • Conference paper
  • First Online:
Advances in Databases (BNCOD 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2405))

Included in the following conference series:

  • 243 Accesses

Abstract

We have developed a clustering algorithm called CLIMIS to demonstrate the advantages of implementing a data mining algorithm in a database management system (DBMS). CLIMIS clusters data held in a DBMS, stores the resulting clusters in the DBMS and executes inside the DBMS. By tightly coupling CLIMIS with the database environment the algorithm scales better to large databases. This is achieved through an index-like structure that uses the database to overcome memory limitations. We further improve the performance of the algorithm by using a technique called adaptive clustering, which controls the size of the clusters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Fisher D. H., 1987, Knowledge Acquisition Via Incremental Conceptual Clustering, Machine Learning (2), pp. 139–172.

    Google Scholar 

  2. Netz, A., Chaudhuri, S., Bernhardt, J., Fayyad, U., 2000, Integration of Data Mining and Relational Databases, in Proceedings of the 26th International Conference on Very Large Databases, Cairo, Egypt, pp. 285–296.

    Google Scholar 

  3. Oracle Relational Database Management System, 2002, http://www.oracle.com/.

  4. Witten I. H., Frank E., 2000, Data Mining, Morgan Kaufmann Publishers.

    Google Scholar 

  5. Zhang T., Ramakrishnan R., Livny M., 1996, BIRCH: An Efficient Data Clustering Method for Very Large Databases, in Proceedings-ACM-SIGMOD International Conference on Management of Data, Montreal, Canada, pp. 103–114.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lepinioti, K., McKearney, S. (2002). Implementing Data Mining in a DBMS. In: Eaglestone, B., North, S., Poulovassilis, A. (eds) Advances in Databases. BNCOD 2002. Lecture Notes in Computer Science, vol 2405. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45495-0_11

Download citation

  • DOI: https://doi.org/10.1007/3-540-45495-0_11

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43905-9

  • Online ISBN: 978-3-540-45495-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics