Skip to main content

A Fast Algorithm for Density-Based Clustering in Large Database

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1574))

Abstract

Clustering in large database is an important and useful data mining activity. It expects to find out meaningful patterns among the data set. Some new requirements of clustering have been raised : good efficiency for large database; easy to determine the input parameters; separate noise from the clusters [1]. However, conventional clustering algorithms seldom can fulfill all these requirements. The notion of density-based clustering has been proposed which satisfies all these requirements

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. M. Ester, H. Kriegel, J. Sander, and X. Xu. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In Proc. of 2nd International Conf. on Knowledge Discovery and Data Mining, pages 226–231, Portland, Oregon, Aug. 1996.

    Google Scholar 

  2. R. T. Ng, J. Han. Efficient and Effective Clustering Methods for Spatial Data Mining. In Proc. of the 20th VLDB Conf., pages 144–155, Santiago, Chile, Sept. 1994.

    Google Scholar 

  3. M. Ester, H. Kriegel, and X. Xu. A Database Interface for Clustering in Large Spatial Databases. In Proc. of the First International Conf. on Knowledge Discovery in Databases and Data Mining, pages 94–99, Montreal, Canada, Aug. 1995.

    Google Scholar 

  4. T. Zhang, R. Ramakrishnan, and M. Livny. BIRCH: An Efficient Data Clustering Method for Very Large Databases. In Proc. of the ACM SIGMOD Conf. on Management of Data, pages 103–114, Montreal, Quebec, June 1996.

    Google Scholar 

  5. S. Guha, R. Ratogi, and K. Shim. CURE: An Efficient Clustering Algorithm for Large Databases. In Proc. of the ACM SIGMOD Conf. on Management of Data, pages 73–84, Seattle, Washington, June 1998.

    Google Scholar 

  6. M. Ester, H. Kriegel, J. Sander, M. Wimmer and Xiaowei Xu. Incremental Clustering for Mining in a Data Warehousing Environment. In Proc. of the 24th VLDB Conf., pages 323–333, New York city, New York, Aug. 1998.

    Google Scholar 

  7. M. Stonebraker, J. Frew, K. Gardels, J. Meredith. The Sequoia 2000 Benchmark. In Proc. of the ACM SIGMOD Conf. on Management of Data, pages 2–11, Washington, D.C., May, 1993.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhou, B., Cheung, D.W., Kao, B. (1999). A Fast Algorithm for Density-Based Clustering in Large Database. In: Zhong, N., Zhou, L. (eds) Methodologies for Knowledge Discovery and Data Mining. PAKDD 1999. Lecture Notes in Computer Science(), vol 1574. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48912-6_45

Download citation

  • DOI: https://doi.org/10.1007/3-540-48912-6_45

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-65866-5

  • Online ISBN: 978-3-540-48912-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics