Paper
25 February 1999 Probabilistic analysis of the RNN-CLINK clustering algorithm
Sheau-Dong Lang, Li-Jen Mao, Wen-Lin Hsu
Author Affiliations +
Abstract
Clustering is among the oldest techniques used in data mining applications. Typical implementations of the hierarchical agglomerative clustering methods (HACM) require an amount of O(N2)-space, when there are N data objects, making such algorithms impractical for problems involving large datasets. The well-known clustering algorithm RNN- CLINK requires only O(N)-space, but O(N3)-time in the worst case, although the average time appears to be O(N2-log N). We provide a probabilistic interpretation of the average time complexity of the algorithm. We also report experimental results, using the randomly generated bit vectors, and using the NETNEWS articles as the input, to support our theoretical analysis.
© (1999) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Sheau-Dong Lang, Li-Jen Mao, and Wen-Lin Hsu "Probabilistic analysis of the RNN-CLINK clustering algorithm", Proc. SPIE 3695, Data Mining and Knowledge Discovery: Theory, Tools, and Technology, (25 February 1999); https://doi.org/10.1117/12.339988
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Statistical analysis

Chromium

Tin

Data mining

Statistical modeling

Algorithm development

Analytical research

RELATED CONTENT

Evaluating word semantic properties using Sketch Engine
Proceedings of SPIE (February 14 2015)
Haidilao customer comment data sentiment analysis
Proceedings of SPIE (May 23 2022)
DEM interpolation based on artificial neural networks
Proceedings of SPIE (December 02 2005)
Nonparametric robust regression estimate
Proceedings of SPIE (November 01 2006)

Back to Top