To read this content please select one of the options below:

EFFICIENCY OF HIERARCHIC AGGLOMERATIVE CLUSTERING USING THE ICL DISTRIBUTED ARRAY PROCESSOR

EDIE M. RASMUSSEN (Department of Information Studies, University of Sheffield Western Bank, Sheffield, S10 2TN)
PETER WILLETT (Department of Information Studies, University of Sheffield Western Bank, Sheffield, S10 2TN)

Journal of Documentation

ISSN: 0022-0418

Article publication date: 1 January 1989

201

Abstract

The implementation of hierarchic agglomerative methods of cluster anlaysis for large datasets is very demanding of computational resources when implemented on conventional computers. The ICL Distributed Array Processor (DAP) allows many of the scanning and matching operations required in clustering to be carried out in parallel. Experiments are described using the single linkage and Ward's hierarchical agglomerative clustering methods on both real and simulated datasets. Clustering runs on the DAP are compared with the most efficient algorithms currently available implemented on an IBM 3083 BX. The DAP is found to be 2.9–7.9 times as fast as the IBM, the exact degree of speed‐up depending on the size of the dataset, the clustering method, and the serial clustering algorithm that is used. An analysis of the cycle times of the two machines is presented which suggests that further, very substantial speed‐ups could be obtained from array processors of this type if they were to be based on more powerful processing elements.

Citation

RASMUSSEN, E.M. and WILLETT, P. (1989), "EFFICIENCY OF HIERARCHIC AGGLOMERATIVE CLUSTERING USING THE ICL DISTRIBUTED ARRAY PROCESSOR", Journal of Documentation, Vol. 45 No. 1, pp. 1-24. https://doi.org/10.1108/eb026836

Publisher

:

MCB UP Ltd

Copyright © 1989, MCB UP Limited

Related articles