Abstract
Until recently, workstations were overwhelmingly superior to personal computers in terms of performance. However, recent PC technology has dramatically increased its CPU, main memory, and cache memory performance. Therefore massively parallel computer systems are moving away from proprietary components such as CPU, disks, etc. to commodity parts.
As far as applications are concerned, we believe that data intensive applications such as ad-hoc query processing and data mining is very important for parallel processors in addition to the conventional scientific applications. Since ATM connected PC clusters are very promising from the cost/performance point of view, we are examining the feasibility of implementing data mining over PC clusters. In this paper, we report our preliminary experimental results for parallel data mining on 2 suites of ATM connected PC clusters, consisting of 8 PCs. Although there are several kinds of problems such as immaturity of NIC and driver software, we achieved reasonably good performance for parallel data mining.
Preview
Unable to display preview. Download preview PDF.
References
Blumrich, M., Li, K., Alpert, R., Dubnicki, C., Felten, E., Sandberg, J.: Vertual Memory Mapped Network Interface for the SHRIMP Multicomputer. Proceedings of the Twenty-First International Symposium on Computer Architecture. (1994) 142–153
Huang, C., McKinley, P. K.: Communication Issues in Parallel Computing Across ATM Networks. IEEE Parallel and Distributed Technology. 2, 4 (1994) 73–86
Sterling, T., Saverese, D., Becker, D. J., Fryxell, B., Olson, K.: Communication Overhead for Space Science Applications on the Beowulf Parallel Workstaion. Proceedings of the Fourth IEEE International Symposium on High Performance Distributed Computing. (1995) 23–30
Carter, R., Laroco, J.: Commodity Clusters: Performance Comparison Between PC's and Workstations. Proceedings of the Fifth IEEE International Symposium on High Performance Distributed Computing. (1995) 292–304
Osborne, R., Zheng, Q., Howard, J., Casley, R., Hahn, D., Nakabayashi, T.: DART — A Low Overhead ATM Network Interface Chip. Proceedings of the HOT Interconnects IV. (1996) 175–186
Agrawal, R., Imielinski, T., Swami, A.: Database Mining: A Performance Perspective. IEEE Transactions on Knowledge and Data Engineering. 5, 6 (1993) 914–925
Agrawal, R., Imielinski, T., Swami, A.: Mining Association Rules between Sets of Items in Large Databases. Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data (1993) 207–216
Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules. Proceedings of the 20th International Conference on Very Large Data Bases (1994)
Shintani, T., Kitsuregawa, M.: Hash Based Parallel Algorithms for Mining Association Rules. Proceedings of the Fourth IEEE International Conference on Parallel and Distributed Information Systems (1996) 19–30
Heinanen, J.: Multiprotocol Encapsulation over ATM Adaptation Layer 5. RFC1483 (1993)
Laubach, M.: Classical IP and ARP over ATM. RFC1577 (1994)
Information Networks Division, Hewlett-Packard Company: Netperf: A Network Performance Benchmark, Revision 2.0. Tech. Rep., Hewlett-Packard Company (1995) http://www.cup.hp.com/netperf/NetperfPage.html.
Snir, M., Otto, S. W., Lederman, S. H., Walker, D. W., Dongarra, J.: MPI: The Complete Reference. The MIT Press (1996)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Oguchi, M., Shintani, T., Tamura, T., Kitsuregawa, M. (1997). Characteristics of a parallel data mining application implemented on an ATM connected PC cluster. In: Hertzberger, B., Sloot, P. (eds) High-Performance Computing and Networking. HPCN-Europe 1997. Lecture Notes in Computer Science, vol 1225. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0031603
Download citation
DOI: https://doi.org/10.1007/BFb0031603
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-62898-9
Online ISBN: 978-3-540-69041-2
eBook Packages: Springer Book Archive