Characteristics of a parallel data mining application implemented on an ATM connected PC cluster

  • Conference paper
High-Performance Computing and Networking (HPCN-Europe 1997)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1225))

Abstract

Until recently, workstations were overwhelmingly superior to personal computers in terms of performance. However, recent PC technology has dramatically improved CPU, main memory, and cache memory performance. As a result, massively parallel computer systems are moving away from proprietary components, such as CPUs and disks, toward commodity parts.

As far as applications are concerned, we believe that data-intensive applications such as ad hoc query processing and data mining are very important for parallel processors, in addition to conventional scientific applications. Since ATM-connected PC clusters are very promising from the cost/performance point of view, we are examining the feasibility of implementing data mining on PC clusters. In this paper, we report our preliminary experimental results for parallel data mining on 2 suites of ATM-connected PC clusters, consisting of 8 PCs. Although there were several problems, such as the immaturity of the NICs and driver software, we achieved reasonably good performance for parallel data mining.

Editor information

Bob Hertzberger, Peter Sloot

Copyright information

© 1997 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Oguchi, M., Shintani, T., Tamura, T., Kitsuregawa, M. (1997). Characteristics of a parallel data mining application implemented on an ATM connected PC cluster. In: Hertzberger, B., Sloot, P. (eds) High-Performance Computing and Networking. HPCN-Europe 1997. Lecture Notes in Computer Science, vol 1225. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0031603

  • DOI: https://doi.org/10.1007/BFb0031603

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-62898-9

  • Online ISBN: 978-3-540-69041-2

  • eBook Packages: Springer Book Archive
