Skip to main content

A Proposal of High Performance Data Mining System

  • Conference paper
  • First Online:
Book cover Applied Parallel Computing (PARA 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2367))

Included in the following conference series:

Abstract

In recent years, new decision support system (DSS) based on the technologies of data warehouse, data mining and on-line analytical processing appeared. As the accumulated amount of data becomes enormous too much, the data quantitative problem, the data qualitative problem and the data presentation problem occur in data mining in large-scale databases and data warehouses. An effective way to enhance the power and flexibility of data mining in data warehouses and large databases is to integrate data mining with OLAP in DSS. Parallel and distributed processing are also two important components of successful large-scale data mining applications. In this paper, a high performance data mining scheme is proposed. The overall architecture and the mechanism of the system are described.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. M. S. Scott Morton, Management Decision Systems: Computer Based Support for Decision Making, Boston, Division of Research, Graduate School of Business Administration, Harvard University, 1971.

    Google Scholar 

  2. S. Alter, Decision Support Systems: Current Practice and Continuing Challenges, Addison-Wesley, Reading, MA, 1980.

    Google Scholar 

  3. F. Jane and B. W. Gerald, Representing Modeling Knowledge in an Intelligent Decision Support System, Decision Support Systems, 1986.

    Google Scholar 

  4. M. S. Y. Wang and James F. Courtney, JR., A Conceptual Architecture for Generalized Decision Support System Software, IEEE Transaction on S.M.C., Vol. Smc-14, 1984.

    Google Scholar 

  5. LIU Zhen, WANG Shuwen and JIANG Zhanhua, Research on Decision Support System Architecture, Journal of Jilin University of Technology, No. 3, 1994.

    Google Scholar 

  6. W. H. Inmon, Building the Data Warehouse, New York: John Wiley & Sons, 1993.

    Google Scholar 

  7. W. H. Inmon, Claudia Imhoff and Ryan Sousa, Corporate Information Factory, Wiley & Sons, 1997.

    Google Scholar 

  8. W. H. Inmon and R. H. Terdeman, Claudia Imhoff, Exploration Warehousing: Turning Business Information into Business Opportunity, Wiley, 2000.

    Google Scholar 

  9. U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, 1996.

    Google Scholar 

  10. M.S. Chen, J. Han, and P.S. Yu. Data mining: An overview from a database perspective, IEEE Transactions on Knowledge and Data Engineering, 1996.

    Google Scholar 

  11. Frawley, W., Piatetsky-Shapiro, G., and Matheus, C, Knowledge Discovery in Databases: An Overview, Knowledge Discovery in Databases, eds. G. Piatetsky-Shapiro and W. Frawley, 1–27, Cambridge, Mass.: AAAI Press / The MIT Press, 1991.

    Google Scholar 

  12. E. F. Codd, E. S. Codd and C. T. Salley, Beyond Decision Support, Computer-world, Vol. 27, No. 30, July 1993.

    Google Scholar 

  13. Qing Chen, Mining Exceptions and Quantitative Association Rules in Olap Data Cube, IEEE Transactions on Knowledge and Data Engineering, 1999.

    Google Scholar 

  14. Y. Chang, et al., An object Transaction Service Based on the CORBA Architecture, International Conference on Distributed Platforms, Dresden, 1996.

    Google Scholar 

  15. R. Hubert, Distributed Object Technology in EDS, International Conference on distributed Platforms, Dresden, 1996. 1

    Google Scholar 

  16. C.C. Fabris and A.A. Freitas. Incorporating deviation-detection functionality into the OLAP paradigm. Proc. XVI Brazilian Symp. on Databases (SBBD-2001), pp. 274–285. Rio de Janeiro, Brazil, 2001.

    Google Scholar 

  17. Masato Oguchi, Masaru Kitsuregawa, Data Mining on PC Cluster connected with Storage Area Network: Its Preliminary Experimental Results, IEEE International Conference on Communications, Helsinki, Finland, 2001.

    Google Scholar 

  18. Masato Oguchi and Masaru Kitsuregawa, Using Available Remote Memory Dynamically for Parallel Data Mining Application on ATM-Connected PC Cluster, Proc. of the International Parallel and Distributed Processing Symposium, IEEE Computer Society, 2000

    Google Scholar 

  19. C.C. Bojarczuk, H.S. Lopes, A.A. Freitas. Genetic programming for knowledge discovery in chest pain diagnosis. IEEE Engineering in Medicine and Biology magazine-special issue on data mining and knowledge discovery, 19(4), July/Aug. 2000.

    Google Scholar 

  20. D.L.A. Araujo, H.S. Lopes, A.A. Freitas. A parallel genetic algorithm for rule discovery in large databases. Proc. 1999 IEEE Systems, Man and Cybernetics Conf., v. III, Tokyo, Oct. 1999.

    Google Scholar 

  21. Mohammed J. Zaki, Parallel Sequence Mining on Shared-Memory Machines, Journal of Parallel and Distributed Computing, 61, 2001.

    Google Scholar 

  22. Jeffrey P. Bradford and Jose A. B. Fortes, Characterization and Parallelization of Decision-Tree Induction, Journal of Parallel and Distributed Computing, 61, 2001

    Google Scholar 

  23. Diane J. Cook, Lawrence B. Holder, Gehad Galal, and Ron Maglothin, Approched to Parallel Graph-Based Knowledge Discovery, Journal of Parallel and Distributed Computing, 61, 2001.

    Google Scholar 

  24. Sanjay Goil, PARSIMONY: An Infrastructure for Paralle Multidimensional Analysis and Data Mining, Journal of Parallel and Distributed Computing, 61, 2001.

    Google Scholar 

  25. Liu Zhen and Guo Minyi, A Proposal of Integrating Data Mining and On-Line Analytical Processing in Data Warehouse, Proceedings of 2001 International Conferences on Info-tech and Info-net, 2001.

    Google Scholar 

  26. A.A. Freitas, Generic, Set-Oriented Primitives to Support Data-Parallel Knowledge Discovery in Relational Databases Systems, thesis of the doctoral degree, Department of Computer Science, University of Essex, 1997.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Liu, Z., Guo, M. (2002). A Proposal of High Performance Data Mining System. In: Fagerholm, J., Haataja, J., Järvinen, J., Lyly, M., Råback, P., Savolainen, V. (eds) Applied Parallel Computing. PARA 2002. Lecture Notes in Computer Science, vol 2367. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48051-X_12

Download citation

  • DOI: https://doi.org/10.1007/3-540-48051-X_12

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43786-4

  • Online ISBN: 978-3-540-48051-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics