Skip to main content

A strategy for increasing the efficiency of rule discovery in data mining

  • Conference paper
  • First Online:
  • 707 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1280))

Abstract

Increasing the efficiency of rule discovery is currently a major focus of research interest in data mining. Strategies available to the data miner include data sampling, knowledge-guided discovery, attribute reduction, parallelisation of the discovery process, and focusing on the discovery of a restricted class of rules, or those which appear most promising according to some measure of rule interest. This paper presents a new approach which combines the strategies of focusing on rules which appear most interesting, exploiting structural features of the data set when possible, and decomposition of the discovery process into sub-tasks which can be executed independently on parallel processsors.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cendrowska, J.: PRISM: an algorithm for inducing modular rules. International Journal of Man-Machine Studies 27 (1987) 349–370

    Article  MATH  Google Scholar 

  2. Frawley, W.J., Piatetsky-Shapiro, G., Matheus, C.J.: Knowledge Discovery in Databases: an Overview. In Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases (AAAI Press, Menlo Park, CA, 1991) 1–27

    Google Scholar 

  3. McSherry, D.: An algorithm for the discovery of characteristic rules. Digest No. 96/198 (Institution of Electrical Engineers, London, 1996) 4/1–3

    Google Scholar 

  4. McSherry, D.: Qualitative assessment of rule interest in data mining. Proceedings of the Sixteenth Annual Technical Conference of the BCS Specialist Group on Expert Systems, Cambridge, December 1996, 204–215

    Google Scholar 

  5. Murphy, P.M., Aha, D.W.: UCI Repository of Machine Learning Databases. http://www.ics.uci.edu/~mlearn/MLRepository.html (1995)

    Google Scholar 

  6. Nelson, C.: Improving customer retention with knowledge guided data mining. BCS Specialist Group on Expert Systems Newsletter, No. 33 (1995) 15–20

    Google Scholar 

  7. Piatetsky-Shapiro, G.: Discovery, analysis and presentation of strong rules. In Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases (AAAI Press, Menlo Park, CA, 1991) 229–248

    Google Scholar 

  8. Quinlan, J.R.: Induction of decision trees. Machine Learning 1 (1986) 81–106

    Google Scholar 

  9. Shortland, R.J., Scarfe, R.T.: Data mining applications in BT. BT Technology Journal 12 (1994) 17–22

    Google Scholar 

  10. Simoudis, E., John, G., Kerber, R., Livezey, B., Miller, P.: Developing customer vulnerability models using data mining techniques. Proceedings of IDA-95, Baden-Baden, August 1995, 181–185

    Google Scholar 

  11. Smyth, P., Goodman, R.M.: Rule induction using information theory. In Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases (AAAI Press, Menlo Park, CA, 1991) 159–176

    Google Scholar 

  12. Thompson, S., Bramer, M.A.: Parallel knowledge discovery: a review of existing techniques. Digest No. 96/198 (Institution of Electrical Engineers, London, 1996) 5/1–5

    Google Scholar 

  13. Ziarko, W.: Discovery, analysis, and representation of data dependencies in databases. In Piatetsky-Shapiro, G., Prawley, W.J. (eds.) Knowledge Discovery in Databases (AAAI Press, Menlo Park, CA, 1991) 195–209

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Xiaohui Liu Paul Cohen Michael Berthold

Rights and permissions

Reprints and permissions

Copyright information

© 1997 Springer-Verlag

About this paper

Cite this paper

McSherry, D. (1997). A strategy for increasing the efficiency of rule discovery in data mining. In: Liu, X., Cohen, P., Berthold, M. (eds) Advances in Intelligent Data Analysis Reasoning about Data. IDA 1997. Lecture Notes in Computer Science, vol 1280. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0052857

Download citation

  • DOI: https://doi.org/10.1007/BFb0052857

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-63346-4

  • Online ISBN: 978-3-540-69520-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics