Skip to main content

A Knowledge Scout for Discovering Medical Patterns: Methodology and System SCAMP

  • Conference paper
Flexible Query Answering Systems

Part of the book series: Advances in Soft Computing ((AINSC,volume 7))

Abstract

Knowledge scouts are software agents that autonomously synthesize knowledge of interest to a given user (target knowledge) by applying inductive database operators to a local or distributed dataset. This paper describes briefly a method and a scripting language for developing knowledge scouts, and then reports on experiments with a knowledge scout, SCAMP, for discovering patterns characterizing relationships among lifestyles, symptoms and diseases in a large medical database. Discovered patterns are presented in two forms: (1) attributional rules, which are expressions in attributional calculus, and (2) association graphs, which graphically and abstractly represent relations expressed by the rules. Preliminary results indicate a high potential utility of the presented methodology for deriving useful and understandable knowledge.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Imielinski, T. and Swami, A. (1993) Mining Association Rules between Sets of Items in Large Databases. Proceedings of the ACM SIGMOD Conference on Management of Data, 207–216.

    Google Scholar 

  2. Bergadano, F., Matwin S., Michalski, R.S. and Zhang, J. (1992) Learning Two-tiered Descriptions of Flexible Concepts: The POSEJDON System. Machine Learning 8, 5–43.

    Google Scholar 

  3. Boulicaut, J., Klemettinen, M. and Mannila, H. (1998) Querying Inductive Databases: A Case Study on the MINE RULE Operator. Proceedings of the Second European Symposium on Principles of Data Mining and Knowledge Discovery (PKDD’98).

    Google Scholar 

  4. Finin, T., Fritzson, R., McKay, D. and McEntire, R. (1994) KQML as an Agent Communication Language. Proceedings of the Third International Conference on Information and Knowledge Management (CIKM’94), ACM Press.

    Google Scholar 

  5. Fischthal, S. (1997) A Description and User’s Guide for CLUSTER/2C++ A Program for Conjunctive Conceptual Clustering. Reports of the Machine Learning and Inference Laboratory, MLI 97–10, George Mason University, Fairfax, VA.

    Google Scholar 

  6. Han, J., Fu, Y., Wang, W., Chiang, J., Gong, W., Koperski, K., Li, D., Lu, Y., Rajan, A., Stefanovic, N., Xia, B. and Zaiane, O.R. (1996) DBMiner: A System for Mining Knowledge in Large Relational Databases. Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, 250–255.

    Google Scholar 

  7. Imielinski, T., Virmani, A. and Abdulghani, A. (1996) DataMine: Application Programming Interface and Query Language for Database Mining. Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, 256–261.

    Google Scholar 

  8. Kaufman, K. (1997) INLEN: A Methodology and Integrated System for Knowledge Discovery in Databases. Ph.D. dissertation, George Mason University, Fairfax, VA.

    Google Scholar 

  9. Kaufman, K. and Michalski, R.S. (1998) Discovery Planning: Multistrategy Learning in Data Mining. Proceedings of the Fourth International Workshop on Multistrategy Learning, 14–20.

    Google Scholar 

  10. Kaufman, K. and Michalski, R.S. (1999) Learning From Inconsistent and Noisy Data: The AQ18 Approach. Proceedings of the Eleventh International Symposium on Methodologies for Intelligent Systems.

    Google Scholar 

  11. Mannila, H. (1997) Inductive Databases and Condensed Representations for Data Mining. in Maluszynski, J. (ed.), Proceedings of the International Logic Programming Symposium, MIT Press, Cambridge.

    Google Scholar 

  12. Meo, R., Psaila, G. and Ceri, S. (1996) A New SQL-like Operator for Mining Association Rules. Proceedings of the 22nd VLDB Conference.

    Google Scholar 

  13. Michalski, R. S. (1975) Synthesis of Optimal and Quasi-Optimal Variable-Valued Logic Formulas. Proceedings of the 1975 International Symposium on Multiple-Valued Logic, 76–87.

    Google Scholar 

  14. Michalski, R. S. (1976) Class notes for the course on Databases, Computer Science Department, University of Illinois at Champaign-Urbana.

    Google Scholar 

  15. Michalski, R. S. (1983) A Theory and Methodology of Inductive Learning. In Michalski, R.S. Carbonell, J.G. and Mitchell, T.M. (eds.), Machine Learning: An Artificial Intelligence Approach, Tioga Publishing, Palo Alto, 83–129.

    Google Scholar 

  16. Michalski, R.S. (1997) Seeking Knowledge in the Deluge of Facts. Fundamenta Informaticae 30, 283–297.

    Google Scholar 

  17. Michalski, R.S. (2000) NATURAL INDUCTION: Theory, Methodology and Applications to Machine Learning and Knowledge Mining. Reports of the Machine Learning and Inference Laboratory, George Mason University, Fairfax, VA (to appear).

    Google Scholar 

  18. Michalski R. S. and Kaufman, K. (1998) Data Mining and Knowledge Discovery: A Review of Issues and Multistrategy Methodology. In Michalski, R.S., Bratko, I. and Kubat, M. (eds.), Machine Learning and Data Mining: Methods and Applications, John Wiley & Sons, London, 71–112.

    Google Scholar 

  19. Michalski, R.S. and Kaufman, K. (2000) Building Knowledge Scouts Using KGL Metalanguage. Fundamenta Informaticae 40, 433–447.

    Google Scholar 

  20. Michalski, R.S. and Stepp, R. (1983) Automated Construction of Classifications: Conceptual Clustering versus Numerical Taxonomy. IEEE Trans. on Pattern Analysis and Machine Intelligence 5, 396–410.

    Google Scholar 

  21. Reinke, R.E. (1984) Knowledge Acquisition and Refinement Tools for the ADVISE Meta-Expert System. Master’s Thesis, Department of Computer Science, University of Illinois, Urbana, IL.

    Google Scholar 

  22. Sarawagi, S., Thomas, S. and Agrawal, R. (1998) Integrating Association Rule Mining with Relational Database Systems: Alternatives and Implications. Proceedings of the ACM SIGMOD International Conference on Management of Data.

    Google Scholar 

  23. Zhang, Q. and Michalski, R.S. (2000) KV: A Knowledge Visualization System Employing General Logic Diagrams. Reports of the Machine Learning and Inference Laboratory, George Mason University, Fairfax, VA (to appear).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kaufman, K.A., Michalski, R.S. (2001). A Knowledge Scout for Discovering Medical Patterns: Methodology and System SCAMP. In: Larsen, H.L., Andreasen, T., Christiansen, H., Kacprzyk, J., Zadrożny, S. (eds) Flexible Query Answering Systems. Advances in Soft Computing, vol 7. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1834-5_45

Download citation

  • DOI: https://doi.org/10.1007/978-3-7908-1834-5_45

  • Publisher Name: Physica, Heidelberg

  • Print ISBN: 978-3-7908-1347-0

  • Online ISBN: 978-3-7908-1834-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics