Skip to main content
Log in

Mining for knowledge in databases: The INLEN architecture, initial implementation and first results

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

The architecture of an intelligent multistrategy assistant for knowledge discovery from facts, INLEN, is described and illustrated by an exploratory application. INLEN integrates a database, a knowledge base, and machine learning methods within a uniform user-oriented framework. A variety of machine learning programs are incorporated into the system to serve as high-levelknowledge generation operators (KGOs). These operators can generate diverse kinds of knowledge about the properties and regularities existing in the data. For example, they can hypothesize general rules from facts, optimize the rules according to problem-dependent criteria, determine differences and similarities among groups of facts, propose new variables, create conceptual classifications, determine equations governing numeric variables and the conditions under which the equations apply, deriving statistical properties and using them for qualitative evaluations, etc. The initial implementation of the system, INLEN 1b, is described, and its performance is illustrated by applying it to a database of scientific publications.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  1. Baim, P.W., “The PROMISE Method for Selecting Most Relevant Attributes for Inductive Learning Systems,” Report No. UIUCDCS-F-82-898, Department of Computer Science, University of Illinois, Urbana IL, 1982.

    Google Scholar 

  2. Baskin, A.B. and Michalski, R.S., An Integrated Approach to the Construction of Knowledge-Based Systems: Experiences with ADVISE and Related Programs. In Guida, G. and Tasso, C. (Eds.),Topics in Expert System Design. Elsevier, Amsterdam, pp. 111–143, 1989.

    Google Scholar 

  3. Bloedorn, E. and Michalski, R.S., “Data-Driven Constructive Induction in AQ17-DCI: A Method and Experiments,”Reports of Machine Learning and Inference Laboratory, MLI 89-12, Center for Artificial Intelligence, George Mason University, Fairfax, VA, 1992, to appear.

    Google Scholar 

  4. Boose, J.H., and Gaines, B.R. (Eds.),Proceedings of the 4th Knowledge Acquisition for Knowledge-Based Systems Workshop, Banff, Canada, 1989.

    Google Scholar 

  5. Boose, J.H., Gaines, B.R., and Ganascia, J.G. (Eds.),Proceedings of the Third European Workshop on Knowledge Acquisition for Knowledge-based Systems, Paris, 1989.

  6. Boose, J.H., Gaines, B.R., and Linster, M. (Eds.),Proceedings of the Second European Workshop on Knowledge Acquisition for Knowledge-based Systems, Bonn, 1988.

  7. Cestnik, B., Kokonenko, I., and Bratko, I., “ASSISTANT 86: A knowledge elicitation tool for sophisticates users,” inProc. Second European Working Session on Learning, Bratko, I. and Lavrac, N. (Eds.), Bled, Yugoslavia, 1987.

    Google Scholar 

  8. Collins, A. and Michalski, R.S., The Logic of Plausible Reasoning: A Core Theory,”Cognitive Science vol. 13, pp. 1–49, 1989.

    Google Scholar 

  9. Cramm, S.A., “ESEL/2: A Program for Selecting the Most Representative Training Events for Inductive Learning,” Report NO. UIUCDCS-F-83-901, Department of Computer Science, University of Illinois, Urbana, IL, 1983.

    Google Scholar 

  10. Davis, J.H., “CONVART: A Program for Constructive Induction on Time Dependent Data,” Master's Thesis, Department of Computer Science, University of Illinois, Urbana, IL, 1981.

    Google Scholar 

  11. Dietterich, T. and Michalski, R.S., Learning to Predict Sequences. In Michalski, R.S., Carbonell, J.G. and Mitchell, T. (Eds.),Machine Learning: An Artificial Intelligence Approach, Vol. II. Morgan Kaufmann Publishers, Los Altos, CA, pp. 63–106, 1986.

    Google Scholar 

  12. Dontas, K., “Applause: An Implementation of the Collins-Michalski Theory of Plausible Reasoning,” Master's Thesis, University of Tennessee, Knoxville, TN, 1988.

    Google Scholar 

  13. Falkenhainer, B. and Michalski, R.S., Integrating Quantitative and Qualitative Discovery in the ABACUS System. In Kodratoff and Michalski, R.S. (Eds.),Machine Learning: An Artificial Intelligence Approach, Vol. HI. Morgan Kaufmann, San Mateo, CA, 1990.

    Google Scholar 

  14. Greene, G., “Quantitative Discovery: Using Depedencies to Discover Non-Linear Terms,” Master's Thesis, Department of Computer Science, University of Illinois, Urbana, IL, 1988.

    Google Scholar 

  15. Hong, J., Mozetic, I., and Michalski, R.S., “AQ15: Incremental Learning of Attribute-Based Descriptions from Examples, the Method and User's Guide,” Report No. UIUCDCS-F-86-949, Department of Computer Science, University of Illinois, Urbana IL, 1986.

    Google Scholar 

  16. International Intelligent Systems, Inc.,User's Guide to AURORA 2.0: A Discovery System, International Intelligent Systems, Inc., Fairfax, VA, 1988.

    Google Scholar 

  17. Katz, B., Fermanian, T.W., and Michalski, R.S., “AgAssistant: An Experimental Expert System Builder for Agricultural Applications,” Report No. UIUCDCS-F-87-978, Department of Computer Science, University of Illinois, Urbana, IL, 1987.

    Google Scholar 

  18. Kaufman, K., Michalski, R.S., and Kerschberg, L., “Mining for Knowledge in Data: Goals and General Description of the INLEN System,” IJCAI-89 Workshop on Knowledge Discovery in Databases, Detroit, MI, 1989.

  19. Kaufman, K., Michalski, R.S., and Schultz, A., “EMERALD-1: An Integrated System of Machine Learning and Discovery Programs for Education and Research, User's Guide,”Reports of Machine Learning and Inference Laboratory, MLI 89-12, Center for Artificial Intelligence, George Mason University, Fairfax, VA, 1989.

    Google Scholar 

  20. Kaufman, K., Michalski, R.S., and Schultz, A., “EMERALD-1: An Integrated System of Machine Learning and Discovery Programs for Education and Research, Programmer's Guide for the Sun Workstation,”Reports of Machine Learning and Inference Laboratory, MLI 90-13, Center for Artificial Intelligence, George Mason University, Fairfax, VA, 1990.

    Google Scholar 

  21. Kerschberg, L. (Ed.),Expert Database Systems: Proceedings from the First International Workshop, Benjamin/Cummings, Menlo Park, CA, 1986.

    Google Scholar 

  22. Kerschberg, L. (Ed.),Expert Database Systems: Proceedings from the First International Conference, Benjamin/Cummings, Menlo Park, CA, 1987.

    Google Scholar 

  23. Kerschberg, L. (Ed.),Expert Database Systems: Proceedings from the Second International Conference, Benjamin/Cummings, Menlo Park, CA, 1988.

    Google Scholar 

  24. Kokar, M.M., Coper: A Methodology for Learning Invariant Functional Descriptions. In Michalski, R.S., Mitchell, T.M., and Carbonell, J.G. (Eds.),Machine Learning, Kluwer, Boston, 1986.

    Google Scholar 

  25. Laird, J.E. (Ed.),Proceedings of the Fifth International Conference on Machine Learning, University of Michigan, Ann Arbor, MI, 1988.

    Google Scholar 

  26. Langely, P., Bradshaw, G.L., and Simon, H.A., Rediscovering Chemistry with the BACON System. In Michalski, R.S., Carbonell, J.G. and Mitchell, T.M. (Eds.),Machine Learning: An Artificial Intelligence Approach. Morgan Kaufmann, San Mateo, CA, 1983.

    Google Scholar 

  27. Layman, T.C., “A PASCAL Program to Convert Extended Entry Decision Tables into Optimal Decision Trees,” Department of Computer Science, Internal Report, University of Illinois, Urbana, IL, 1979.

    Google Scholar 

  28. Michalski, R.S., “Designing Extended Entry Decision Tables and Optimal Decision Trees Using Decision Diagrams,” Report No. UIUCDCS-R-78-898, Department of Computer Science, University of Illinois, Urbana IL, 1978.

    Google Scholar 

  29. Michalski, R.S., Theory and Methodology of Inductive Learning. In Michalski, R.S., Carbonell, J.G., and Mitchell, T.M. (Eds.),Machine Learning: An Artificial Intelligence Approach. Morgan Kaufmann, San, Mateo, CA, 1983.

    Google Scholar 

  30. Michalski, R.S., “Toward a Unified Theory of Learning: Multistrategy Task-adaptive Learning,”Reports of Machine Learning and Inference Laboratory, MLI 90-1, Center for Artificial Intelligence, George Mason University, Fairfax, VA, 1990.

    Google Scholar 

  31. Michalski R.S. and Baskin, A.B., “Integrating multiple knowledge representations and learning capabilities in an expert system: The ADVISE system,” inProc. 8th Int. Joint Conf. Artif. Intell. Karlsruhe, West Germany, pp. 256–258, 1983.

  32. Michalski, R.S., Baskin, A.B., and Spackman, K.A., “A logic-based approach to conceptual database analysis,” inSixth Ann. Sympo. Computer Appl. Medical Care, George Washington University Medical Center, Washington, DC, pp. 792–796, 1982.

    Google Scholar 

  33. Michalski, R.S., Baskin, A.B., Uhrik, C., and Channic, T., “The ADVISE.1 Meta-Expert System: The General Design and a Technical Description,” Report No. UIUCDCS-F-87-962, Department of Computer Science, University of Illinois, Urbana, IL, 1987.

    Google Scholar 

  34. Michalski, R.S., Ko, H., and Chen, K., “SPARC/E(V.2), An Eleusis Rule Generator and Game Player,” ISG 85-11, UIUCDCS-F-85-941, Department of Computer Science, University of Illinois, Urbana, IL, 1985.

    Google Scholar 

  35. Michalski, R.S., Ko, H., and Chen, K., Qualitative Process Prediction: A Method and Program SPARC/G. In Guetler, C. (Ed.),Expert Systems. Academic Press, London, 1986.

    Google Scholar 

  36. Michalski, R.S., and Larson, J.B., “Selection of Most Representative Training Examples and Incremental Generation of VL1 Hypotheses: the Underlying Methodology and the Description of Programs ESEL and AQ11,” Report No. 867, Department of Comptuer Science, University of Illinois, Urbana, IL, 1978.

    Google Scholar 

  37. Michalski, R.S. and Larson, J.B., rev. by Chen, K., “Incremental Generation of VL1 Hypotheses: the Underlying Metholdogy and the Description of the Program AQ11,” Report No. UIUCDCS-F-83-905, Department of Computer Science, University of Illinois, Urbana, IL, 1983.

    Google Scholar 

  38. Michalski, R.S., Mozetic, I., Hong, J., and Lavrac, N., “The AQ15 Inductive Learning System: An Overview and Experiments,” Report No. UIUCDCS-R-86-1260, Department of Computer Science, University of Illinois, Urbana, IL, 1986.

    Google Scholar 

  39. Michalski, R.S., and Stepp, R.E., “Automated Construction of Classifications: Conceptual Clustering Versus Numerical Taxonomy,”IEEE Transactions on Pattern Analysis and Machine Intelligence, 1983.

  40. Michalski, R.S., Stepp, R.E., and Diday, E., A Recent Advance in Data Analysis: Clustering Objects into Classes Characterized by Conjunctive Concepts. In Kanall, L.N. and Rosenfeld, A. (Eds.),Progress in Pattern Recognition, Vol. 1. North-Holland, New York, pp. 33–56, 1981.

    Google Scholar 

  41. Quinlan, J.R., Probabilistic Decision Trees. In Kodratoff, Y. and Michalski, R.S. (Eds.),Machine Learning: An Artificial Intelligence Approach, Vol. III. Morgan Kaufmann, San Mateo, CA, 1990.

    Google Scholar 

  42. Reinke, R.E., “Knowledge Acquisition and Refinement Tools for the ADVISE Meta-Expert System,” Master's Thesis, Department of Computer Science, University of Illinois, Urbana, IL, 1984.

    Google Scholar 

  43. Segre, A.M. (Ed.),Proceedings of the Sixth International Workshop on Machine Learning, Cornell University, Ithaca, NY, 1989.

    Google Scholar 

  44. Spackman, K.A., “QUIN: Integration of Inferential Operators within a Relational Database,” ISG 83-13, UIUCDCS-F-83-917, M.S. Thesis, Department of Computer Science, University of Illinois, Urbana, IL, 1983.

    Google Scholar 

  45. Stepp, R.E., “Learning without Negative Examples via Variable-Valued Logic Characterizations: The Uniclass Inductive Program AQ7UN1,” Report No. 982, Department of Computer Science, University of Illinois, Urbana, IL, 1979.

    Google Scholar 

  46. Stepp, R.E., “A Description and User's Guide for CLUSTER/2, a Programe for Conceptual Clustering,” Department of Computer Science, University of Illinois, Urbana, IL, 1983.

    Google Scholar 

  47. Stepp, R.E., “Conjunctive Conceptual Clustering: A Methodology and Experimentation,” Ph.D. Thesis, Department of Computer Science, University of Illinois, Urbana, IL, 1984.

    Google Scholar 

  48. Wnek, J. and Michalski, R.S., “Hypothesis-driven constructive induction in AQ17: A method and experiments,” inIJCAI-91 Workshop on Evaluating and Changing Representations in Machine Learning, Sydney, Australia, 1991.

  49. Wnek, J. and Michalski, R.S., “An experimental comparison of symbolic and subsymbolic learning paradigms: phase I —learning logic-style concpets,”inProc. First International Workshop Multistrategy Learning, Harpers Ferry, WV, pp. 324–339, 1991.

  50. Wnek, J., Sarma, J., Wahab, A., and Michalski, R.S., Comparing Learning Paradigms via Diagrammatic Visualizatioin. In Ras, Z., Zemankova, M., and Emrich, M. (Eds.),Methodologies for Intelligent Systems 5, Elsevier, New York, pp. 428–437, 1990.

    Google Scholar 

  51. Zhang, J., Integrating Symbolic and Subsymbolic Approaches: Learning Flexible Concepts. In Michalski, R.S. and Tecuci, G. (Eds.),Machine Learning: A Multistrategy Approach, Vol. IV. Morgan Kaufmann, San Mateo, CA, to appear.

  52. Zhang, J. and Michalski, R.S., “Combining Symbolic and Subsymbolic Representations in Learning Flexible Concepts: The FCLS System,”Reports of Machine Learning and Inference Laboratory, Center for Artificial Intelligence, George Mason University, Fairfax, VA, to appear.

  53. Zytkow, J.M., “Combining many searches in the FAHRENHEIT discovery system,” inProc. Fourth Int. Workshop Machine Learning, Irvine, CA, pp. 281–287, 1987.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Michalski, R.S., Kerschberg, L., Kaufman, K.A. et al. Mining for knowledge in databases: The INLEN architecture, initial implementation and first results. J Intell Inf Syst 1, 85–113 (1992). https://doi.org/10.1007/BF01006415

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01006415

Keywords