Abstract
The architecture of an intelligent multistrategy assistant for knowledge discovery from facts, INLEN, is described and illustrated by an exploratory application. INLEN integrates a database, a knowledge base, and machine learning methods within a uniform user-oriented framework. A variety of machine learning programs are incorporated into the system to serve as high-levelknowledge generation operators (KGOs). These operators can generate diverse kinds of knowledge about the properties and regularities existing in the data. For example, they can hypothesize general rules from facts, optimize the rules according to problem-dependent criteria, determine differences and similarities among groups of facts, propose new variables, create conceptual classifications, determine equations governing numeric variables and the conditions under which the equations apply, deriving statistical properties and using them for qualitative evaluations, etc. The initial implementation of the system, INLEN 1b, is described, and its performance is illustrated by applying it to a database of scientific publications.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Baim, P.W., “The PROMISE Method for Selecting Most Relevant Attributes for Inductive Learning Systems,” Report No. UIUCDCS-F-82-898, Department of Computer Science, University of Illinois, Urbana IL, 1982.
Baskin, A.B. and Michalski, R.S., An Integrated Approach to the Construction of Knowledge-Based Systems: Experiences with ADVISE and Related Programs. In Guida, G. and Tasso, C. (Eds.),Topics in Expert System Design. Elsevier, Amsterdam, pp. 111–143, 1989.
Bloedorn, E. and Michalski, R.S., “Data-Driven Constructive Induction in AQ17-DCI: A Method and Experiments,”Reports of Machine Learning and Inference Laboratory, MLI 89-12, Center for Artificial Intelligence, George Mason University, Fairfax, VA, 1992, to appear.
Boose, J.H., and Gaines, B.R. (Eds.),Proceedings of the 4th Knowledge Acquisition for Knowledge-Based Systems Workshop, Banff, Canada, 1989.
Boose, J.H., Gaines, B.R., and Ganascia, J.G. (Eds.),Proceedings of the Third European Workshop on Knowledge Acquisition for Knowledge-based Systems, Paris, 1989.
Boose, J.H., Gaines, B.R., and Linster, M. (Eds.),Proceedings of the Second European Workshop on Knowledge Acquisition for Knowledge-based Systems, Bonn, 1988.
Cestnik, B., Kokonenko, I., and Bratko, I., “ASSISTANT 86: A knowledge elicitation tool for sophisticates users,” inProc. Second European Working Session on Learning, Bratko, I. and Lavrac, N. (Eds.), Bled, Yugoslavia, 1987.
Collins, A. and Michalski, R.S., The Logic of Plausible Reasoning: A Core Theory,”Cognitive Science vol. 13, pp. 1–49, 1989.
Cramm, S.A., “ESEL/2: A Program for Selecting the Most Representative Training Events for Inductive Learning,” Report NO. UIUCDCS-F-83-901, Department of Computer Science, University of Illinois, Urbana, IL, 1983.
Davis, J.H., “CONVART: A Program for Constructive Induction on Time Dependent Data,” Master's Thesis, Department of Computer Science, University of Illinois, Urbana, IL, 1981.
Dietterich, T. and Michalski, R.S., Learning to Predict Sequences. In Michalski, R.S., Carbonell, J.G. and Mitchell, T. (Eds.),Machine Learning: An Artificial Intelligence Approach, Vol. II. Morgan Kaufmann Publishers, Los Altos, CA, pp. 63–106, 1986.
Dontas, K., “Applause: An Implementation of the Collins-Michalski Theory of Plausible Reasoning,” Master's Thesis, University of Tennessee, Knoxville, TN, 1988.
Falkenhainer, B. and Michalski, R.S., Integrating Quantitative and Qualitative Discovery in the ABACUS System. In Kodratoff and Michalski, R.S. (Eds.),Machine Learning: An Artificial Intelligence Approach, Vol. HI. Morgan Kaufmann, San Mateo, CA, 1990.
Greene, G., “Quantitative Discovery: Using Depedencies to Discover Non-Linear Terms,” Master's Thesis, Department of Computer Science, University of Illinois, Urbana, IL, 1988.
Hong, J., Mozetic, I., and Michalski, R.S., “AQ15: Incremental Learning of Attribute-Based Descriptions from Examples, the Method and User's Guide,” Report No. UIUCDCS-F-86-949, Department of Computer Science, University of Illinois, Urbana IL, 1986.
International Intelligent Systems, Inc.,User's Guide to AURORA 2.0: A Discovery System, International Intelligent Systems, Inc., Fairfax, VA, 1988.
Katz, B., Fermanian, T.W., and Michalski, R.S., “AgAssistant: An Experimental Expert System Builder for Agricultural Applications,” Report No. UIUCDCS-F-87-978, Department of Computer Science, University of Illinois, Urbana, IL, 1987.
Kaufman, K., Michalski, R.S., and Kerschberg, L., “Mining for Knowledge in Data: Goals and General Description of the INLEN System,” IJCAI-89 Workshop on Knowledge Discovery in Databases, Detroit, MI, 1989.
Kaufman, K., Michalski, R.S., and Schultz, A., “EMERALD-1: An Integrated System of Machine Learning and Discovery Programs for Education and Research, User's Guide,”Reports of Machine Learning and Inference Laboratory, MLI 89-12, Center for Artificial Intelligence, George Mason University, Fairfax, VA, 1989.
Kaufman, K., Michalski, R.S., and Schultz, A., “EMERALD-1: An Integrated System of Machine Learning and Discovery Programs for Education and Research, Programmer's Guide for the Sun Workstation,”Reports of Machine Learning and Inference Laboratory, MLI 90-13, Center for Artificial Intelligence, George Mason University, Fairfax, VA, 1990.
Kerschberg, L. (Ed.),Expert Database Systems: Proceedings from the First International Workshop, Benjamin/Cummings, Menlo Park, CA, 1986.
Kerschberg, L. (Ed.),Expert Database Systems: Proceedings from the First International Conference, Benjamin/Cummings, Menlo Park, CA, 1987.
Kerschberg, L. (Ed.),Expert Database Systems: Proceedings from the Second International Conference, Benjamin/Cummings, Menlo Park, CA, 1988.
Kokar, M.M., Coper: A Methodology for Learning Invariant Functional Descriptions. In Michalski, R.S., Mitchell, T.M., and Carbonell, J.G. (Eds.),Machine Learning, Kluwer, Boston, 1986.
Laird, J.E. (Ed.),Proceedings of the Fifth International Conference on Machine Learning, University of Michigan, Ann Arbor, MI, 1988.
Langely, P., Bradshaw, G.L., and Simon, H.A., Rediscovering Chemistry with the BACON System. In Michalski, R.S., Carbonell, J.G. and Mitchell, T.M. (Eds.),Machine Learning: An Artificial Intelligence Approach. Morgan Kaufmann, San Mateo, CA, 1983.
Layman, T.C., “A PASCAL Program to Convert Extended Entry Decision Tables into Optimal Decision Trees,” Department of Computer Science, Internal Report, University of Illinois, Urbana, IL, 1979.
Michalski, R.S., “Designing Extended Entry Decision Tables and Optimal Decision Trees Using Decision Diagrams,” Report No. UIUCDCS-R-78-898, Department of Computer Science, University of Illinois, Urbana IL, 1978.
Michalski, R.S., Theory and Methodology of Inductive Learning. In Michalski, R.S., Carbonell, J.G., and Mitchell, T.M. (Eds.),Machine Learning: An Artificial Intelligence Approach. Morgan Kaufmann, San, Mateo, CA, 1983.
Michalski, R.S., “Toward a Unified Theory of Learning: Multistrategy Task-adaptive Learning,”Reports of Machine Learning and Inference Laboratory, MLI 90-1, Center for Artificial Intelligence, George Mason University, Fairfax, VA, 1990.
Michalski R.S. and Baskin, A.B., “Integrating multiple knowledge representations and learning capabilities in an expert system: The ADVISE system,” inProc. 8th Int. Joint Conf. Artif. Intell. Karlsruhe, West Germany, pp. 256–258, 1983.
Michalski, R.S., Baskin, A.B., and Spackman, K.A., “A logic-based approach to conceptual database analysis,” inSixth Ann. Sympo. Computer Appl. Medical Care, George Washington University Medical Center, Washington, DC, pp. 792–796, 1982.
Michalski, R.S., Baskin, A.B., Uhrik, C., and Channic, T., “The ADVISE.1 Meta-Expert System: The General Design and a Technical Description,” Report No. UIUCDCS-F-87-962, Department of Computer Science, University of Illinois, Urbana, IL, 1987.
Michalski, R.S., Ko, H., and Chen, K., “SPARC/E(V.2), An Eleusis Rule Generator and Game Player,” ISG 85-11, UIUCDCS-F-85-941, Department of Computer Science, University of Illinois, Urbana, IL, 1985.
Michalski, R.S., Ko, H., and Chen, K., Qualitative Process Prediction: A Method and Program SPARC/G. In Guetler, C. (Ed.),Expert Systems. Academic Press, London, 1986.
Michalski, R.S., and Larson, J.B., “Selection of Most Representative Training Examples and Incremental Generation of VL1 Hypotheses: the Underlying Methodology and the Description of Programs ESEL and AQ11,” Report No. 867, Department of Comptuer Science, University of Illinois, Urbana, IL, 1978.
Michalski, R.S. and Larson, J.B., rev. by Chen, K., “Incremental Generation of VL1 Hypotheses: the Underlying Metholdogy and the Description of the Program AQ11,” Report No. UIUCDCS-F-83-905, Department of Computer Science, University of Illinois, Urbana, IL, 1983.
Michalski, R.S., Mozetic, I., Hong, J., and Lavrac, N., “The AQ15 Inductive Learning System: An Overview and Experiments,” Report No. UIUCDCS-R-86-1260, Department of Computer Science, University of Illinois, Urbana, IL, 1986.
Michalski, R.S., and Stepp, R.E., “Automated Construction of Classifications: Conceptual Clustering Versus Numerical Taxonomy,”IEEE Transactions on Pattern Analysis and Machine Intelligence, 1983.
Michalski, R.S., Stepp, R.E., and Diday, E., A Recent Advance in Data Analysis: Clustering Objects into Classes Characterized by Conjunctive Concepts. In Kanall, L.N. and Rosenfeld, A. (Eds.),Progress in Pattern Recognition, Vol. 1. North-Holland, New York, pp. 33–56, 1981.
Quinlan, J.R., Probabilistic Decision Trees. In Kodratoff, Y. and Michalski, R.S. (Eds.),Machine Learning: An Artificial Intelligence Approach, Vol. III. Morgan Kaufmann, San Mateo, CA, 1990.
Reinke, R.E., “Knowledge Acquisition and Refinement Tools for the ADVISE Meta-Expert System,” Master's Thesis, Department of Computer Science, University of Illinois, Urbana, IL, 1984.
Segre, A.M. (Ed.),Proceedings of the Sixth International Workshop on Machine Learning, Cornell University, Ithaca, NY, 1989.
Spackman, K.A., “QUIN: Integration of Inferential Operators within a Relational Database,” ISG 83-13, UIUCDCS-F-83-917, M.S. Thesis, Department of Computer Science, University of Illinois, Urbana, IL, 1983.
Stepp, R.E., “Learning without Negative Examples via Variable-Valued Logic Characterizations: The Uniclass Inductive Program AQ7UN1,” Report No. 982, Department of Computer Science, University of Illinois, Urbana, IL, 1979.
Stepp, R.E., “A Description and User's Guide for CLUSTER/2, a Programe for Conceptual Clustering,” Department of Computer Science, University of Illinois, Urbana, IL, 1983.
Stepp, R.E., “Conjunctive Conceptual Clustering: A Methodology and Experimentation,” Ph.D. Thesis, Department of Computer Science, University of Illinois, Urbana, IL, 1984.
Wnek, J. and Michalski, R.S., “Hypothesis-driven constructive induction in AQ17: A method and experiments,” inIJCAI-91 Workshop on Evaluating and Changing Representations in Machine Learning, Sydney, Australia, 1991.
Wnek, J. and Michalski, R.S., “An experimental comparison of symbolic and subsymbolic learning paradigms: phase I —learning logic-style concpets,”inProc. First International Workshop Multistrategy Learning, Harpers Ferry, WV, pp. 324–339, 1991.
Wnek, J., Sarma, J., Wahab, A., and Michalski, R.S., Comparing Learning Paradigms via Diagrammatic Visualizatioin. In Ras, Z., Zemankova, M., and Emrich, M. (Eds.),Methodologies for Intelligent Systems 5, Elsevier, New York, pp. 428–437, 1990.
Zhang, J., Integrating Symbolic and Subsymbolic Approaches: Learning Flexible Concepts. In Michalski, R.S. and Tecuci, G. (Eds.),Machine Learning: A Multistrategy Approach, Vol. IV. Morgan Kaufmann, San Mateo, CA, to appear.
Zhang, J. and Michalski, R.S., “Combining Symbolic and Subsymbolic Representations in Learning Flexible Concepts: The FCLS System,”Reports of Machine Learning and Inference Laboratory, Center for Artificial Intelligence, George Mason University, Fairfax, VA, to appear.
Zytkow, J.M., “Combining many searches in the FAHRENHEIT discovery system,” inProc. Fourth Int. Workshop Machine Learning, Irvine, CA, pp. 281–287, 1987.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Michalski, R.S., Kerschberg, L., Kaufman, K.A. et al. Mining for knowledge in databases: The INLEN architecture, initial implementation and first results. J Intell Inf Syst 1, 85–113 (1992). https://doi.org/10.1007/BF01006415
Issue Date:
DOI: https://doi.org/10.1007/BF01006415