Abstract
Rule Discovery in Databases integrates machine learning, probabilistic techniques and database concepts to learn a range comprehensible knowledge in sparse, noisy and redundant data. The discovery enables the learning of rules from data and extract their underlying structure. In this paper, we present the probabilistic index and the notion of minimal set of discovered rules which enhance runtime performance, improve discovery accuracy, resist noise, converges with the size of the sample, and eliminates coarse and redundant rules. This index can be used within the framework of an incremental discovery system. In other words, in this paper, we describe the rule intensity measurement which is an index that answers the question ‘What is the probability of having a rule of the form ‘IF premise THEN Conclusion’; the premise and conclusion are conjunctions of propositions ?’
Preview
Unable to display preview. Download preview PDF.
References
T. M. Anwar, H. W. Beck, S. B. Navathe. “Knowledge Mining by Imprecise Querying: A Classification-Based Approach, “IEEE 8 th Int. Conf. on Data Eng. Phoenix, Arizona, Feb. 1992.
Briand H., Crampes J. B., Hebrail Y., Herin-Aime D., Kouloumdjan, Sabatier R., “Les systemes d'Information”, DUNOD edition, 1986.
Clark P., Niblett T. “The NC2 induction algorithm. Machine Learning”, 3. p 261–283 1988.
Cendrowska J. “An Algorithm for inducing modular rules”, Int J. Man-Machine Studies, p 349–370, 1987.
Diday E. Mennessier M.O. “Analyse symbolique pour la prévision de séries chronologiques pseudo-périodiques”, Induction symbolique et numérique à partir de données, p 179–192,Cépaduès-éditions, 91.
Frawley W. J., Piatetsky-Shapiro, C. J. Matheus, “Knowledge discovery in databases: an overview”, in Knowledge Dicovery in Databases. Cambridge, MA: AAAI/MIT, 1991, pages 1–27.
Ganascia J. G., “CHARADE: A rule system learning intelligence”, IJCAI, Milan, Italy, Août 1987.
R. M. Goodman, P. Smyth, “The induction of probabilistic rules set — the rule algorithm”, Proceedings of the sixth international workshop on machine learning, Splatz B. ed., p 129–132, San Mateo, CA Morgan Kaufmann 1989.
Gras R., Larher A. “L'implication statistique, une nouvelle méthode d'analyse de données”, Mathématiques, Informatique et Sciences Humaines n∘120.
Ho Tu Bao, Tong Thi Thanh Huyen, “A method for generating rules from examples and its application”, Symbolic Numeric Data Analysis And Learning, p 493–504, Nova Sciences Publishers 1991.
Kodrattof Y., Tecuci G., “Techniques of design and DISCIPLE learning apprentice”, Knowledge acquisition and learning, Kaufmann edition, pages 655–668, 1993.
W. Klosgen, “Visualization and adaptivity in the statics interpreter EXPLORA”, in Workshop Notes from the 9th Nat. Conf. Art. Intell.: Knowledge Discovery in Databases. American Association for Artificial Intelligence, Anaheim, CA, July 1991, pages 25–34.
C. J. Matheus, P. K. Chan, G. Piatetsky-Shapiro, “Systems for Knowledge Discovery in Databases”, IEEE Trans. Knowl. Data Eng., vol 5, n 6, 1993.
J. R. Quinlan,“Generating Production Rules from Decisions Trees”. The 10 th International Conference on Artificial Intelligence, p 304–307, 1987.
M. Sebag, M. Schoenauer, “Un réseau de règles d'apprentissage”, Induction symbolique et numérique à partir de données, p 241–255, Cépaduès-éditions, 91.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fleury, L., Djeraba, C., Briand, H., Philippe, J. (1995). Some aspects of rule discovery in data bases. In: Bhalla, S. (eds) Information Systems and Data Management. CISMOD 1995. Lecture Notes in Computer Science, vol 1006. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60584-3_32
Download citation
DOI: https://doi.org/10.1007/3-540-60584-3_32
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60584-3
Online ISBN: 978-3-540-47799-0
eBook Packages: Springer Book Archive