Abstract
One of the concerns of knowledge discovery in databases is the production of association rules. An association rule A \(\longrightarrow\) B defines a relationship between two sets of attributes A and B, caracterising the data studied. Such a rule means that objects sharing attributes of A will “likely” have those contained in B. Yet, this notion of “likeliness” depends on the datamining context.
Many interestingness measures have been introduced in order to quantify this likeliness. This panel of measures is heterogeneous and the ranking of extracted rules, according to measures, may differ largely.
This contribution explores a new approach for assessing the quality of rules: aggregating valued relations. For each measure, a valued relation is built out of the numerical values it takes on the rules, and represents the preference of a rule over another. The aim in using such tools is to take into account the intensity of preference expressed by various measures, and should reduce incomparability issues related to differences in their co-domains. It also has the advantage of relating the numerical nature of measures compared to pure binary approaches.
We studied several aggregation operators. In this contribution we discuss results obtained on a toy example using the simplest of them.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: ACM SIGMOD Int. Conf., pp. 207–216 (1993)
Arrow, K.J.: Social Choice and Individual Values. Cowles Foundations and Wiley (1951)
Azé, J., Kodratoff, Y.: Evaluation de la résistance au bruit de quelques mesures d’extraction de règles d’assocation. EGC 2002 1(4), 143–154 (2002)
Blake, C., Merz, C.: UCI repository of machine learning databases (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Bodenhofer, U., Demirci, M.: Strict fuzzy orderings in a similarity-based setting. In: Proc. Joint 4th Conf. EUSFLAT and 11e LFA, pp. 297–302 (2005)
Bodenhofer, U., Küng, J.: Fuzzy orderings in flexible query answering systems. Soft Computing 8(7), 512–522 (2004)
Borgelt, C., Kruse, R.: Induction of association rules: Apriori implementation. In: 15th Conf. on Computational Statistics (2002)
Brans, J., Mareschal, B., Vincke, P.: PROMETHEE: A New Family of Outranking Methods in MCDM. In: IFORS 1984 (1984)
Brin, S., Motwani, R., Ullman, J.D., Tsur, S.: Dynamic itemset counting and implication rules for market basket data. In: Peckham, J. (ed.) ACM SIGMOD 1997 Int. Conf. on Management of Data, pp. 255–264 (1997)
Church, K.W., Hanks, P.: Word association norms, mutual information an lexicography. Computational Linguistics 16(1), 22–29 (1990)
Cohen, J.: A coefficient of agreement for nominal scale. Educational and Psychological Measurement 20, 37–46 (1960)
Fishburn, P.: Stochastic utility. In: Handbook of Utility Theory, vol. 1, pp. 273–319. Kluwer Academic, Dordrecht (1998)
Fodor, J., Roubens, M.: Fuzzy preference modelling and multicriteria decision support. Kluwer Academic Publishers, Dordrecht (1994)
Good, I.J.: The estimation of probabilities: An essay on modern bayesian methods. The MIT Press, Cambridge (1965)
Gras, R., Almouloud, S.A., Bailleuil, M., Larher, A., Polo, M., Ratsimba-Rajohn, H., Totohasina, A.: L’implication Statistique, Nouvelle Méthode Exploratoire de Données. Application à la Didactique, Travaux et Thèses. La Pensée Sauvage (1996)
Gras, R., Kuntz, P., Couturier, R., Guillet, F.: Une version entropique de l’intensité d’implication pour les corpus volumineux. EGC 2001 1(1-2), 69–80 (2001)
Guigues, J.L., Duquenne, V.: Familles minimales d’implications informatives résultant d’un tableau de données binaires. Math. Sci. hum. 25(95), 5–18 (1986)
International Business Machines. IBM Intelligent Miner User’s Guide, Version 1 Release 1, SH12-6213-00 (1996)
Jeffreys, H.J.: Some tests of significance treated by the theory of probability. In: Proc. of the Cambridge Philosophical Society, vol. 31, pp. 203–222 (1935)
Lallich, S., Teytaud, O.: Évaluation et validation de l’intérêt des règles d’association. RNTI-E-1, 193–217 (2004)
Legrain, A.: Agrégation de mesures de qualité de règles d’association, rapport de DEA MIASH (2004)
Lehn, R., Guillet, F., Kuntz, P., Briand, H., Philippé, J.: Felix: An interactive rule mining interface in a KDD process. In: HCP 1999, pp. 169–174 (1999)
Lenca, P., Meyer, P., Picouet, P., Vaillant, B., Lallich, S.: Critères d’évaluation des mesures de qualité en ecd. RNTI (1), 123–134 (2003)
Lenca, P., Meyer, P., Vaillant, B., Lallich, S.: A multicriteria decision aid for interestingness measure selection. Technical report, GET / ENST Bretagne (2004)
Lenca, P., Meyer, P., Vaillant, B., Picouet, P.: Aide multicritère à la décision pour évaluer les indices de qualité des connaissances – modélisation des préférences de l’utilisateur. EGC 2003 1(17), 271–282 (2003)
Lenca, P., Meyer, P., Vaillant, B., Picouet, P., Lallich, S.: Évaluation et analyse multicritère des mesures de qualité des règles d’association. RNTI-E-1, 219–246 (2004)
Lerman, I., Azé, J.: Une mesure probabiliste contextuelle discriminante de qualité des règles d’association. EGC 2003 1(17), 247–262 (2003)
Loevinger, J.: A systemic approach to the construction and evaluation of tests of ability. Psychological monographs 61(4) (1947)
Pearson, K.: Mathematical contributions to the theory of evolution. iii. regression, heredity and panmixia. Philosophical Transactions of the Royal Society A (1896)
Peneva, V., Popchev, I.: Properties of the aggregation operators related with fuzzy relations. Fuzzy Sets and Systems (139), 615–633 (2003)
Piatetsky-Shapiro, G.: Discovery, analysis and presentation of strong rules. In: Knowledge Discovery in Databases, pp. 229–248. AAAI/MIT Press (1991)
Saminger, S., Mesiar, R., Bodenhofer, U.: Domination of aggregation operators and preservation of transitivity. Internat. J. Uncertain. Fuzziness Knowledge-Based Systems 10(suppl.), 11–35 (2002)
Sebag, M., Schoenauer, M.: Generation of rules with certainty and confidence factors from incomplete and incoherent learning bases. In: EKAW 1988, pp. 28–1 – 28–20 (1988)
Tan, P.-N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: Eighth ACM SIGKDD Int. Conf., pp. 32–41 (2002)
Vaillant, B., Lenca, P., Lallich, S.: A clustering of interestingness measures. In: Suzuki, E., Arikawa, S. (eds.) DS 2004. LNCS (LNAI), vol. 3245, pp. 290–297. Springer, Heidelberg (2004)
Vaillant, B., Picouet, P., Lenca, P.: An extensible platform for rule quality measure benchmarking. In: HCP 2003, pp. 187–191 (2003)
Zhang, T.: Association rules. In: Terano, T., Chen, A.L.P. (eds.) PAKDD 2000. LNCS, vol. 1805. Springer, Heidelberg (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Barthélemy, JP., Legrain, A., Lenca, P., Vaillant, B. (2006). Aggregation of Valued Relations Applied to Association Rule Interestingness Measures. In: Torra, V., Narukawa, Y., Valls, A., Domingo-Ferrer, J. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2006. Lecture Notes in Computer Science(), vol 3885. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11681960_21
Download citation
DOI: https://doi.org/10.1007/11681960_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32780-6
Online ISBN: 978-3-540-32781-3
eBook Packages: Computer ScienceComputer Science (R0)