Skip to main content

Aggregation of Valued Relations Applied to Association Rule Interestingness Measures

  • Conference paper
Modeling Decisions for Artificial Intelligence (MDAI 2006)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3885))

Abstract

One of the concerns of knowledge discovery in databases is the production of association rules. An association rule A \(\longrightarrow\) B defines a relationship between two sets of attributes A and B, caracterising the data studied. Such a rule means that objects sharing attributes of A will “likely” have those contained in B. Yet, this notion of “likeliness” depends on the datamining context.

Many interestingness measures have been introduced in order to quantify this likeliness. This panel of measures is heterogeneous and the ranking of extracted rules, according to measures, may differ largely.

This contribution explores a new approach for assessing the quality of rules: aggregating valued relations. For each measure, a valued relation is built out of the numerical values it takes on the rules, and represents the preference of a rule over another. The aim in using such tools is to take into account the intensity of preference expressed by various measures, and should reduce incomparability issues related to differences in their co-domains. It also has the advantage of relating the numerical nature of measures compared to pure binary approaches.

We studied several aggregation operators. In this contribution we discuss results obtained on a toy example using the simplest of them.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: ACM SIGMOD Int. Conf., pp. 207–216 (1993)

    Google Scholar 

  2. Arrow, K.J.: Social Choice and Individual Values. Cowles Foundations and Wiley (1951)

    Google Scholar 

  3. Azé, J., Kodratoff, Y.: Evaluation de la résistance au bruit de quelques mesures d’extraction de règles d’assocation. EGC 2002 1(4), 143–154 (2002)

    Google Scholar 

  4. Blake, C., Merz, C.: UCI repository of machine learning databases (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html

  5. Bodenhofer, U., Demirci, M.: Strict fuzzy orderings in a similarity-based setting. In: Proc. Joint 4th Conf. EUSFLAT and 11e LFA, pp. 297–302 (2005)

    Google Scholar 

  6. Bodenhofer, U., Küng, J.: Fuzzy orderings in flexible query answering systems. Soft Computing 8(7), 512–522 (2004)

    Article  MATH  Google Scholar 

  7. Borgelt, C., Kruse, R.: Induction of association rules: Apriori implementation. In: 15th Conf. on Computational Statistics (2002)

    Google Scholar 

  8. Brans, J., Mareschal, B., Vincke, P.: PROMETHEE: A New Family of Outranking Methods in MCDM. In: IFORS 1984 (1984)

    Google Scholar 

  9. Brin, S., Motwani, R., Ullman, J.D., Tsur, S.: Dynamic itemset counting and implication rules for market basket data. In: Peckham, J. (ed.) ACM SIGMOD 1997 Int. Conf. on Management of Data, pp. 255–264 (1997)

    Google Scholar 

  10. Church, K.W., Hanks, P.: Word association norms, mutual information an lexicography. Computational Linguistics 16(1), 22–29 (1990)

    Google Scholar 

  11. Cohen, J.: A coefficient of agreement for nominal scale. Educational and Psychological Measurement 20, 37–46 (1960)

    Article  Google Scholar 

  12. Fishburn, P.: Stochastic utility. In: Handbook of Utility Theory, vol. 1, pp. 273–319. Kluwer Academic, Dordrecht (1998)

    Google Scholar 

  13. Fodor, J., Roubens, M.: Fuzzy preference modelling and multicriteria decision support. Kluwer Academic Publishers, Dordrecht (1994)

    Book  MATH  Google Scholar 

  14. Good, I.J.: The estimation of probabilities: An essay on modern bayesian methods. The MIT Press, Cambridge (1965)

    MATH  Google Scholar 

  15. Gras, R., Almouloud, S.A., Bailleuil, M., Larher, A., Polo, M., Ratsimba-Rajohn, H., Totohasina, A.: L’implication Statistique, Nouvelle Méthode Exploratoire de Données. Application à la Didactique, Travaux et Thèses. La Pensée Sauvage (1996)

    Google Scholar 

  16. Gras, R., Kuntz, P., Couturier, R., Guillet, F.: Une version entropique de l’intensité d’implication pour les corpus volumineux. EGC 2001 1(1-2), 69–80 (2001)

    Google Scholar 

  17. Guigues, J.L., Duquenne, V.: Familles minimales d’implications informatives résultant d’un tableau de données binaires. Math. Sci. hum. 25(95), 5–18 (1986)

    Google Scholar 

  18. International Business Machines. IBM Intelligent Miner User’s Guide, Version 1 Release 1, SH12-6213-00 (1996)

    Google Scholar 

  19. Jeffreys, H.J.: Some tests of significance treated by the theory of probability. In: Proc. of the Cambridge Philosophical Society, vol. 31, pp. 203–222 (1935)

    Google Scholar 

  20. Lallich, S., Teytaud, O.: Évaluation et validation de l’intérêt des règles d’association. RNTI-E-1, 193–217 (2004)

    Google Scholar 

  21. Legrain, A.: Agrégation de mesures de qualité de règles d’association, rapport de DEA MIASH (2004)

    Google Scholar 

  22. Lehn, R., Guillet, F., Kuntz, P., Briand, H., Philippé, J.: Felix: An interactive rule mining interface in a KDD process. In: HCP 1999, pp. 169–174 (1999)

    Google Scholar 

  23. Lenca, P., Meyer, P., Picouet, P., Vaillant, B., Lallich, S.: Critères d’évaluation des mesures de qualité en ecd. RNTI (1), 123–134 (2003)

    Google Scholar 

  24. Lenca, P., Meyer, P., Vaillant, B., Lallich, S.: A multicriteria decision aid for interestingness measure selection. Technical report, GET / ENST Bretagne (2004)

    Google Scholar 

  25. Lenca, P., Meyer, P., Vaillant, B., Picouet, P.: Aide multicritère à la décision pour évaluer les indices de qualité des connaissances – modélisation des préférences de l’utilisateur. EGC 2003 1(17), 271–282 (2003)

    Google Scholar 

  26. Lenca, P., Meyer, P., Vaillant, B., Picouet, P., Lallich, S.: Évaluation et analyse multicritère des mesures de qualité des règles d’association. RNTI-E-1, 219–246 (2004)

    Google Scholar 

  27. Lerman, I., Azé, J.: Une mesure probabiliste contextuelle discriminante de qualité des règles d’association. EGC 2003 1(17), 247–262 (2003)

    Google Scholar 

  28. Loevinger, J.: A systemic approach to the construction and evaluation of tests of ability. Psychological monographs 61(4) (1947)

    Google Scholar 

  29. Pearson, K.: Mathematical contributions to the theory of evolution. iii. regression, heredity and panmixia. Philosophical Transactions of the Royal Society A (1896)

    Google Scholar 

  30. Peneva, V., Popchev, I.: Properties of the aggregation operators related with fuzzy relations. Fuzzy Sets and Systems (139), 615–633 (2003)

    Article  MathSciNet  MATH  Google Scholar 

  31. Piatetsky-Shapiro, G.: Discovery, analysis and presentation of strong rules. In: Knowledge Discovery in Databases, pp. 229–248. AAAI/MIT Press (1991)

    Google Scholar 

  32. Saminger, S., Mesiar, R., Bodenhofer, U.: Domination of aggregation operators and preservation of transitivity. Internat. J. Uncertain. Fuzziness Knowledge-Based Systems 10(suppl.), 11–35 (2002)

    Article  MathSciNet  MATH  Google Scholar 

  33. Sebag, M., Schoenauer, M.: Generation of rules with certainty and confidence factors from incomplete and incoherent learning bases. In: EKAW 1988, pp. 28–1 – 28–20 (1988)

    Google Scholar 

  34. Tan, P.-N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: Eighth ACM SIGKDD Int. Conf., pp. 32–41 (2002)

    Google Scholar 

  35. Vaillant, B., Lenca, P., Lallich, S.: A clustering of interestingness measures. In: Suzuki, E., Arikawa, S. (eds.) DS 2004. LNCS (LNAI), vol. 3245, pp. 290–297. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  36. Vaillant, B., Picouet, P., Lenca, P.: An extensible platform for rule quality measure benchmarking. In: HCP 2003, pp. 187–191 (2003)

    Google Scholar 

  37. Zhang, T.: Association rules. In: Terano, T., Chen, A.L.P. (eds.) PAKDD 2000. LNCS, vol. 1805. Springer, Heidelberg (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Barthélemy, JP., Legrain, A., Lenca, P., Vaillant, B. (2006). Aggregation of Valued Relations Applied to Association Rule Interestingness Measures. In: Torra, V., Narukawa, Y., Valls, A., Domingo-Ferrer, J. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2006. Lecture Notes in Computer Science(), vol 3885. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11681960_21

Download citation

  • DOI: https://doi.org/10.1007/11681960_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-32780-6

  • Online ISBN: 978-3-540-32781-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics