Aggregation of Valued Relations Applied to Association Rule Interestingness Measures

Barthélemy, Jean-Pierre; Legrain, Angélique; Lenca, Philippe; Vaillant, Benoît

doi:10.1007/11681960_21

Jean-Pierre Barthélemy²²,
Angélique Legrain²²,
Philippe Lenca²² &
…
Benoît Vaillant^22,23

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3885))

Included in the following conference series:

International Conference on Modeling Decisions for Artificial Intelligence

849 Accesses
4 Citations

Abstract

One of the concerns of knowledge discovery in databases is the production of association rules. An association rule A \(\longrightarrow\) B defines a relationship between two sets of attributes A and B, caracterising the data studied. Such a rule means that objects sharing attributes of A will “likely” have those contained in B. Yet, this notion of “likeliness” depends on the datamining context.

Many interestingness measures have been introduced in order to quantify this likeliness. This panel of measures is heterogeneous and the ranking of extracted rules, according to measures, may differ largely.

This contribution explores a new approach for assessing the quality of rules: aggregating valued relations. For each measure, a valued relation is built out of the numerical values it takes on the rules, and represents the preference of a rule over another. The aim in using such tools is to take into account the intensity of preference expressed by various measures, and should reduce incomparability issues related to differences in their co-domains. It also has the advantage of relating the numerical nature of measures compared to pure binary approaches.

We studied several aggregation operators. In this contribution we discuss results obtained on a toy example using the simplest of them.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: ACM SIGMOD Int. Conf., pp. 207–216 (1993)
Google Scholar
Arrow, K.J.: Social Choice and Individual Values. Cowles Foundations and Wiley (1951)
Google Scholar
Azé, J., Kodratoff, Y.: Evaluation de la résistance au bruit de quelques mesures d’extraction de règles d’assocation. EGC 2002 1(4), 143–154 (2002)
Google Scholar
Blake, C., Merz, C.: UCI repository of machine learning databases (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Bodenhofer, U., Demirci, M.: Strict fuzzy orderings in a similarity-based setting. In: Proc. Joint 4th Conf. EUSFLAT and 11^e LFA, pp. 297–302 (2005)
Google Scholar
Bodenhofer, U., Küng, J.: Fuzzy orderings in flexible query answering systems. Soft Computing 8(7), 512–522 (2004)
Article MATH Google Scholar
Borgelt, C., Kruse, R.: Induction of association rules: Apriori implementation. In: 15th Conf. on Computational Statistics (2002)
Google Scholar
Brans, J., Mareschal, B., Vincke, P.: PROMETHEE: A New Family of Outranking Methods in MCDM. In: IFORS 1984 (1984)
Google Scholar
Brin, S., Motwani, R., Ullman, J.D., Tsur, S.: Dynamic itemset counting and implication rules for market basket data. In: Peckham, J. (ed.) ACM SIGMOD 1997 Int. Conf. on Management of Data, pp. 255–264 (1997)
Google Scholar
Church, K.W., Hanks, P.: Word association norms, mutual information an lexicography. Computational Linguistics 16(1), 22–29 (1990)
Google Scholar
Cohen, J.: A coefficient of agreement for nominal scale. Educational and Psychological Measurement 20, 37–46 (1960)
Article Google Scholar
Fishburn, P.: Stochastic utility. In: Handbook of Utility Theory, vol. 1, pp. 273–319. Kluwer Academic, Dordrecht (1998)
Google Scholar
Fodor, J., Roubens, M.: Fuzzy preference modelling and multicriteria decision support. Kluwer Academic Publishers, Dordrecht (1994)
Book MATH Google Scholar
Good, I.J.: The estimation of probabilities: An essay on modern bayesian methods. The MIT Press, Cambridge (1965)
MATH Google Scholar
Gras, R., Almouloud, S.A., Bailleuil, M., Larher, A., Polo, M., Ratsimba-Rajohn, H., Totohasina, A.: L’implication Statistique, Nouvelle Méthode Exploratoire de Données. Application à la Didactique, Travaux et Thèses. La Pensée Sauvage (1996)
Google Scholar
Gras, R., Kuntz, P., Couturier, R., Guillet, F.: Une version entropique de l’intensité d’implication pour les corpus volumineux. EGC 2001 1(1-2), 69–80 (2001)
Google Scholar
Guigues, J.L., Duquenne, V.: Familles minimales d’implications informatives résultant d’un tableau de données binaires. Math. Sci. hum. 25(95), 5–18 (1986)
Google Scholar
International Business Machines. IBM Intelligent Miner User’s Guide, Version 1 Release 1, SH12-6213-00 (1996)
Google Scholar
Jeffreys, H.J.: Some tests of significance treated by the theory of probability. In: Proc. of the Cambridge Philosophical Society, vol. 31, pp. 203–222 (1935)
Google Scholar
Lallich, S., Teytaud, O.: Évaluation et validation de l’intérêt des règles d’association. RNTI-E-1, 193–217 (2004)
Google Scholar
Legrain, A.: Agrégation de mesures de qualité de règles d’association, rapport de DEA MIASH (2004)
Google Scholar
Lehn, R., Guillet, F., Kuntz, P., Briand, H., Philippé, J.: Felix: An interactive rule mining interface in a KDD process. In: HCP 1999, pp. 169–174 (1999)
Google Scholar
Lenca, P., Meyer, P., Picouet, P., Vaillant, B., Lallich, S.: Critères d’évaluation des mesures de qualité en ecd. RNTI (1), 123–134 (2003)
Google Scholar
Lenca, P., Meyer, P., Vaillant, B., Lallich, S.: A multicriteria decision aid for interestingness measure selection. Technical report, GET / ENST Bretagne (2004)
Google Scholar
Lenca, P., Meyer, P., Vaillant, B., Picouet, P.: Aide multicritère à la décision pour évaluer les indices de qualité des connaissances – modélisation des préférences de l’utilisateur. EGC 2003 1(17), 271–282 (2003)
Google Scholar
Lenca, P., Meyer, P., Vaillant, B., Picouet, P., Lallich, S.: Évaluation et analyse multicritère des mesures de qualité des règles d’association. RNTI-E-1, 219–246 (2004)
Google Scholar
Lerman, I., Azé, J.: Une mesure probabiliste contextuelle discriminante de qualité des règles d’association. EGC 2003 1(17), 247–262 (2003)
Google Scholar
Loevinger, J.: A systemic approach to the construction and evaluation of tests of ability. Psychological monographs 61(4) (1947)
Google Scholar
Pearson, K.: Mathematical contributions to the theory of evolution. iii. regression, heredity and panmixia. Philosophical Transactions of the Royal Society A (1896)
Google Scholar
Peneva, V., Popchev, I.: Properties of the aggregation operators related with fuzzy relations. Fuzzy Sets and Systems (139), 615–633 (2003)
Article MathSciNet MATH Google Scholar
Piatetsky-Shapiro, G.: Discovery, analysis and presentation of strong rules. In: Knowledge Discovery in Databases, pp. 229–248. AAAI/MIT Press (1991)
Google Scholar
Saminger, S., Mesiar, R., Bodenhofer, U.: Domination of aggregation operators and preservation of transitivity. Internat. J. Uncertain. Fuzziness Knowledge-Based Systems 10(suppl.), 11–35 (2002)
Article MathSciNet MATH Google Scholar
Sebag, M., Schoenauer, M.: Generation of rules with certainty and confidence factors from incomplete and incoherent learning bases. In: EKAW 1988, pp. 28–1 – 28–20 (1988)
Google Scholar
Tan, P.-N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: Eighth ACM SIGKDD Int. Conf., pp. 32–41 (2002)
Google Scholar
Vaillant, B., Lenca, P., Lallich, S.: A clustering of interestingness measures. In: Suzuki, E., Arikawa, S. (eds.) DS 2004. LNCS (LNAI), vol. 3245, pp. 290–297. Springer, Heidelberg (2004)
Chapter Google Scholar
Vaillant, B., Picouet, P., Lenca, P.: An extensible platform for rule quality measure benchmarking. In: HCP 2003, pp. 187–191 (2003)
Google Scholar
Zhang, T.: Association rules. In: Terano, T., Chen, A.L.P. (eds.) PAKDD 2000. LNCS, vol. 1805. Springer, Heidelberg (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Technopôle de Brest Iroise, CS 83818, GET / ENST Bretagne – Cnrs Umr 2872 TAMCIC, 29238 Cedex 3, Brest, France
Jean-Pierre Barthélemy, Angélique Legrain, Philippe Lenca & Benoît Vaillant
UBS – IUT de Vannes – Département STID, 8, rue Montaigne, BP 561, 56017, Vannes, France
Benoît Vaillant

Authors

Jean-Pierre Barthélemy
View author publications
You can also search for this author in PubMed Google Scholar
Angélique Legrain
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Lenca
View author publications
You can also search for this author in PubMed Google Scholar
Benoît Vaillant
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

IIIA, Artificial Intelligence Research Institute CSIC, Spanish National Research Council, Campus UAB s/n, Catalonia, 08193, Bellaterra, Spain
Vicenç Torra
Toho Gakuen, 3-1-10 Naka, 186-0004, Tokyo, Kunitachi, Japan
Yasuo Narukawa
Department of Computer Science and Mathematics Intelligent Technologies for Advanced Knowledge Acquisition Research Group,, University Rovira i Virgili, Av. Països Catalans, 26., 43007, Tarragona, Catalonia, Spain
Aïda Valls
Department of Computer Engineering and Mathematics, Universitat Rovira i Virgili, UNESCO Chair in Data Privacy, Av. Països Catalans 26, E-43007, Tarragona, Catalonia, Spain
Josep Domingo-Ferrer

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Barthélemy, JP., Legrain, A., Lenca, P., Vaillant, B. (2006). Aggregation of Valued Relations Applied to Association Rule Interestingness Measures. In: Torra, V., Narukawa, Y., Valls, A., Domingo-Ferrer, J. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2006. Lecture Notes in Computer Science(), vol 3885. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11681960_21

Download citation

DOI: https://doi.org/10.1007/11681960_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32780-6
Online ISBN: 978-3-540-32781-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics