Skip to main content
Log in

Visualizing the behavior and some symmetry properties of Bayesian confirmation measures

  • Published:
Data Mining and Knowledge Discovery Aims and scope Submit manuscript

Abstract

Bayesian confirmation measures, a special class of interestingness measures, are functions usually adopted in ranking inductive rules generated by data mining methods such as association rule mining, decision trees, rough sets. Till now a plethora of measures have been defined in many different ways. Identifying and effectively distinguishing among them is a difficult task. In this paper we propose a unified visual approach aimed at comparing and classifying a large subset of Bayesian confirmation measures (those satisfying the initial and final probability dependence condition). We first reduce the set of variables in their analytical expression to only two, thus allowing to draw their contour lines on the plane. We observe that two dimensional contour lines plots represent a sort of fingerprints of the confirmation measures and, therefore, this geometric visualization can be used as an effective tool in order to investigate properties and behavior of the measures. We highlight the potential of this approach not only to study known measures but also in order to invent new measures satisfying given required characteristics. We finally define, following the geometry of the plots, a new set of symmetry properties of confirmation measures and describe geometrically four classical symmetries.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19

Similar content being viewed by others

Notes

  1. IFPD stands for initial and final probability dependence (Crupi et al. 2010)

  2. ROC stands for receiver operating characteristics (see e.g. Fawcett 2006)

  3. Conjunction effects are “situations in which two hypotheses H1 and H2 are both confirmed by some evidence E, and in which the conjunction \(H1\wedge H2\) is even more highly confirmed” (Atkinson 2012).

  4. Even if in the general definition of confirmation space, i.e. the domain of a Bayesian confirmation measure c(xy), we stated that it corresponds to the open square (0,1)x(0,1) in many cases it can be extended to the square (0,1]x(0,1) by including also the points of the line x=1, i.e. \(P(H{\vert }E)=1\).

  5. Flach (2003) and Fürnkranz and Flach (2003a) discovered rather similar geometric symmetries of evaluation metrics on the ROC space but mentioned them without an explicit definition.

  6. The logarithm has been used in order to meet Bayesian confirmation measures sign condition.

References

  • Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of the 1993 ACM SIGMOD international conference on management of data (SIGMOD ’93). ACM, New York, pp 207–216

  • Atkinson D (2012) Confirmation and justification. A commentary on Shogenji’s measure. Synthese 184(1):49–61

    Article  MathSciNet  Google Scholar 

  • Błaszczyński J, Greco S, Matarazzo B, Słowiński R, Szeląg M (2012) jMAF-dominance-based rough set data analysis framework, rough sets and intelligent systems—Professor Zdzisław Pawlak in memoriam., Intelligent systems reference librarySpringer, Berlin

    Google Scholar 

  • Błaszczyński J, Słowiński R, Szeląg M (2011) Sequential covering rule induction algorithm for variable consistency rough set approaches. Inform Sci 181(5):987–1002

    Article  MathSciNet  Google Scholar 

  • Carnap R (1950) Logical foundations of probability. University of Chicago Press, Chicago

    MATH  Google Scholar 

  • Celotto E (2014) Sensitivity and simmetry of Confirmation Measures, Working Paper n. 22/2014, October 2014. Department Management, Università Ca’ Foscari Venezia (link: http://virgo.unive.it/wpideas/storage/2014wp22.pdf), ISSN: 2239-2734

  • Celotto E, Ellero A, Ferretti P (2015) Conveying tourist ratings into an overall destination evaluation. Proc Soc Behav Sci 188:35–41

    Article  Google Scholar 

  • Christensen D (1999) Measuring confirmation. J Philos 96(9):437–461

    Article  MathSciNet  Google Scholar 

  • Crupi V, Festa R, Buttasi C (2010) Towards a grammar of Bayesian confirmation. In: Suárez M, Dorato M, Rédei M (eds) Epistemology and methodology of science. Springer, Dordrecht, pp 73–93

    Google Scholar 

  • Crupi V, Tentori K (2014) State of the field: measuring information and confirmation. Stud Hist Philos Sci 47(2014):81–90

    Article  Google Scholar 

  • Crupi V, Tentori K, Gonzalez M (2007) On Bayesian measures of evidential support: theoretical and empirical issues. Philos Sci 74(2):229–252

    Article  MathSciNet  Google Scholar 

  • Eells E, Fitelson B (2002) Symmetries and asymmetries in evidential support. Philos Stud 107(2):129–142

    Article  Google Scholar 

  • Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27(8):861–874

    Article  MathSciNet  Google Scholar 

  • Finch H-A (1999) Confirming power of observations metricized for decisions among hypotheses. Philos Sci 27, pp 293–207 (part I), pp 391–404 (part II)

  • Fitelson B (2001) Studies in Bayesian confirmation theory. Ph.D. Thesis, University of Wisconsin, Madison

  • Fitelson B (2007) Likelihoodism, Bayesianism, and relational confirmation. Synthese 156(3):473–489

    Article  MathSciNet  Google Scholar 

  • Flach P-A (2003) The geometry of ROC space: understanding machine learning metrics through ROC isometrics. In: Proceedings of the 20th International conference on machine learning (ICML’03), AAAI Press, pp 194–201

  • Fürnkranz J (1999) Separate-and-conquer rule learning. Artif Intell Rev 13(1):3–54

    Article  Google Scholar 

  • Fürnkranz J (2005) From local to global patterns: evaluation issues in rule learning algorithms. In: Proceedings of the 2004 international conference on Local Pattern Detection (LPD’04), Springer, Berlin, pp 20–38

    Google Scholar 

  • Fürnkranz J, Flach P-A (2003a) An analysis of rule evaluation metrics. In: Proceedings of the 20th international conference on machine learning (ICML’03), AAAI Press, Washington, DC, pp 202–209

  • Fürnkranz J, Flach P-A (2003b) An analysis of rule learning heuristics. Department of Computer Science, University of Bristol, CSTR-03-002, February 2003

  • Fürnkranz J, Flach P-A (2005) ROC ‘n’ rule learning—towards a better understanding of covering algorithms. Mach Learn 58(1):39–77

    Article  Google Scholar 

  • Geng L, Hamilton H-J (2006) Interestingness measures for data mining: a survey. ACM Comput Surv 38(3):1–32

    Article  Google Scholar 

  • Glass D-H (2013) Confirmation measures of association rule interestingness. Knowl Based Syst 44:65–77

    Article  Google Scholar 

  • Good I-J (1950) Probability and the weighing of evidence. Hafners, New York

    MATH  Google Scholar 

  • Good I-J (1985) Weight of Evidence: A Brief Survey, Bayesian Statistics 2, In: J. M. Bernardo, M. H. DeGroot, D. V. Lindley and A. F. M. Smith (eds). Proceedings of the valencia international meetings on Bayesian statistics, , Elsevier Science Publishers B.V, Auckland, pp 249–270

  • Greco S, Matarazzo B, Słowiński R (2008) Parameterized rough set model using rough membership and Bayesian confirmation measures. Int J Approx Reason 49(2):285–300

    Article  MathSciNet  Google Scholar 

  • Greco S, Matarazzo B, Slowinski R, Stefanowski J (2001) An algorithm for induction of decision rules consistent with dominance principle, In: Revised papers from the second international conference on rough sets and current trends in computing (RSCTC ’00), Springer, pp 304–313

  • Greco S, Pawlak Z, Słowiński R (2004) Can Bayesian confirmation measures be useful for rough set decision rules? Eng Appl Artif Intell 17(4):345–361

    Article  Google Scholar 

  • Greco S, Słowiński R, Szczęch I (2012) Properties of rule interestingness measures and alternative approaches to normalization of measures. Inform Sci 216:1–16

    Article  MathSciNet  Google Scholar 

  • Kemeny J, Oppenheim P (1952) Degrees of factual support. Philos Sci 19:307–324

    Article  Google Scholar 

  • Keynes J-M (1921) A treatise on probability. Macmillan, London

    MATH  Google Scholar 

  • Lavrač N, Flach P-A, Zupan B (1999) Rule evaluation measures: a unifying view. In: Proceedings of the 9th international workshop on inductive logic programming (ILP ’99), Springer, pp 174–185

  • Lenca P, Meyer P, Vaillant B, Lallich S (2008) On selecting interestingness measures for association rules: user oriented description and multiple criteria decision aid. Eur J Oper Res 184(2):610–626

    Article  Google Scholar 

  • Mortimer H (1988) The logic of induction. Prentice Hall, paramus

    MATH  Google Scholar 

  • Nozick R (1981) Philosophical explanations. Clarendon Press, Oxford

    Google Scholar 

  • Pawlak Z (1991) Rough sets: theoretical aspects of reasoning about data. Kluwer Academic Publishers, Boston

    Book  Google Scholar 

  • Popper K-R (1959) The logic of scientific discovery. Hutchinson, London

    MATH  Google Scholar 

  • Quinlan J-R (1986) Induction of decision trees. Mach Learn 1(1):81–106

    Google Scholar 

  • Rescher N (1958) A theory of evidence. Philos Sci 25:83–94

    Article  Google Scholar 

  • Rips L-J (2001) Two kinds of reasoning. Psychol Sci 12:129–134

    Article  Google Scholar 

  • Shogenji T (2012) The degree of epistemic justification and the conjunction fallacy. Synthese 184(1):29–48

    Article  MathSciNet  Google Scholar 

  • Susmaga R, Szczęch I (2013a) The Property of \(\chi ^{2}_{\{01\}}\)-Concordance for Bayesian confirmation measures. In: Proceedings of the 10th international conference modelling decisions for artificial intelligence (MDAI 2013), LNCS, Springer, vol 8234, pp 226–236

  • Susmaga R, Szczęch I (2013) Visualization of interestingness measures. In: Proceedings of the 6th language & technology conference: human language technologies as a challenge for computer science and linguistics. Fundacja UAM, Poznań, pp 95–99

  • Tan P-N, Kumar V, Srivastava J (2004) Selecting the right objective measure for association analysis. Inform Syst 29(4):293–313

    Article  Google Scholar 

  • Tentori K, Crupi V, Bonini N, Osherson D (2007) Comparison of confirmation measures. Cognition 103:107–119

    Article  Google Scholar 

  • Tew C, Giraud-Carrier C, Tanner K, Burton S (2014) Behavior-based clustering and analysis of interestingness measures for association rule mining. Data Min Knowl Discov 28(4):1004–1045

    Article  MathSciNet  Google Scholar 

  • Todhunter I (1865) A history of mathematical theory of probability from the time of Pascal to that of Laplace. Macmillan, London reprinted, (1949) 1965. Chelsea Publishing Company, New York

  • Vilalta R, Oblinger D (2000) A quantification of distance-bias between evaluation metrics in classification. In: Proceedings of the 17th international conference on machine learning (ICML-00) Stanford. Morgan Kaufmann, pp 1087–1094

  • Yao Y-Y, Zhong N (1999) An analysis of quantitative measures associated with rules, In: Proceedings of the third Pacific-Asia conference on methodologies for knowledge discovery and data mining (PAKDD ’99), Springer, pp 479–488

  • Yule G-U (1900) On the association of attributes in statistics: with illustrations from the material of the Childhood Society, & c. Philos Trans R Soc Lond A 194:257–319

    Article  Google Scholar 

  • Yule G-U (1912) On the methods of measuring the association between two attributes. J R Stat Soc 75:579–652

    Article  Google Scholar 

  • Zhou B, Yao Y-Y (2014) Feature selection based on confirmation-theoretic rough sets, In: Proceedings of the 9th international conference on rough sets and current trends in computing (RSCTC 2014), Lecture Notes in Computer Science, vol 8536, pp 181–188

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Emilio Celotto.

Additional information

Responsible editor: Johannes Fürnkranz.

The present paper is part of the research project Development of a Web marketing tool for assessing and positioning tourist-cultural destinations jointly supported by the Department of Management of the University Ca’ Foscari-Venezia, CISET-International Centre for Studies on Tourism Economics and Fondazione Ca’ Foscari Venezia.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Celotto, E. Visualizing the behavior and some symmetry properties of Bayesian confirmation measures. Data Min Knowl Disc 31, 739–773 (2017). https://doi.org/10.1007/s10618-016-0487-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10618-016-0487-5

Keywords

Navigation