New Similarity Rules for Mining Data

Di Gesù, Vito; Friedman, Jerome H.

doi:10.1007/11731177_26

Vito Di Gesù^20,21 &
Jerome H. Friedman²¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3931))

Included in the following conference series:

848 Accesses

Abstract

Variability and noise in data-sets entries make hard the discover of important regularities among association rules in mining problems. The need exists for defining flexible and robust similarity measures between association rules. This paper introduces a new class of similarity functions, SF’s, that can be used to discover properties in the feature space X and to perform their grouping with standard clustering techniques. Properties of the proposed SF’s are investigated and experiments on simulated data-sets are also shown to evaluate the grouping performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

A Framework for Interestingness Measures for Association Rules with Discrete and Continuous Attributes Based on Statistical Validity

An innovative clustering approach utilizing frequent item sets

Article 26 April 2024

Novel fuzzy similarity measures and their applications in pattern recognition and clustering analysis

Article 06 July 2023

References

Khöler, W., Wallach, H.: Figural after-effects: an investigation of visual processes. In: Proc. Amer. phil. Soc., vol. 88, pp. 269–357 (1944)
Google Scholar
Tenenbaum, J.B.: Rules and Similarity in Concept Learning. In: Solla, S.A., Leen, T.K., Müuller, K.-R. (eds.) Advances in Neural Information Processing Systems, vol. 12, pp. 59–65. MIT Press, Cambridge (2000)
Google Scholar
Di Gesú, V., Roy, S.: Pictorial indexes and soft image distances. In: Pal, N.R., Sugeno, M. (eds.) AFSS 2002. LNCS, vol. 2275, pp. 200–215. Springer, Heidelberg (2002)
Google Scholar
Di Gesú, V., Lo Bosco, G.: A Genetic Integrated Fuzzy Classifier. Pattern Recognition Letters 26(4), 411–420 (2005)
Article Google Scholar
Pearce, M., Blake, D.J., Tinsley, J.M., Byth, B.C., Campbell, L., Monaco, A.P., Davies, K.E.: The utrophin and dystrophin genes share similarities in genomic structure. In: Human Molecular Genetics, vol. 2, pp. 1765–1772. Oxford Univ.Press, Oxford (1993)
Google Scholar
Morgenstern, B.: DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment. Bioinformatics 15, 211–218 (1999)
Article Google Scholar
Varré, J.S., Delahaye, J.P., Rivals, E.: Transformation Distances: a family of dissimilarity measures based on movement of segments. Bioinformatics 15, 194–202 (1999)
Article Google Scholar
Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, Washington D.C (1993)
Google Scholar
Tsumoto, S., Hirano, S.: Visualization of Rules Similarity using Multidimensional Scaling. In: Proceedings of the Third IEEE International Conference on Data Mining (ICDM 2003) (2003)
Google Scholar
Lent, B., Swami, A., Widom, J.: Clustering Association Rules. In: ICDE, pp. 220–231 (1997)
Google Scholar
Gupta, G., Strehl, A., Ghosh, J.: Distance based clustering of association rules. In: Intelligent Engineering Systems Through Articial Neural Networks (Proceedings of ANNIE 1999), vol. 9, pp. 759–764. ASME Press (1999)
Google Scholar
Friedman, J.H., Fisher, I.: Bump Hunting in High-Dimensional Data, Stanford University, Department of Statistics, Technical Report (1998)
Google Scholar
Friedman, J.H., Popescu, B.E.: Importance Sampled Learning Ensamble, Stanford University, Department of Statistics, Technical Report (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Università di Palermo, DMA, Italy
Vito Di Gesù
Department of Statistics, Stanford University, Stanford, CA, USA
Vito Di Gesù & Jerome H. Friedman

Authors

Vito Di Gesù
View author publications
You can also search for this author in PubMed Google Scholar
Jerome H. Friedman
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimentimento di Scienze dell’Informazione, via Comelico 39/41, 20135, Milano, Italy
Bruno Apolloni
Dipartimento di Fisica “E.R. Caianiello”, Università degli Studi di Salerno, Via S. Allende, 84081, Baronissi (SA), Italy
Maria Marinaro
Department of Mathematics and Computer Science, University of Catania, Viale A. Doria 6, 95125, Catania, Italy
Giuseppe Nicosia
Department of Mathematics and Informatics, University of Salerno, Via Ponte Don Melillo, 84084, Fisciano (SA), Italy
Roberto Tagliaferri

Copyright information

About this paper

Cite this paper

Di Gesù, V., Friedman, J.H. (2006). New Similarity Rules for Mining Data. In: Apolloni, B., Marinaro, M., Nicosia, G., Tagliaferri, R. (eds) Neural Nets. WIRN NAIS 2005 2005. Lecture Notes in Computer Science, vol 3931. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11731177_26

Download citation

DOI: https://doi.org/10.1007/11731177_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33183-4
Online ISBN: 978-3-540-33184-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

New Similarity Rules for Mining Data

Abstract

Access this chapter

Preview

Similar content being viewed by others

A Framework for Interestingness Measures for Association Rules with Discrete and Continuous Attributes Based on Statistical Validity

An innovative clustering approach utilizing frequent item sets

Novel fuzzy similarity measures and their applications in pattern recognition and clustering analysis

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

New Similarity Rules for Mining Data

Abstract

Access this chapter

Preview

Similar content being viewed by others

A Framework for Interestingness Measures for Association Rules with Discrete and Continuous Attributes Based on Statistical Validity

An innovative clustering approach utilizing frequent item sets

Novel fuzzy similarity measures and their applications in pattern recognition and clustering analysis

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation