Skip to main content

Advertisement

Log in

A novel unsupervised fuzzy clustering method for preprocessing of quantitative attributes in association rule mining

  • Published:
Information Technology and Management Aims and scope Submit manuscript

Abstract

In Association rule mining, the quantitative attribute values are converted into Boolean values using fixed intervals. Conventional association rule mining algorithms are then applied to find relations among the attribute values. These intervals may not be concise and meaningful enough for human users to easily obtain non trivial knowledge from those rules discovered. Clustering techniques can be used for segmenting quantitative values into meaningful groups instead of fixed intervals. But the conventional clustering techniques like k-means and c-means require the user to specify the number of clusters and initial cluster centres. This initialization is one of the major challenges of clustering. A novel fuzzy based unsupervised clustering algorithm proposed by the authors is extended to segment quantitative values into fuzzy clusters in this paper. Membership values of quantitative items in the partitioning fuzzy clusters are used with weighted fuzzy rule mining techniques to find natural association rules. This fuzzy based method for handling quantitative attributes is compared with that of fixed intervals and segmenting using conventional k-means clustering method along with Apriori algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Pujari AK (2001) Data mining techniques. University Press, Hyderabad

    Google Scholar 

  2. Hegland M (2003) Data mining–challenges, models, methods and algorithms. Publications of ANU Data Mining group, Draft

  3. Agrawal R, Imieliński T, Swami A (1993) Mining association rules between sets of items in large databases. In: ACM SIGMOD Record vol 22, No. 2. ACM, pp 207–216

  4. Han J, Kamber M (2001) Data mining: concepts and techniques. Elsevier, Delhi

    Google Scholar 

  5. Au WH, Chan KC (1999) FARM: a data mining system for discovering fuzzy association rules. In: IEEE international fuzzy systems conference proceedings, 1999. FUZZ-IEEE’99, vol 3, pp 1217–1222

  6. Agrawal R, Shafer JC (1996) Parallel mining of association rules. IEEE Trans Knowl Data Eng 8:962–969

    Article  Google Scholar 

  7. Binu T, Raju G (2010) A fuzzy threshold based modified clustering algorithm for natural data exploration. Lecture Notes Comput Sci 6122:167–172

    Article  Google Scholar 

  8. Padmaja V Mining weighted association rules. Padmaja Int J Adv Eng Sci Technol (IJAEST) 11(1):153–156

  9. Helm BL (2007) Fuzzy association rules an implementation in R. Master’s Thesis, Vienna University of Economics and Business Administration Vienna

  10. Mangalampalli A, Pudi V (2008) Fuzzy logic-based pre-processing for fuzzy association rule mining. IIIT Hyderabad, India

  11. Kruse R, Borgelt C, Nauck D (1999) Fuzzy data analysis: challenges and perspectives. In: IEEE International Fuzzy systems conference proceedings, 1999. FUZZ-IEEE’99, vol 3, pp 1211–1216

  12. Halkidi M (2000) Quality assessment and uncertainty handling in data mining process. In: Proceedings of EDBT conference, Konstanz, Germany

  13. Wai-Ho A, Chan KCC (2001) Classification with degree of membership: a fuzzy approach. In: Proceedings on IEEE international conference on data mining (ICDM)

  14. Tao F, Murtagh F, Farid M (2003) Weighted association rule mining using weighted support and significance framework. In: Proceedings of the ninth ACM SIGKDD international conference on knowledge discovery and data mining, ACM, pp 661–666

  15. Khan MS, Muyeba M, Tjortjis C, Coenen F (2007) An effective fuzzy healthy association rule mining algorithm (FHARM). In: Proceedings of UKCI, p 14

  16. Muyeba M, Khan M, Coenen F (2009) Fuzzy weighted association rule mining with weighted support and confidence framework. New Front Appl Data Mining 5433:49–61

    Google Scholar 

  17. Muyeba M, Khan MS, Coenen F (2010) Effective mining of weighted fuzzy association rules. Computer 90:9

    Google Scholar 

  18. Cai CH, Fu AW, Cheng CH, Kwong WW (1998) Mining association rules with weighted items. In: Database engineering and applications symposium, 1998. Proceedings on IEEE international IDEAS’98, pp 68–77

  19. Klawonn F, Keller A (1999) Fuzzy clustering based on modified distance measures. Adv Intell Data Anal 1642:291–301

    Google Scholar 

  20. Sankar KP, Mitra P (2002) Data mining in soft computing framework: a survey. IEEE Trans Neural Netw 13:11–18

    Google Scholar 

  21. Hong TP, Lee YC (2008) An overview of mining fuzzy association rules. In: Bustince H, Herrera F, Montero J (eds) Studies in fuzziness and soft computing, vol 220. Springer, Berlin, pp 397–410

  22. Miller RJ, Yang Y (1997) Association rules over interval data. In: ACM SIGMOD Record, vol 26, No. 2, ACM, pp 452–461

  23. Xiong X, Chan KL, Tan KL (2004) Similarity-driven cluster merging method for unsupervised fuzzy clustering. In: Proceedings of the 20th conference on uncertainty in artificial intelligence, AUAI Press, Arlington, VA, pp 611–618

  24. Looney CG (1999) A fuzzy clustering and fuzzy merging algorithm. Reno, NV

  25. Raju G, Singh A, Kumar TS, Thomas B (2008) Integration of fuzzy logic in data mining: a comparative case study. In: Proceedings of international conference on mathematics and computer science, Loyola College, Chennai, pp 128–136

  26. Cox E (2005) Fuzzy modeling and genetic algorithms for data mining and exploration. Elsevier, India

    Google Scholar 

  27. Binu T, Raju G (2008) A modified fuzzy c means algorithm for natural data exploration. J World Acad Sci Eng Technol 49:478–481

    Google Scholar 

  28. Bezdek JC, Ehrlich R, Full W (1984) FCM: the fuzzy c means clustering algorithm. Comput Geosci 10(2):191–203

    Article  Google Scholar 

  29. Thomas B, Raju G (2010) A fuzzy threshold based unsupervised clustering algorithm for natural data exploration. In: IEEE international conference on networking and information technology (ICNIT) 2010, pp 473–477

  30. Fu A, Wong MH, Sze SC, Wong WC, Wong WL, Yu WK (1998) Finding fuzzy sets for the mining of fuzzy association rules for numerical attributes. In: Proceedings of the first international symposium on intelligent data engineering and learning, pp 263–268

  31. Verlinde H, De Cock M, Boute R (2005) Fuzzy versus quantitative association rules: a fair data-driven comparison. IEEE Trans Syst Man Cybern Part B Cybern 36(3):679–684

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Binu Thomas.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Thomas, B., Raju, G. A novel unsupervised fuzzy clustering method for preprocessing of quantitative attributes in association rule mining. Inf Technol Manag 15, 9–17 (2014). https://doi.org/10.1007/s10799-013-0168-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10799-013-0168-7

Keywords

Navigation