Skip to main content
Log in

Mining item-item and between-set correlated association rules

  • Published:
Journal of Zhejiang University SCIENCE C Aims and scope Submit manuscript

Abstract

To overcome the failure in eliminating suspicious patterns or association rules existing in traditional association rules mining, we propose a novel method to mine item-item and between-set correlated association rules. First, we present three measurements: the association, correlation, and item-set correlation measurements. In the association measurement, the all-confidence measure is used to filter suspicious cross-support patterns, while the all-item-confidence measure is applied in the correlation measurement to eliminate spurious association rules that contain negatively correlated items. Then, we define the item-set correlation measurement and show its corresponding properties. By using this measurement, spurious association rules in which the antecedent and consequent item-sets are negatively correlated can be eliminated. Finally, we propose item-item and between-set correlated association rules and two mining algorithms, I&ISCoMine_AP and I&ISCoMine_CT. Experimental results with synthetic and real retail datasets show that the proposed method is effective and valid.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Agrawal, R., Imielinski, T., Swami, A., 1993. Mining association rules between sets of items in large databases. ACM SIGMOD Rec., 22(2):207–216. [doi:10.1145/170035.170 072]

    Article  Google Scholar 

  • Alvarez, S.A., 2003. Chi-Squared Computation for Association Rules: Preliminary Results. Technical Report No. BC-CS-2003-01, Computer Science Department, Boston College, MA.

    Google Scholar 

  • Brin, S., Motwani, R., Silverstein, C., 1997. Beyond market baskets: generalizing association rules to correlations. ACM SIGMOD Rec., 26(2):256–276. [doi:10.1145/253260.253327]

    Google Scholar 

  • Hahsler, M., Hornik, K., 2007. New probabilistic interest measures for association rules. Intell. Data Anal., 11(5):437–455.

    Google Scholar 

  • Han, J., Pei, J., Yin, Y., Mao, R., 2004. Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Min. Knowl. Disc., 8(1):53–87. [doi:10.1023/B:DAMI.0000005258.31418.83]

    Article  MathSciNet  Google Scholar 

  • IBM Almaden Research Center, 2009. Quest Synthetic Data Generation Code. Available from http://www.almaden.ibm.com/cs/projects/iis/hdb/Projects/data_mining/datasets/syndata.html [Accessed on Jan. 21, 2009].

  • Kenett, R.S., Salini, S., 2008a. Relative linkage disequilibrium: a new measure for association rules. LNCS, 5077:189–199. [doi:10.1007/978-3-540-70720-2_15]

    Google Scholar 

  • Kenett, R.S., Salini, S., 2008b. Relative linkage disequilibrium applications to aircraft accidents and operational risks. IEEE Trans. Mach. Learn. Data Min., 1(2):83–96.

    Google Scholar 

  • Kim, W.Y., Lee, Y.K., Han, J., 2004. CCMine: efficient mining of confidence-closed correlated patterns. LNAI, 3056:569–579. [doi:10.1007/b97861]

    Google Scholar 

  • Lee, Y.K., Kim, W.Y., Cai, Y.D., Han, J., 2003. CoMine: Efficient Mining of Correlated Patterns. Proc. 3rd IEEE Int. Conf. on Data Mining, p.581–584. [doi:10.1109/ICDM. 2003.1250982]

  • Lenca, P., Meyer, P., Vaillant, B., Lallich, S., 2008. On selecting interestingness measures for association rules: user oriented description and multiple criteria decision aid. Eur. J. Oper. Res., 184(2):610–626. [doi:10.1016/j.ejor.2006. 10.059]

    Article  MATH  Google Scholar 

  • Liu, B., Hsu, W., Ma, Y., 1999. Pruning and Summarizing the Discovered Associations. ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, p.125–134. [doi:10.1145/312129.312216]

  • Lu, H., Han, J., Feng, L., 1998. Stock Movement Prediction and N-dimensional Inter-transaction Association Rules. Proc. 3rd ACM-SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery, p.1–7.

  • Omiecinski, E.R., 2003. Alternative interesting measures for mining associations in databases. IEEE Trans. Knowl. Data Eng., 15(1):57–69. [doi:10.1109/TKDE.2003.1161 582]

    Article  MathSciNet  Google Scholar 

  • Ozden, B., Ramaswamy, S., Silberschatz, A., 1998. Cyclic Association Rules. Proc. 14th Int. Conf. on Data Engineering, p.412–421. [doi:10.1109/ICDE.1998.655804]

  • Palshikar, G.K., Kale, M.S., Apte, M.M., 2007. Association rules mining using heavy itemsets. Data Knowl. Eng., 61(1):93–113. [doi:10.1016/j.datak.2006.04.009]

    Article  Google Scholar 

  • Qin, M., Hwang, K., 2004. Frequent Episode Rules for Internet Anomaly Detection. Proc. 3rd IEEE Int. Symp. on Network Computing and Applications, p.161–168. [doi:10.1109/NCA.2004.1347773]

  • Ruggeri, F., Kenett, R.S., Faltin, F.W., 2007. Encyclopedia of Statistics in Quality and Reliability. Wiley, Chichester, England.

    MATH  Google Scholar 

  • Shen, B., Yao, M., 2009a. Mining associated and item-item correlated frequent patterns. J. Zhejiang Univ. (Eng. Sci.), 43(12):2171–2177 (in Chinese). [doi:10.3785/j.issn.1008-973X.2009.12.008]

    Google Scholar 

  • Shen, B., Yao, M., 2009b. A new kind of dynamic association rule and its mining algorithms. Control Dec., 24(9):1310–1315 (in Chinese).

    Google Scholar 

  • Shen, B., Yao, M., Wu, Z.H., Gao, Y.J., 2010. Mining dynamic association rules with comments. Knowl. Inform. Syst., 23(1):73–98. [doi:10.1007/s10115-009-0207-1]

    Article  Google Scholar 

  • Tan, P.N., Kumar, V., Srivastava, J., 2002. Selecting the Right Interestingness Measure for Association Patterns. Proc. 8th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, p.32–41. [doi:10.1145/775047.775053]

  • Xiong, H., Tan, P.N., Kumar, V., 2006. Hyperclique pattern discovery. Data Min. Knowl. Disc., 13(2):219–242. [doi:10.1007/s10618-006-0043-9]

    Article  MathSciNet  Google Scholar 

  • Zhang, S., Chen, F., Wu, X., Zhang, C., Wang, R., 2006. Identifying Bridging Rules Between Conceptual Clusters. Proc. 12th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, p.815–820. [doi:10.1145/1150402.1150509]

  • Zhang, S., Chen, F., Jin, Z., Wang, R., 2009. Mining class-bridge rules based on rough sets. Exp. Syst. Appl., 36(3):6453–6460. [doi:10.1016/j.eswa.2008.07.044]

    Article  Google Scholar 

  • Zhou, Z.M., Wu, Z.H., Wang, C.S., Feng, Y., 2006a. Mining both associated and correlated patterns. LNCS, 3994:468–475. [doi:10.1007/11758549]

    Google Scholar 

  • Zhou, Z.M., Wu, Z.H., Wang, C.S., Feng, Y., 2006b. Efficiently mining mutually and positively correlated patterns. LNCS, 4093:118–125. [doi:10.1007/11811305]

    Google Scholar 

  • Zhou, Z.M., Wu, Z.H., Wang, C.S., Feng, Y., 2006c. Efficiently mining both association and correlation rules. LNCS, 4223:369–372. [doi:10.1007/11881599]

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Min Yao.

Additional information

Project supported by the National Natural Science Foundation of China (Nos. 10876036 and 70871111) and the Ningbo Natural Science Foundation, China (No. 2010A610113)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shen, B., Yao, M., Xie, Lj. et al. Mining item-item and between-set correlated association rules. J. Zhejiang Univ. - Sci. C 12, 96–109 (2011). https://doi.org/10.1631/jzus.C0910717

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1631/jzus.C0910717

Key words

CLC number

Navigation