Abstract
To overcome the failure in eliminating suspicious patterns or association rules existing in traditional association rules mining, we propose a novel method to mine item-item and between-set correlated association rules. First, we present three measurements: the association, correlation, and item-set correlation measurements. In the association measurement, the all-confidence measure is used to filter suspicious cross-support patterns, while the all-item-confidence measure is applied in the correlation measurement to eliminate spurious association rules that contain negatively correlated items. Then, we define the item-set correlation measurement and show its corresponding properties. By using this measurement, spurious association rules in which the antecedent and consequent item-sets are negatively correlated can be eliminated. Finally, we propose item-item and between-set correlated association rules and two mining algorithms, I&ISCoMine_AP and I&ISCoMine_CT. Experimental results with synthetic and real retail datasets show that the proposed method is effective and valid.
Similar content being viewed by others
References
Agrawal, R., Imielinski, T., Swami, A., 1993. Mining association rules between sets of items in large databases. ACM SIGMOD Rec., 22(2):207–216. [doi:10.1145/170035.170 072]
Alvarez, S.A., 2003. Chi-Squared Computation for Association Rules: Preliminary Results. Technical Report No. BC-CS-2003-01, Computer Science Department, Boston College, MA.
Brin, S., Motwani, R., Silverstein, C., 1997. Beyond market baskets: generalizing association rules to correlations. ACM SIGMOD Rec., 26(2):256–276. [doi:10.1145/253260.253327]
Hahsler, M., Hornik, K., 2007. New probabilistic interest measures for association rules. Intell. Data Anal., 11(5):437–455.
Han, J., Pei, J., Yin, Y., Mao, R., 2004. Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Min. Knowl. Disc., 8(1):53–87. [doi:10.1023/B:DAMI.0000005258.31418.83]
IBM Almaden Research Center, 2009. Quest Synthetic Data Generation Code. Available from http://www.almaden.ibm.com/cs/projects/iis/hdb/Projects/data_mining/datasets/syndata.html [Accessed on Jan. 21, 2009].
Kenett, R.S., Salini, S., 2008a. Relative linkage disequilibrium: a new measure for association rules. LNCS, 5077:189–199. [doi:10.1007/978-3-540-70720-2_15]
Kenett, R.S., Salini, S., 2008b. Relative linkage disequilibrium applications to aircraft accidents and operational risks. IEEE Trans. Mach. Learn. Data Min., 1(2):83–96.
Kim, W.Y., Lee, Y.K., Han, J., 2004. CCMine: efficient mining of confidence-closed correlated patterns. LNAI, 3056:569–579. [doi:10.1007/b97861]
Lee, Y.K., Kim, W.Y., Cai, Y.D., Han, J., 2003. CoMine: Efficient Mining of Correlated Patterns. Proc. 3rd IEEE Int. Conf. on Data Mining, p.581–584. [doi:10.1109/ICDM. 2003.1250982]
Lenca, P., Meyer, P., Vaillant, B., Lallich, S., 2008. On selecting interestingness measures for association rules: user oriented description and multiple criteria decision aid. Eur. J. Oper. Res., 184(2):610–626. [doi:10.1016/j.ejor.2006. 10.059]
Liu, B., Hsu, W., Ma, Y., 1999. Pruning and Summarizing the Discovered Associations. ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, p.125–134. [doi:10.1145/312129.312216]
Lu, H., Han, J., Feng, L., 1998. Stock Movement Prediction and N-dimensional Inter-transaction Association Rules. Proc. 3rd ACM-SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery, p.1–7.
Omiecinski, E.R., 2003. Alternative interesting measures for mining associations in databases. IEEE Trans. Knowl. Data Eng., 15(1):57–69. [doi:10.1109/TKDE.2003.1161 582]
Ozden, B., Ramaswamy, S., Silberschatz, A., 1998. Cyclic Association Rules. Proc. 14th Int. Conf. on Data Engineering, p.412–421. [doi:10.1109/ICDE.1998.655804]
Palshikar, G.K., Kale, M.S., Apte, M.M., 2007. Association rules mining using heavy itemsets. Data Knowl. Eng., 61(1):93–113. [doi:10.1016/j.datak.2006.04.009]
Qin, M., Hwang, K., 2004. Frequent Episode Rules for Internet Anomaly Detection. Proc. 3rd IEEE Int. Symp. on Network Computing and Applications, p.161–168. [doi:10.1109/NCA.2004.1347773]
Ruggeri, F., Kenett, R.S., Faltin, F.W., 2007. Encyclopedia of Statistics in Quality and Reliability. Wiley, Chichester, England.
Shen, B., Yao, M., 2009a. Mining associated and item-item correlated frequent patterns. J. Zhejiang Univ. (Eng. Sci.), 43(12):2171–2177 (in Chinese). [doi:10.3785/j.issn.1008-973X.2009.12.008]
Shen, B., Yao, M., 2009b. A new kind of dynamic association rule and its mining algorithms. Control Dec., 24(9):1310–1315 (in Chinese).
Shen, B., Yao, M., Wu, Z.H., Gao, Y.J., 2010. Mining dynamic association rules with comments. Knowl. Inform. Syst., 23(1):73–98. [doi:10.1007/s10115-009-0207-1]
Tan, P.N., Kumar, V., Srivastava, J., 2002. Selecting the Right Interestingness Measure for Association Patterns. Proc. 8th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, p.32–41. [doi:10.1145/775047.775053]
Xiong, H., Tan, P.N., Kumar, V., 2006. Hyperclique pattern discovery. Data Min. Knowl. Disc., 13(2):219–242. [doi:10.1007/s10618-006-0043-9]
Zhang, S., Chen, F., Wu, X., Zhang, C., Wang, R., 2006. Identifying Bridging Rules Between Conceptual Clusters. Proc. 12th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, p.815–820. [doi:10.1145/1150402.1150509]
Zhang, S., Chen, F., Jin, Z., Wang, R., 2009. Mining class-bridge rules based on rough sets. Exp. Syst. Appl., 36(3):6453–6460. [doi:10.1016/j.eswa.2008.07.044]
Zhou, Z.M., Wu, Z.H., Wang, C.S., Feng, Y., 2006a. Mining both associated and correlated patterns. LNCS, 3994:468–475. [doi:10.1007/11758549]
Zhou, Z.M., Wu, Z.H., Wang, C.S., Feng, Y., 2006b. Efficiently mining mutually and positively correlated patterns. LNCS, 4093:118–125. [doi:10.1007/11811305]
Zhou, Z.M., Wu, Z.H., Wang, C.S., Feng, Y., 2006c. Efficiently mining both association and correlation rules. LNCS, 4223:369–372. [doi:10.1007/11881599]
Author information
Authors and Affiliations
Corresponding author
Additional information
Project supported by the National Natural Science Foundation of China (Nos. 10876036 and 70871111) and the Ningbo Natural Science Foundation, China (No. 2010A610113)
Rights and permissions
About this article
Cite this article
Shen, B., Yao, M., Xie, Lj. et al. Mining item-item and between-set correlated association rules. J. Zhejiang Univ. - Sci. C 12, 96–109 (2011). https://doi.org/10.1631/jzus.C0910717
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1631/jzus.C0910717
Key words
- Item-item and between-set correlated association rules
- All-confidence
- All-item-confidence
- Item-set correlation
- Mining algorithms
- Pruning effect