Abstract
Association rule mining is one of the useful techniques in data mining and knowledge discovery that extracts interesting relationships between items in datasets. Generally, the number of association rules in a particular dataset mainly depends on the measures of ’support’ and ’confidence’. To choose the number of useful rules, normally, the measures of ’support’ and ’confidence’ need to be tried many times. In some cases, the measures of ’support’ and ’confidence’ are chosen by experience. Thus, it is a time consuming to find the optimal measure of ’support’ and ’confidence’ for the process of association rule mining in large datasets. This paper proposes a regression based approach to improve the association rule mining process through predicting the number of rules on datasets. The approach includes a regression model in a generic level for general domains and an instantiation scheme to create concrete models in particular domains for predicting the potential number of association rules on a dataset before mining. The proposed approach can be used in broad domains with different types of datasets to improve the association rule mining process. A case study to build a concrete regression model based on a real dataset is demostrated and the result shows the good performance of the proposed approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Fukuda, T., Morimoto, S., Morishita, S., Tokuyama, T.: Mining Optimized Association Rules for Numeric Attributes. In: Proceedings of the Fifteenth ACM Symposium on Principles of Database Systems, pp. 182–191 (1996)
Klemettinen, M., Mannila, H., Ronkainen, P., Toivonen, H., Verkamo, A.I.: Finding Interesting Rules from Large Sets of Discovered Association Rules. In: Proceedings of the 3rd International Conference on Information and Knowledge Management, pp. 401–407 (1994)
Mannila, H., Toivonen, H., Inkeri Verkamo, A.: Efficient Algorithms for Discovering Association Rules. In: Proceedings of the AAAI Workshop on Knowledge Discovery in Databases, pp. 181–192 (1994)
Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules. In: Proceedings of the 20th International Conference on Very Large Data Bases, pp. 487–499 (1994)
Shi, L., Jing, X., Xie, X., Yan, J.: Association Rules Applied to Credit Evaluation. In: 2010 International Conference on Computational Intelligence and Software Engineering (CiSE), pp. 1–4 (2010)
Li, G., Law, R., Rong, J., Vu, H.Q.: Incorporating both Positive and Negative Association Rules into the Analysis of Outbound Tourism in Hong Kong. Journal of Travel and Tourism Marketing 27, 812–828 (2010)
Yi, W.G., Lu, M.Y., Liu, Z.: Regression Analysis of the Number of Association Rules. International Journal of Automation and Computing 8, 78–82 (2011)
Han, J., Kamber, M.: Data mining: Concepts and Techniques, 2nd edn. Elsevier Inc. (2006)
Shaharanee, I.N.M., Dillon, T.S., Hadzic, F.: Assertaining Data Mining Rules Using Statistical Approaches. In: 2011 International Symposium on Computing, Communication, and Control, vol. 1, pp. 181–189 (2011)
Brin, S., Motwani, R., Silverstein, C.: Beyond Market Baskets: Generalizing Association Rules to Correlations. In: Pro. of the ACM SIGMOD Conference on Management of Data, pp. 265–276 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Le, D.T., Ren, F., Zhang, M. (2012). A Regression-Based Approach for Improving the Association Rule Mining through Predicting the Number of Rules on General Datasets. In: Anthony, P., Ishizuka, M., Lukose, D. (eds) PRICAI 2012: Trends in Artificial Intelligence. PRICAI 2012. Lecture Notes in Computer Science(), vol 7458. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32695-0_22
Download citation
DOI: https://doi.org/10.1007/978-3-642-32695-0_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32694-3
Online ISBN: 978-3-642-32695-0
eBook Packages: Computer ScienceComputer Science (R0)