Abstract
Frequent pattern mining is one among the popular data mining techniques. Frequent pattern mining approaches extract interesting associations among the items in a given transactional database. The items of the transactional database can be organized as a concept hierarchy. Notably, frequent pattern mining does not distinguish the patterns by analyzing the categories of the items in a given concept hierarchy. In several applications, it is often useful to distinguish among the frequent patterns by analyzing how the items of the pattern are mapped to different categories of the concept hierarchy. In this paper, we propose a new interestingness measure, designated as diversity rank (drank), for capturing the diversity of a given pattern by analyzing the extent to which the items of the pattern are associated with the categories of the corresponding concept hierarchy. Given a transactional database over a set I of items and the corresponding concept hierarchy on I, we propose a methodology to compute the drank of the given pattern. Furthermore, by extending the notion of drank, we propose an approach to improve the diversity and accuracy of association rule-based recommender system. The results of our performance evaluation on the real-world MovieLens dataset demonstrate that the proposed diversity model extracts different kinds of patterns as compared to frequent patterns. Furthermore, our proposed recommender system approach improves the diversity performance w.r.t. the existing association rule-based recommender system without significantly compromising the accuracy. Overall, the proposed concept hierarchy-based diverse pattern model provides a scope to develop new approaches for improving the performance of frequent pattern mining-based applications.
Similar content being viewed by others
References
Adomavicius, G., Tuzhilin, A.: Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. Trans. Knowl. Data Eng. 17(6), 734–749 (2005)
Aggarwal, C.C., Han, J.: Frequent Pattern Mining. Springer, Berlin (2014)
Aggarwal, C.C., Yu, P.S.: A new framework for itemset generation. In: Proceedings of the 17th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, PODS ’98, pp. 18–24. ACM (1998)
Agrawal, R., Imieliński, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proceedings of the International Conference on Management of Data, SIGMOD ’93, pp. 207–216. ACM (1993)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: Proceedings of the 20th International Conference on Very Large Data Bases, VLDB ’94, pp. 487–499. Morgan Kaufmann Publishers Inc. (1994)
Ali, Z., Khusro, S., Ullah, I.: A hybrid book recommender system based on table of contents (toc) and association rule mining. In: Proceedings of the 10th International Conference on Informatics and Systems, INFOS ’16, pp. 68–74. ACM (2016)
Bache, K., Newman, D., Smyth, P.: Text-based measures of document diversity. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’13, pp. 23–31. ACM (2013)
Bradley, K., Smyth, B.: Improving recommendation diversity. In: Proceedings of the 12th National Conference in Artificial Intelligence and Cognitive Science, AICS ’01, pp. 75–84 (2001)
Brin, S., Motwani, R., Silverstein, C.: Beyond market baskets: Generalizing association rules to correlations. In: Proceedings of the 1997 International Conference on Management of Data, SIGMOD ’97, pp. 265–276. ACM (1997)
Cormen, T.H., Stein, C., Rivest, R.L., Leiserson, C.E.: Introduction to Algorithms, 2nd edn. McGraw-Hill Higher Education, New York (2001)
Deshpande, M., Karypis, G.: Item-based top-n recommendation algorithms. Trans. Inf. Syst. 22(1), 143–177 (2004)
Han, J., Cheng, H., Xin, D., Yan, X.: Frequent pattern mining: current status and future directions. Data Min. Knowl. Discov. 15(1), 55–86 (2007)
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: Proceedings of the International Conference on Management of Data, SIGMOD ’00, pp. 1–12. ACM (2000)
Harper, F.M., Konstan, J.A.: The movielens datasets: history and context. Trans. Interact. Intell. Syst. 5(4), 19:1–19:19 (2015)
Herlocker, J.L., Konstan, J.A., Terveen, L.G., Riedl, J.T.: Evaluating collaborative filtering recommender systems. Trans. Inf. Syst. 22(1), 5–53 (2004)
IMDb: Internet movie database. (2014) http://www.imdb.com/genre/ . Accessed Oct 2014
Kaufman, K.A., Michalski, R.S.: A method for reasoning with structured and continuous attributes in the inlen-2 multistrategy knowledge discovery system. In: Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining, KDD ’96, pp. 232–237. AAAI Press (1996)
Kiran, R.U., Kitsuregawa, M.: An improved neighborhood-restricted association rule-based recommender system. In: Proceedings of the 24th Australasian Database Conference - Volume 137, ADC ’13, pp. 43–50. Australian Computer Society, Inc. (2013)
Kumara Swamy, M., Krishna Reddy, P.: Improving diversity performance of association rule based recommender systems. In: Proceedings of the 26th International Conference on Database and Expert Systems Applications—Part I, DEXA ’15, pp. 499–508. Springer, Berlin (2015)
Kunaver, M., Porl, T.: Diversity in recommender systems a survey. Knowl. Based Syst. 123(C), 154–162 (2017)
Lin, W., Alvarez, S.A., Ruiz, C.: Efficient adaptive-support association rule mining for recommender systems. Data Min. Knowl. Discov. 6(1), 83–105 (2002)
Liu, B., Hsu, W., Ma, Y.: Mining association rules with multiple minimum supports. In: Proceedings of the 5th International Conference on Knowledge Discovery and Data Mining, KDD ’99, pp. 337–341. ACM (1999)
Liu, L., Zhu, F., Chen, C., Yan, X., Han, J., Yu, P.S., Yang, S.: Mining diversity on networks. In: Proceedings of the 15th International Conference on Database Systems for Advanced Applications—Volume Part I, DASFAA ’10, pp. 384–398. Springer (2010)
Lu, Y.: Concept hierarchy in data mining: specification, generation and implementation. Master’s thesis, School of Computer Science, Simon Fraser University, Canada (1997)
Mei, Q., Guo, J., Radev, D.: Divrank: The interplay of prestige and diversity in information networks. In: Proceedings of the 16th International Conference on Knowledge Discovery and Data Mining, KDD ’10, pp. 1009–1018. ACM (2010)
Moonen, L., Di Alesio, S., Binkley, D., Rolfsnes, T.: Practical guidelines for change recommendation using association rule mining. In: Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, ASE 2016, pp. 732–743. ACM (2016)
Omiecinski, E.R.: Alternative interest measures for mining associations in databases. Trans. Knowl. Data Eng. 15(1), 57–69 (2003)
Qian, X., Lu, D., Wang, Y., Zhu, L., Tang, Y.Y., Wang, M.: Image re-ranking based on topic diversity. Trans. Image Process. 26(8), 3734–3747 (2017)
Rao, C.: Diversity and dissimilarity coefficients: a unified approach. Theor. Popul. Biol. 21(1), 24–43 (1982)
Resnick, P., Iacovou, N., Suchak, M., Bergstrom, P., Riedl, J.: Grouplens: an open architecture for collaborative filtering of netnews. In: Proceedings of the ACM Conference on Computer Supported Cooperative Work, CSCW ’94, pp. 175–186. ACM (1994)
Said, A., Tikk, D., Hotho, A.: The challenge of recommender systems challenges. In: Proceedings of the 6th ACM Conference on Recommender Systems, RecSys ’12, pp. 9–10. ACM (2012)
Sarwar, B., Karypis, G., Konstan, J., Riedl, J.: Analysis of recommendation algorithms for e-commerce. In: Proceedings of the 2nd Conference on Electronic Commerce, EC ’00, pp. 158–167. ACM (2000)
Sarwar, B., Karypis, G., Konstan, J., Riedl, J.: Item-based collaborative filtering recommendation algorithms. In: Proceedings of the 10th International Conference on World Wide Web, WWW ’01, pp. 285–295. ACM (2001)
Spertus, E.: Parasite: mining structural information on the web. Computer Netw. ISDN Syst. 29(8–13), 1205–1215 (1997)
Srikant, R., Agrawal, R.: Mining generalized association rules. In: Proceedings of the 21th International Conference on Very Large Data Bases, VLDB ’95, pp. 407–419. Morgan Kaufmann Publishers Inc. (1995)
Stirling, A.: A general framework for analysing diversity in science, technology and society. J. R. Soc. Interface 4(15), 707–719 (2007)
Su, X., Khoshgoftaar, T.M.: A survey of collaborative filtering techniques. Adv. Artif. Intell. 2009, 4:2–4:2 (2009)
Tan, P.N., Kumar, V., Srivastava, J.: Selecting the right objective measure for association analysis. Inf. Syst. Knowl. Discov. Data Min. 29(4), 293–313 (2004)
Tanbeer, S.K., Ahmed, C.F., Jeong, B.S., Lee, Y.K.: Discovering periodic-frequent patterns in transactional databases. In: Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD ’09, pp. 242–253. Springer (2009)
Wasilewski, J., Hurley, N.: Intent-aware item-based collaborative filtering for personalised diversification. In: Proceedings of the 26th Conference on User Modeling, Adaptation and Personalization, UMAP ’18, pp. 81–89. ACM (2018)
Weiss, G.M.: Mining with rarity: a unifying framework. SIGKDD Explor. Newsl. 6(1), 7–19 (2004)
Wille, R.: Concept lattices and conceptual knowledge systems. Comput. Math. Appl. 23(6–9), 493–515 (1992)
Zhang, Y., Xu, F.F., Lyu, T., Ren, X., Han, J.: Incorporating diversity into influential node mining. Computing Research Repository (2018)
Ziegler, C.N., McNee, S.M., Konstan, J.A., Lausen, G.: Improving recommendation lists through topic diversification. In: Proceedings of the 14th International Conference on World Wide Web, WWW ’05, pp. 22–32. ACM (2005)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Kumara Swamy, M., Krishna Reddy, P. A model of concept hierarchy-based diverse patterns with applications to recommender system. Int J Data Sci Anal 10, 177–191 (2020). https://doi.org/10.1007/s41060-019-00203-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41060-019-00203-2