Mining changes in customer buying behavior for collaborative recommendations

doi:10.1016/j.eswa.2004.10.015

Expert Systems with Applications

Volume 28, Issue 2, February 2005, Pages 359-369

https://doi.org/10.1016/j.eswa.2004.10.015 Get rights and content

Abstract

The preferences of customers change over time. However, existing collaborative filtering (CF) systems are static, since they only incorporate information regarding whether a customer buys a product during a certain period and do not make use of the purchase sequences of customers. Therefore, the quality of the recommendations of the typical CF could be improved through the use of information on such sequences.

In this study, we propose a new methodology for enhancing the quality of CF recommendation that uses customer purchase sequences. The proposed methodology is applied to a large department store in Korea and compared to existing CF techniques. Various experiments using real-world data demonstrate that the proposed methodology provides higher quality recommendations than do typical CF techniques, with better performance, especially with regard to heavy users.

Introduction

Recommender systems have been a recent focus of researchers and practitioners. Many companies hope that the use of recommender systems may be a means of surviving in a competitive environment. Recommender systems are particularly suited to retail business, as compared to other types of business, since retail markets are distinguished by several characteristics, such as repeated buying over a particular time horizon, large numbers of customers, and a wealth of information detailing past customer purchases (Schmittlein & Peterson, 1994).

In general, retail companies operate purchase databases in a longitudinal way, such that all transactions are stored in chronological order. A record-of-transaction database typically contains the transaction date for and the products bought in the course of, a given transaction. Usually, each record also contains a customer ID, particularly when the purchase was made using a credit card or a frequent-buyer card. Therefore, the purchasing sequence of a customer in the database who has made repeat purchases can easily be determined. This purchase sequence provides a description of the changes in a customer's preferences over time. However, in our domain of knowledge, there has been little study of the question of whether recommendations based on purchase sequences may be more accurate than existing recommender system predictions, based on non-sequential patterns. In this study, for the purpose of enhancing the quality of recommendations, we propose a new methodology that considers the way in which a customer's purchase sequence evolves over time.

To date, a variety of recommender systems (Balabanović and Shoham, 1997, Basu et al., 1998, Hill et al., 1995, Lawrence et al., 2001, Resnick et al., 1994, Sarwar et al., 2001, Shardanand and Maes, 1995) has been developed. Collaborative filtering (CF) has thus far been the most successful recommendation technique and has been used in a number of different applications, such as in the recommendation of web pages, movies, articles and products (Hill et al., 1995, Resnick et al., 1994, Shardanand and Maes, 1995). Collaborative filtering works by recommending products to a target customer through a process of identifying people who share similar preferences for products and looking for those products that target customers are most likely to purchase.

The recommendation processes of typical collaborative filtering in retail business consist of the following three steps (Sarwar et al., 2000, Sarwar et al., 2001).

(1) Customer profile construction

The purchase transaction records of a customer for a certain period are used to build a customer profile describing his or her likes and dislikes. The system represents the customer profile, A, such that a_ij is one if a customer i has purchased a product j, and zero, otherwise.

(2) Neighborhood formation

This is the most important part of the CF-based recommender systems. The system finds a set of customers, known as neighbors, who, in the past, have exhibited similar behavior (i.e. bought a similar set of products), through calculating the correlations among customers for the customer profile. A set of K customers is usually found (a neighborhood of size K), which is formed according to the degree of similarity between each of the neighbors and a target customer.

(3) Recommendation generation

Once a neighborhood is formed for a target customer, the system generates a set of the top N products that the target customer is most likely to purchase, by searching for products that the neighbors have purchased and that the target customer has not yet purchased.

As mentioned above, typical collaborative filtering is static, since it only makes use of information relating to whether the customer bought a product during a certain period, and does not use information on the purchase sequences of customers in the determination of the neighbors of a target customer. However, customers in retail business are not static, and their buying behavior changes over time. Thus, the quality of the recommendation of the typical CF could be further improved through the use of the available information on the purchase sequences of customers.

To illustrate the importance of this potential improvement in accuracy, let us consider the following example. Table 1 presents typical transaction records for a retail company and the customer profile is provided in Table 2. This example determines products that target customer ID011 is likely to buy, using transaction records for consumers CID001 through CID011.

Assume that the typical CF algorithm is used for solving the problem, that the neighborhood size (K) is three, and that the number of products recommended (N) is two. The typical collaborative filtering algorithm considers the correlation of preferences between the target customer and the other customers. All of the four customers, CID011, CID001, CID002 and CID003, commonly bought ‘Perfumes’ and ‘Skincare Products.’ The similarities between CID011 and the other three customers are equivalent; that is, the Pearson correlation coefficient is 0.67. Therefore, a recommender system based on the collaborative filtering algorithm will determine that CID001, CID002 and CID003 are the nearest neighbors and have the same preferences as the target customer. However, it is quite difficult to select two products that should be recommended to CID011, because CID001, CID002 and CID003 each purchased different additional products: ‘Dresses,’ ‘Shoes,’ and ‘Knits,’, respectively. In this case, two different products to be recommended to CID011 would have to be selected randomly. Accordingly, the recommendations would not necessarily be very appropriate for the preferences of the target customer.

Table 3 provides a customer profile rearranged with regard to each customer's transaction time. From the table, it can be seen that the purchase sequence of the target customer CID011 occurred in the order of Perfumes, followed by Skincare Products. Similarly, the purchase sequences of CID001, CID002 and CID003 were Perfumes→Skincares→Dresses, Perfumes→Shoes→Skincares, and Skincares→Perfumes→Knits, respectively. Assume now that the neighbors of the target customer are determined based on the purchase sequence of each customer. Customers who have purchase sequences similar to that of CID011 include CID 001 and CID002. The purchase sequences of CID011 and CID001 are exactly same, as both bought the same products during the same month; therefore, the nearest neighbor of the target customer CID011 is CID001. The next nearest neighbor is CID002. Therefore, the products recommendable to CID011 are ‘Dresses,’ and ‘Shoes.’ As mentioned above, when the past purchase sequences of each customer are available, this knowledge can be used to enhance the quality of the recommendations made.

However, if a customer profile rearranged by time is applied directly, this application results in a significant problem, namely, the sparsity problem. It is well known that a sparse data set, having few nonzero entries, decreases recommendation accuracy (Mobasher, Cooly, & Srivastava, 2000). The sparsity level, defined as 1- (nonzero entries/total entries), of Table 2 is 60% (=1−(22/55)), while that of Table 3 is 87% (=1−(22/165)). In general, the sparsity level of a typical data set in the field of recommendation is over 95%. Rearrangement of an input matrix by time results in an increased time dimension, as compared to that of a typical customer profile, and thereby makes the input data set sparser. Therefore, a new solution to the sparsity problem must be found.

In our research, we employ a clustering technique that groups the transactions of customers into homogeneous subgroups. The SOM (Self-Organizing Map) technique, which has been applied frequently of late, is used for clustering (Kohonen, 1990). With the aid of the SOM, all the transactions of customers may be allocated to a certain cluster and a cluster number imposed. The change in the cluster number resulting from each transaction determines a customer purchase sequence.

By observing changes in the cluster number of each customer over time, a buying sequence can be built for each customer. These buying sequences are potentially capable of predicting the future purchases of a target customer. However, since not all buying sequences have a statistical validity sufficient to guarantee the generalization of the prediction, the association rule mining technique may be used to extract the sequential patterns from the buying sequences (Agrawal, Imielinski, & Swami, 1993).

Section snippets

Overall procedure

Generally, most marketing campaigns are conducted based on transactions occurring during a specified time period (e.g. 3 months or 6 weeks). We assume that a time period of length l is used to detect the purchase sequence of a customer and that a product recommendation for a target customer is made at time T. In other words, our problem can be described as follows: When the purchase sequence and buying history of a target customer for the past l−1 periods prior to time T are given, which

Data sets

We used real-world data to examine the performance of the proposed approach. The data used in the experiment were transaction records for those goods sold by the H department store, the third largest department store in Korea, that were commonly purchased by women. In addition, we used transaction records obtained during the eight-month period from May to December 2000, in order to establish the behavioral characteristics of the customers over time. The input data from the H department store

Conclusion

The preferences of customers change over time. In this study, we described a model-based approach for mining the changes in customer buying behavior over time and discussed solutions to several problems: data preprocessing, behavior locus extraction, and recommendation formulation based on extracted loci. Using the derived recommendation list, companies may be able to perform effective one-to-one marketing campaigns by providing individual target customers with personalized product

Acknowledgement

This work was supported by research program 2005 of Kookmin University in Korea.

References (20)

Y.H. Cho et al.
A personalized recommender system based on web usage mining and decision tree induction
Expert Systems With Applications
(2002)
M.A. Nour et al.
Heuristic and optimization approaches to extending the Kohonen self organizing algorithm
European Journal of Operational Research
(1996)
D. Chiang et al.
Goal-oriented sequential pattern for network banking churn analysis
Expert Systems with Applications
(2003)
R. Agrawal et al.
Mining association rules between sets of items in large databases
(1993)
M. Balabanović et al.
Content-based, collaborative recommendation
Communacations of the ACM
(1997)
Basu, C., Hirsh, H., & Cohen, W. (1998). Recommendation as classification: using social and content-based information...
Billsus, D., & Pazzani, M.J. (1998). Learning collaborative information filters. Proceedings of the 15th international...
Y.H. Cho et al.
Application of web usage mining and product taxonomy to collaborative recommendations in e-commerce
Expert Systems With Applications
(2004)
G.C. Chow
Tests of equality between sets of coefficients in two regressions
Econometrica
(1960)
Hill, W., Stead, L., Rosenstein, M., & Furnas, G. (1995). Recommending and evaluating choices in a communication of...

There are more references available in the full text version of this article.

Cited by (78)

Recommending untapped M&A opportunities: A combined approach using principal component analysis and collaborative filtering
2019, Expert Systems with Applications
This study proposes an analysis framework for recommending untapped mergers and acquisitions (M&A) opportunities by combining principal component analysis (PCA) and collaborative filtering. The framework is particularly suitable for suggesting M&A targets that may possess the complementary knowledge resources for improving performance gains. In this study, PCA was firstly applied to reduce the dimensionality of original data and to identify key knowledge categories of firms. Based on these results, the potential knowledge requirements and priority areas of firm's knowledge base can be located. Collaborative filtering technique was then adopted to a firm's knowledge portfolio, which is represented as a set of SIC (standard industrial classification) codes, for making recommendations. The applicability of the framework was demonstrated using a case study approach based on the analysis of the biotechnology M&A market. The proposed framework would be a good complement to existing practice of identifying novel M&A opportunities and can create a basis for constructing a data-driven business intelligence system. From an expert and intelligent systems point of view, the study provides decision makers with a general and flexible modeling approach for assisting them in the formulation of M&A strategy. It enlarges the possibility of decomposing the problem into manageable units by automating the emulation of human decision making.
Recommend products with consideration of multi-category inter-purchase time and price
2018, Future Generation Computer Systems
This study focuses on diversifying recommendations and improving recommendation accuracy by incorporating multi-category purchase interval and price in a novel way. We suggest a method to model the drift of a user’s interest for different categories based on sequential pattern mining. Additionally, we propose a new approach to model personalized multi-category inter-purchase interval for the user. By employing fuzzy set theory, we also put forward an approach to model the price preferences for each user. We propose a recommender system that incorporates the methods mentioned above. The experimental results based on real purchase records show that the proposed recommendation algorithm has high stability and superior performance in recommending products with different characteristics. The results also demonstrate the effectiveness of incorporating purchase interval and price factor in recommender systems. This study demonstrates the existence of particular inter-purchase intervals among different categories, and indicates that price is an important factor influencing customers’ decision making.
Behavior-aware user response modeling in social media: Learning from diverse heterogeneous data
2015, European Journal of Operational Research
With the rapid development of Web 2.0 applications, social media have increasingly become a major factor influencing the purchase decisions of customers. Longitudinal individual and engagement behavioral data generated on social media sites post challenges to integrate diverse heterogeneous data to improve prediction performance in customer response modeling. In this study, a hierarchical ensemble learning framework is proposed for behavior-aware user response modeling using diverse heterogeneous data. In the framework, a general-purpose data transformation and feature extraction strategy is developed to transform the heterogeneous high-dimensional multi-relational datasets into customer-centered high-order tensors and to extract attributes. An improved hierarchical multiple kernel support vector machine (H-MK-SVM) is developed to integrate the external, tag and keyword, individual behavioral and engagement behavioral data for feature selection from multiple correlated attributes and for ensemble learning in user response modeling. The subagging strategy is adopted to deal with large-scale imbalanced datasets. Computational experiments using a real-world microblog database were conducted to investigate the benefits of integrating diverse heterogeneous data. Computational results show that the improved H-MK-SVM using longitudinal individual behavioral data exhibits superior performance over some commonly used methods using aggregated behavioral data and the improved H-MK-SVM using engagement behavioral data performs better than using only the external and individual behavioral data.
A Location-Item-Time sequential pattern mining algorithm for route recommendation
2015, Knowledge-Based Systems
To survive in a rapidly changing environment, theme parks need to provide high quality services in terms of visitor tastes and preferences. Understanding the spatial and temporal behavior of visitors could enhance the attraction management and geographical distribution for visitors. To fulfill the need, this research defined a Location-Item-Time (LIT) sequence to describe visitor’s spatial and temporal behavior. Then, the Location-Item-Time PrefixSpan (LIT-PrefixSpan) mining algorithm is developed to discover frequent LIT sequential patterns. Next, the route suggestion procedure is proposed to retrieve suitable LIT sequential patterns for visitors under the constraints of their intended-visiting time, favorite regions, and favorite recreation facilities. A simplified theme park is used as an example to show the feasibility of the proposed system. The experimental results show that the system can help managers understand visitors’ behavior and provide appropriate visiting experiences for visitors.
A literature review and classification of recommender systems research
2012, Expert Systems with Applications
Recommender systems have become an important research field since the emergence of the first paper on collaborative filtering in the mid-1990s. Although academic research on recommender systems has increased significantly over the past 10 years, there are deficiencies in the comprehensive literature review and classification of that research. For that reason, we reviewed 210 articles on recommender systems from 46 journals published between 2001 and 2010, and then classified those by the year of publication, the journals in which they appeared, their application fields, and their data mining techniques. The 210 articles are categorized into eight application fields (books, documents, images, movie, music, shopping, TV programs, and others) and eight data mining techniques (association rule, clustering, decision tree, k-nearest neighbor, link analysis, neural network, regression, and other heuristic methods). Our research provides information about trends in recommender systems research by examining the publication years of the articles, and provides practitioners and researchers with insight and future direction on recommender systems. We hope that this paper helps anyone who is interested in recommender systems research with insight for future research direction.
A Survey of Sequential Pattern Based E-Commerce Recommendation Systems
2023, Algorithms

View all citing articles on Scopus

¹: Tel.: +82 2 910 4950; fax: +82 2 910 4519.

View full text

Mining changes in customer buying behavior for collaborative recommendations

Abstract

Introduction

Section snippets

Overall procedure

Data sets

Conclusion

Acknowledgement

Expert Systems With Applications

European Journal of Operational Research

Expert Systems with Applications

Mining association rules between sets of items in large databases

Content-based, collaborative recommendation

Communacations of the ACM

Application of web usage mining and product taxonomy to collaborative recommendations in e-commerce

Expert Systems With Applications

Tests of equality between sets of coefficients in two regressions

Econometrica