Mining changes in customer buying behavior for collaborative recommendations
Introduction
Recommender systems have been a recent focus of researchers and practitioners. Many companies hope that the use of recommender systems may be a means of surviving in a competitive environment. Recommender systems are particularly suited to retail business, as compared to other types of business, since retail markets are distinguished by several characteristics, such as repeated buying over a particular time horizon, large numbers of customers, and a wealth of information detailing past customer purchases (Schmittlein & Peterson, 1994).
In general, retail companies operate purchase databases in a longitudinal way, such that all transactions are stored in chronological order. A record-of-transaction database typically contains the transaction date for and the products bought in the course of, a given transaction. Usually, each record also contains a customer ID, particularly when the purchase was made using a credit card or a frequent-buyer card. Therefore, the purchasing sequence of a customer in the database who has made repeat purchases can easily be determined. This purchase sequence provides a description of the changes in a customer's preferences over time. However, in our domain of knowledge, there has been little study of the question of whether recommendations based on purchase sequences may be more accurate than existing recommender system predictions, based on non-sequential patterns. In this study, for the purpose of enhancing the quality of recommendations, we propose a new methodology that considers the way in which a customer's purchase sequence evolves over time.
To date, a variety of recommender systems (Balabanović and Shoham, 1997, Basu et al., 1998, Hill et al., 1995, Lawrence et al., 2001, Resnick et al., 1994, Sarwar et al., 2001, Shardanand and Maes, 1995) has been developed. Collaborative filtering (CF) has thus far been the most successful recommendation technique and has been used in a number of different applications, such as in the recommendation of web pages, movies, articles and products (Hill et al., 1995, Resnick et al., 1994, Shardanand and Maes, 1995). Collaborative filtering works by recommending products to a target customer through a process of identifying people who share similar preferences for products and looking for those products that target customers are most likely to purchase.
The recommendation processes of typical collaborative filtering in retail business consist of the following three steps (Sarwar et al., 2000, Sarwar et al., 2001).
(1) Customer profile construction
The purchase transaction records of a customer for a certain period are used to build a customer profile describing his or her likes and dislikes. The system represents the customer profile, A, such that aij is one if a customer i has purchased a product j, and zero, otherwise.
(2) Neighborhood formation
This is the most important part of the CF-based recommender systems. The system finds a set of customers, known as neighbors, who, in the past, have exhibited similar behavior (i.e. bought a similar set of products), through calculating the correlations among customers for the customer profile. A set of K customers is usually found (a neighborhood of size K), which is formed according to the degree of similarity between each of the neighbors and a target customer.
(3) Recommendation generation
Once a neighborhood is formed for a target customer, the system generates a set of the top N products that the target customer is most likely to purchase, by searching for products that the neighbors have purchased and that the target customer has not yet purchased.
As mentioned above, typical collaborative filtering is static, since it only makes use of information relating to whether the customer bought a product during a certain period, and does not use information on the purchase sequences of customers in the determination of the neighbors of a target customer. However, customers in retail business are not static, and their buying behavior changes over time. Thus, the quality of the recommendation of the typical CF could be further improved through the use of the available information on the purchase sequences of customers.
To illustrate the importance of this potential improvement in accuracy, let us consider the following example. Table 1 presents typical transaction records for a retail company and the customer profile is provided in Table 2. This example determines products that target customer ID011 is likely to buy, using transaction records for consumers CID001 through CID011.
Assume that the typical CF algorithm is used for solving the problem, that the neighborhood size (K) is three, and that the number of products recommended (N) is two. The typical collaborative filtering algorithm considers the correlation of preferences between the target customer and the other customers. All of the four customers, CID011, CID001, CID002 and CID003, commonly bought ‘Perfumes’ and ‘Skincare Products.’ The similarities between CID011 and the other three customers are equivalent; that is, the Pearson correlation coefficient is 0.67. Therefore, a recommender system based on the collaborative filtering algorithm will determine that CID001, CID002 and CID003 are the nearest neighbors and have the same preferences as the target customer. However, it is quite difficult to select two products that should be recommended to CID011, because CID001, CID002 and CID003 each purchased different additional products: ‘Dresses,’ ‘Shoes,’ and ‘Knits,’, respectively. In this case, two different products to be recommended to CID011 would have to be selected randomly. Accordingly, the recommendations would not necessarily be very appropriate for the preferences of the target customer.
Table 3 provides a customer profile rearranged with regard to each customer's transaction time. From the table, it can be seen that the purchase sequence of the target customer CID011 occurred in the order of Perfumes, followed by Skincare Products. Similarly, the purchase sequences of CID001, CID002 and CID003 were Perfumes→Skincares→Dresses, Perfumes→Shoes→Skincares, and Skincares→Perfumes→Knits, respectively. Assume now that the neighbors of the target customer are determined based on the purchase sequence of each customer. Customers who have purchase sequences similar to that of CID011 include CID 001 and CID002. The purchase sequences of CID011 and CID001 are exactly same, as both bought the same products during the same month; therefore, the nearest neighbor of the target customer CID011 is CID001. The next nearest neighbor is CID002. Therefore, the products recommendable to CID011 are ‘Dresses,’ and ‘Shoes.’ As mentioned above, when the past purchase sequences of each customer are available, this knowledge can be used to enhance the quality of the recommendations made.
However, if a customer profile rearranged by time is applied directly, this application results in a significant problem, namely, the sparsity problem. It is well known that a sparse data set, having few nonzero entries, decreases recommendation accuracy (Mobasher, Cooly, & Srivastava, 2000). The sparsity level, defined as 1- (nonzero entries/total entries), of Table 2 is 60% (=1−(22/55)), while that of Table 3 is 87% (=1−(22/165)). In general, the sparsity level of a typical data set in the field of recommendation is over 95%. Rearrangement of an input matrix by time results in an increased time dimension, as compared to that of a typical customer profile, and thereby makes the input data set sparser. Therefore, a new solution to the sparsity problem must be found.
In our research, we employ a clustering technique that groups the transactions of customers into homogeneous subgroups. The SOM (Self-Organizing Map) technique, which has been applied frequently of late, is used for clustering (Kohonen, 1990). With the aid of the SOM, all the transactions of customers may be allocated to a certain cluster and a cluster number imposed. The change in the cluster number resulting from each transaction determines a customer purchase sequence.
By observing changes in the cluster number of each customer over time, a buying sequence can be built for each customer. These buying sequences are potentially capable of predicting the future purchases of a target customer. However, since not all buying sequences have a statistical validity sufficient to guarantee the generalization of the prediction, the association rule mining technique may be used to extract the sequential patterns from the buying sequences (Agrawal, Imielinski, & Swami, 1993).
Section snippets
Overall procedure
Generally, most marketing campaigns are conducted based on transactions occurring during a specified time period (e.g. 3 months or 6 weeks). We assume that a time period of length l is used to detect the purchase sequence of a customer and that a product recommendation for a target customer is made at time T. In other words, our problem can be described as follows: When the purchase sequence and buying history of a target customer for the past l−1 periods prior to time T are given, which
Data sets
We used real-world data to examine the performance of the proposed approach. The data used in the experiment were transaction records for those goods sold by the H department store, the third largest department store in Korea, that were commonly purchased by women. In addition, we used transaction records obtained during the eight-month period from May to December 2000, in order to establish the behavioral characteristics of the customers over time. The input data from the H department store
Conclusion
The preferences of customers change over time. In this study, we described a model-based approach for mining the changes in customer buying behavior over time and discussed solutions to several problems: data preprocessing, behavior locus extraction, and recommendation formulation based on extracted loci. Using the derived recommendation list, companies may be able to perform effective one-to-one marketing campaigns by providing individual target customers with personalized product
Acknowledgement
This work was supported by research program 2005 of Kookmin University in Korea.
References (20)
- et al.
A personalized recommender system based on web usage mining and decision tree induction
Expert Systems With Applications
(2002) - et al.
Heuristic and optimization approaches to extending the Kohonen self organizing algorithm
European Journal of Operational Research
(1996) - et al.
Goal-oriented sequential pattern for network banking churn analysis
Expert Systems with Applications
(2003) - et al.
Mining association rules between sets of items in large databases
(1993) - et al.
Content-based, collaborative recommendation
Communacations of the ACM
(1997) - Basu, C., Hirsh, H., & Cohen, W. (1998). Recommendation as classification: using social and content-based information...
- Billsus, D., & Pazzani, M.J. (1998). Learning collaborative information filters. Proceedings of the 15th international...
- et al.
Application of web usage mining and product taxonomy to collaborative recommendations in e-commerce
Expert Systems With Applications
(2004) Tests of equality between sets of coefficients in two regressions
Econometrica
(1960)- Hill, W., Stead, L., Rosenstein, M., & Furnas, G. (1995). Recommending and evaluating choices in a communication of...
Cited by (78)
Recommending untapped M&A opportunities: A combined approach using principal component analysis and collaborative filtering
2019, Expert Systems with ApplicationsRecommend products with consideration of multi-category inter-purchase time and price
2018, Future Generation Computer SystemsBehavior-aware user response modeling in social media: Learning from diverse heterogeneous data
2015, European Journal of Operational ResearchA Location-Item-Time sequential pattern mining algorithm for route recommendation
2015, Knowledge-Based SystemsA literature review and classification of recommender systems research
2012, Expert Systems with Applications
- 1
Tel.: +82 2 910 4950; fax: +82 2 910 4519.