Abstract
In this work we characterize the session-level behavior of users on an Indian mobile phone comparison shopping website. We also correlate the popularity of handset on various news sources to its popularity on the shopping website. There are three aspects to our study: data analysis, correlation between news sources of product information and popularity of a handset, and behavior prediction. We have used KL divergence to show that a time-homogeneous Markov chain is observed when the number of clicks varies from 5 to 30. Our results depict that Markov chain model does not hold in entirety for comparison shopping setting but tells us how far the Markov chain model holds for this setting. Our analysis corroborates intuition that increasing price leads to decrease in popularity. After the strong correlation between various variables and user behavior was found, we predict the users macro (the overall sales of handset) and micro behavior (whether a user will convert or exit the site) using Markov logic networks. Our predictive model validates the intuition that past browsing behavior is an important predictor for future behavior. Methodology of combining data analysis with machine learning is, in our opinion, a new approach to the empirical study of such data sets.
Similar content being viewed by others
References
eBizMBA. (2013). Top 15 most popular comparison shopping websites: May 2013. http://www.ebizmba.com/articles/shopping-websites.
Chatterjee, P., & Wang, Y. (2012). Online comparison shopping behavior of travel consumers. Journal of Quality Assurance in Hospitality and Tourism, 13(1), 1–23.
Kohavi, R., Brodley, C., Frasca, B., Mason, L., & Zhang, Z. (2000). KDD-Cup 2000 Organizers’ Report: Peeling the onion. SIGKDD Explorations, 2(2), 86–98.
Domingos, P., & Lowd, D. (2009). Markov logic: An interface layer for artificial intelligence. San Rafael, CA: Morgan & Claypool Publishers.
Gupta, M., Mittal, H., Singla, P., & Bagchi, A. (2014). Characterizing comparison shopping behavior: A case study. In Data engineering workshops (ICDEW).
Sarukkai, R. R. (2000). Link prediction and path analysis using Markov chains. Computer Networks, 33(1–6), 337–386.
Yates, R. B., Hurtando, C., Mendoza, M., & Dupret, G. (2005). Modeling user search behavior. In LA-WEB, Web congress.
Deshpande, M., & Karypis, G. (2004). Selective Markov models for predicting web page accesses. ACM Transactions Internet Technology, 4(2), 163–184.
Zukerman, I., Albrecht, D. W., & Nicholson, A. E. (1999). Predicting users’ requests on the WWW. In Proceedings of user modeling (pp. 275–284).
Pirolli, P. L. T., & Pitkow, J. E. (1999). Distributions of surfers’ paths through the World Wide Web: Empirical characterizations. Journal World Wide Web, 2(1), 29–45.
Sen, R., & Hansen, M. (2003). Predicting a web user’s next access based on log data. Journal of Computational Graphics and Statistics, 12, 143–155.
Zhu, J., Hong, J., & Hughes, J. G. (2002). Using Markov chains for link prediction in adaptive web sites. In Proceedings of Software 2002: Computing in an imperfect world (pp. 60–73).
Cadez, I., Heckerman, D., Christopher, M., Padhraic, S., & Steven, W. (2000). Visualization of navigation patterns on a web site using model-based clustering. In Proceedings of conference on Knowledge discovery and data mining (pp. 280–284).
Li, J., & Sadagopan, N. (2008). Characterizing typical and atypical user sessions in clickstreams. In Proceedings of WWW’08.
Levene, M., & Loizou, G. (2003). Computing the entropy of user navigation in the web. Journal of Information Technology and Decision Making, 2, 459–476.
Kullback, S., & Leibler, R. A. (1951). On information and sufficiency. The Annals of Mathematical Statistics, 22(1), 79–86.
Wolfinbarger, M., & Gilly, M. (2000). Consumer motivations for online shopping. In Proceedings of the AMCIS 2000 (pp. 1362–1366), California.
Moe, W. S., & Fader, P. S. (2004). Dynamic conversion behavior at E-commerce sites. Management Science, 50(3), 326–335.
Mongomery, A. L., Li, S., Srinivasan, K., & Lichety, J. C. (2004). Modeling online browsing and path analysis using clickstream data. Marketing Science, 23(4), 579–595.
Sismeiro, C., & Bucklin, R. E. (2004). Modeling purchase behavior at an E-commerce web site: A task completion approach. Journal of Marketing Research, 41(3), 306–323.
Brown, D., & Hayes, N. (2008). Influencer marketing: Who really influences your customers?. Amsterdam: Elsevier.
Parikh, N., & Sundaresan, N. (2008). Scalable and near real-time burst detection from eCommerce queries. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 972–980).
Zhang, H., Parikh, N., Singh, G., & Sundaresan, N. (2013). Chelsea won, and you bought a T-shirt: Characterizing the interplay between Twitter and e-Commerce. In Proceedings of the 2013 IEEE/ACM international conference on advances in social networks analysis and mining (pp. 829–836).
Zhang, Y., & Pennacchiotti, M. (2013). Predicting purchase behaviors from Social Media. In Proceedings of the 22nd international conference on World Wide Web (WWW’13).
Nguyen, T. (2013, February). Q4 (2012) CSE rankings. http://www.cpcstrategy.com/blog/2013/02/q4-2012-cse-rankings/. Published by CPC Strategy.
Goel, S., Hofman, J. M., Lahaie, S., Pennock, D. M., & Watts, D. J. (2010). Predicting consumer behavior with web search. Proceedings of National Academy of Sciences, 107(41), 17486–17490.
Choi, H., & Varian, H. (2012). Predicting the present with Google trends. The Economic Record, 88, 2–9.
Massy, W. F., & Frank, R. E. (1985). Short term price and dealing effects in selected market segments. Journal of Marketing Research, 2, 171–185.
Gilks, W. R., Richardson, S., & Spiegelhalter, D. J. (Eds.). (1996). Markov chain Monte Carlo in practice. London: Chapman and Hall.
Pentland, A., & Lin, A. (1995). Modeling and prediction of human behavior. Neural Computation, 11, 229–242.
Pentland, A. P., & Wren, C. R. (1998). Dynamic models of human motion. In International conference on automatic face and gesture recognition (pp. 22–27).
Galata, A., Johnson, N., & Hogg, D. (2001). Learning variable length Markov models of behavior. Computer Vision and Image Understanding, 81, 398–413.
Borges, J., & Levene, M. (2000). Data mining of user navigation patterns. In: Web usage analysis and user profiling (pp. 92–112). Heidelberg: Springer.
Singer, P., Helic, D., Taraghi, B., & Strohmaier, M. (2014). Detecting memory and structure in human navigation patterns using Markov chain models of varying order. PLoS ONE, 9(7), e102070.
Chierichetti, F., Kumar, R., Raghavan, P., & Sarlos, T. (2012). Are web users really Markovian? In Proceedings of WWW’12 (pp. 609–618). New York: ACM.
Bishop, C. M. (2006). Pattern recognition and machine learning. New York: Springer.
Singla, P., & Domingos, P. (2006). Entity resolution with Markov logic. In Proceedings of the sixth IEEE international conference on data mining (pp. 572–582). Hong Kong: IEEE Computer Society Press.
Kok, S., Sumner, M., Richardson, M., Singla, P., Poon, H., Lowd, D., et al. (2008). The Alchemy system for statistical relational AI. Technical Report. University of Washington. http://alchemy.cs.washington.edu.
Poon, H., & Domingos, P. (2006). Sound and efficient inference with probabilistic and deterministic dependencies. In Proceedings of AAAI-06. Boston: AAAI Press.
Schölkopf, B., Burges, C., & Smola, A. (Eds.). (1998). Advances in kernel methods: Support vector machines. Cambridge, MA: MIT Press.
Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984). Classification and regression trees. Belmont, CA: Wadsworth.
Quinlan, J. R. (1986). Induction of decision trees. Machine Learning, 1(1), 81–106.
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Gupta, M., Mittal, H., Singla, P. et al. Analysis and characterization of comparison shopping behavior in the mobile handset domain. Electron Commer Res 17, 521–551 (2017). https://doi.org/10.1007/s10660-016-9226-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10660-016-9226-7