Abstract
Understanding the dynamics underlying the diffusion of new ideas or technology in a society is an important task with implications for sciences such sociology and economics, as well as important business applications, especially in marketing. In this article, we take a first step in this direction, by studying the problem of how to model, in a simple and useful abstraction, the complex process of innovation diffusion. Our unique input is a database of adoptions \(\mathbb {D}\), which is a relation (User,Item,Time) where a tuple \({\left\langle u, i, t\right\rangle } \in \mathbb {D}\) indicates that the user u adopted the item i at time t. For our aim, we propose a stochastic model which decomposes a diffusion trace (i.e., the sequence of adoptions of the same item i) in an ordered sequence of stages, where each stage is intuitively built around two dimensions: users and relative speed at which adoptions happen. Each stage is characterized by a specific rate of adoption and it involves different users to different extent, while the sequentiality in the diffusion is guaranteed by constraining the transition probabilities among stages. An empirical evaluation on synthetic and real-world adoption datasets shows the effectiveness of the proposed framework in summarizing the adoption process, enabling several analysis tasks such as the identification of adopter categories, clustering and characterization of diffusion traces, and prediction of which users will adopt an item in the next future.












Similar content being viewed by others
Notes
All over the paper, we use the term “stage” when referring to the phenomenon we want to model (e.g., “stage of diffusion”), while we use “state” when referring to the concrete model (e.g., “the state of the HMM).
Publicly available at http://grouplens.org/datasets/hetrec-2011/.
Discontinued in May 25, 2012.
References
Adar E, Adamic LA (2005) Tracking information epidemics in blogspace. In: Proceedings of the 2005 IEEE/WIC/ACM international conference on web intelligence, WI ’05, pp 207–214
Allan J, Papka R, Lavrenko V (1998) On-line new event detection and tracking. In: Proceedings of the 21st annual international ACM SIGIR conference on research and development in information retrieval, pp 37–45
Bakis R (1976) Continuous speech recognition via centisecond acoustic states. Acoust Soc Am J 59:97
Bakshy E, Karrer B, Adamic LA (2009) Social influence and the diffusion of user-created content. In: Proceedings of the 10th ACM conference on electronic commerce, EC ’09, pp 325–334
Bakshy E, Rosenn I, Marlow C, Adamic L (2012) The role of social networks in information diffusion. In: Proceedings of the 21st international conference on World Wide Web, pp 519–528
Bao P, Shen H-W, Huang J, Cheng X-Q (2013) Popularity prediction in microblogging network: a case study on sina weibo. In: Proceedings of the 22nd international conference on World Wide Web companion, pp 177–178. International World Wide Web Conferences Steering Committee
Barbieri N, Bonchi F, Manco G (2012) Topic-aware social influence propagation models. In: 12th IEEE international conference on data mining, ICDM 2012, Brussels, Belgium, December 10–13, 2012, pp 81–90
Barbieri N, Bonchi F, Manco G (2013a) Cascade-based community detection. In: Sixth ACM international conference on web search and data mining, WSDM 2013, Rome, Italy, February 4–8, 2013, pp 33–42
Barbieri N, Bonchi F, Manco G (2013b) Topic-aware social influence propagation models. Knowl Inf Syst 37(3):555–584
Bishop C M et al (2006) Pattern recognition and machine learning, vol 1. Springer, New York
Bonchi F (2011) Influence propagation in social networks: a data mining perspective. IEEE Intell Inf Bull 12(1):8–16
Budak C, Agrawal D, El Abbadi A (2012) Diffusion of information in social networks: is it all local? In: 2012 IEEE 12th international conference on data mining (ICDM), pp 121–130. IEEE
Cheng J, Adamic L, Dow PA, Kleinberg JM, Leskovec J (2014) Can cascades be predicted? In: Proceedings of the 23rd international conference on World wide web, pp 925–936. International World Wide Web Conferences Steering Committee
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B (Methodol) 39:1–38
Du N, Song L, Gomez-Rodriguez M, Zha H (2013) Scalable influence estimation in continuous-time diffusion networks. Adv Neural Inf Process Syst 26:3147–3155
Gruhl D, Guha R, Liben-Nowell D, Tomkins A (2004) Information diffusion through blogspace. In: Proceedings of the 13th international conference on World Wide Web, WWW ’04, pp 491–501
Hong L, Dan O, Davison BD (2011) Predicting popular messages in twitter. In: Proceedings of the 20th international conference companion on World Wide Web, WWW ’11, pp 57–58
Kempe D, Kleinberg J, Tardos E (2003) Maximizing the spread of influence through a social network. KDD ’03, pp 137–146
Kleinberg J (2003) Bursty and hierarchical structure in streams. Data Min Knowl Discov 7(4):373–397
Lerman K, Hogg T (2010) Using a model of social dynamics to predict popularity of news. In: Proceedings of the 19th international conference on World wide web, pp 621–630. ACM
Leskovec J, Backstrom L, Kleinberg J (2009) Meme-tracking and the dynamics of the news cycle. In: Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining, pp 497–506. ACM
Levinson SE (1986) Continuously variable duration hidden Markov models for automatic speech recognition. Comput Speech Lang 1(1):29–45
Ma Z, Sun A, Cong G (2013) On predicting the popularity of newly emerging hashtags in twitter. J Am Soc Inf Sci Technol 64(7):1399–1410
Mehmood Y, Barbieri N, Bonchi F, Ukkonen A (2013) CSI: community-level social influence analysis. In: Machine learning and knowledge discovery in databases, pp 48–63. Springer
Mele I, Bonchi F, Gionis A (2012) The early-adopter graph and its application to web-page recommendation. In: Proceedings of the 21st ACM international conference on information and knowledge management, pp 1682–1686. ACM
Oates T, Firoiu L, Cohen PR (1999) Clustering time series with hidden Markov models and dynamic time warping. In: Proceedings of the IJCAI-99 workshop on neural, symbolic and reinforcement learning methods for sequence learning, pp 17–21
Rabiner L (1989) A tutorial on hidden Markov models and selected applications in speech recognition. Proc IEEE 77(2):257–286
Rand WM (1971) Objective criteria for the evaluation of clustering methods. J Am Stat Assoc 66(336):846–850
Ratkiewicz J, Fortunato S, Flammini A, Menczer F, Vespignani A (2010) Characterizing and modeling the dynamics of online popularity. Phys Rev Lett 105(15):158701
Rodriguez MG, Balduzzi D, Schölkopf B (2011) Uncovering the temporal dynamics of diffusion networks. arXiv preprint arXiv:1105.0697
Rogers EM (2003) Diffusion of innovations, 5th edn. Free Press, New York
Saez-Trumper D, Comarela G, Almeida V, Baeza-Yates R, Benevenuto F (2012) Finding trendsetters in information networks. In: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 1014–1022. ACM
Saito K, Kimura M, Ohara K, Motoda H (2009) Learning continuous-time information diffusion model for social behavioral data analysis. In: Proceedings of the 1st Asian conference on machine learning: advances in machine learning, pp 322–337
Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6:461–464
Shen H-W, Wang D, Song C, Barabási A-L (2014) Modeling and predicting popularity dynamics via reinforced Poisson processes. arXiv preprint arXiv:1401.0778
Swan R, Allan J (1999) Extracting significant time varying features from text. In: Proceedings of the eighth international conference on Information and knowledge management, pp 38–45. ACM
Swan R, Allan J (2000) Automatic generation of overview timelines. In: Proceedings of the 23rd annual international ACM SIGIR conference on research and development in information retrieval, pp 49–56. ACM
Szabo G, Huberman BA (2010) Predicting the popularity of online content. Commun ACM 53(8):80–88
Weng L, Ratkiewicz J, Perra N, Gonçalves B, Castillo C, Bonchi F, Schifanella R, Menczer F, Flammini A (2013) The role of information diffusion in the evolution of social networks. In: Proceedings of the 19th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’13, pp 356–364
Yan R, Tang J, Liu X, Shan D, Li X (2011) Citation count prediction: learning to estimate future citations for literature. In: Proceedings of the 20th ACM international conference on Information and knowledge management, pp 1247–1252. ACM
Yang J, Leskovec J (2011) Patterns of temporal variation in online media. In: Proceedings of the fourth ACM international conference on web search and data mining, WSDM ’11, pp 177–186
Yang Y, Pierce T, Carbonell J (1998) A study of retrospective and on-line event detection. In: Proceedings of the 21st annual international ACM SIGIR conference on research and development in information retrieval, SIGIR ’98, pp 28–36
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Compliance with ethical standards
Consent to submit has been received explicitly from all coauthors, as well as from the responsible authorities—tacitly or explicitly—at the institute/organization where the work has been carried out, before the work is submitted. Authors whose names appear on the submission have contributed sufficiently to the scientific work and therefore share collective responsibility and accountability for the results.
Conflict of interest
The authors declare that they have no conflict of interest.
Rights and permissions
About this article
Cite this article
Mehmood, Y., Barbieri, N. & Bonchi, F. Modeling adoptions and the stages of the diffusion of innovations. Knowl Inf Syst 48, 1–27 (2016). https://doi.org/10.1007/s10115-015-0889-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-015-0889-5