Skip to main content
Log in

Prediction of brand stories spreading on social networks

  • Regular Article
  • Published:
Advances in Data Analysis and Classification Aims and scope Submit manuscript

Abstract

Online social network is a major media for many types of information communication. Although the primary purpose of social networks is to connect people, they are more and more used in online marketing to connect businesses with customers as well as to connect customers amongst themselves. Brand stories generated by consumers or businesses can be easily and widely spread. As a result, those stories have a huge influence on the marketplace and indirectly affect the brand success. Understanding and modeling how a piece of information is spread on social media and its spreading level are crucial for business managers; not only to understand the information diffusion, but also for them to better control it. In this paper, we aim at developing models in order to predict the spread of brand stories on social networks, both in term of spreadability and spreading level. We applied several machine learning algorithms using three categories of features based on user-profile, temporal, and content of tweets. Experimental results on three tweet collections about brand stories reveal that our model significantly improves the prediction accuracy by about 4% compared to the related work.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. https://www.smartinsights.com/social-media-marketing/social-media-strategy/new-global-social-media-research/.

  2. https://www.dreamgrow.com/top-15-most-popular-social-networking-sites/.

  3. https://expandedramblings.com/index.php/march-2013-by-the-numbers-a-few-amazing-twitter-stats/.

  4. Twitter is an online news and social networking service where people communicate in short messages called tweets.

  5. Institut de Recherche en Informatique de Toulouse, UMR5505 CNRS, France.

  6. Twitter Streaming API is documented on https://developer.twitter.com/en/docs/tweets/filter-realtime/guides/connecting.

  7. Twitter Search API is documented on https://developer.twitter.com/en/docs/tweets/search/api-reference/get-search-tweets.

  8. BDpedia structures the information from Wikipedia pages; it can be queried using SPARQL to extract structured information locally stored in DBpedia or through an endpoint framework.

  9. https://pypi.python.org/pypi/holidays.

  10. http://dbpedia.org/snorql/.

  11. https://www.cs.york.ac.uk/semeval-2013/task2/index.html.

  12. https://pythonprogramming.net/new-data-set-training-nltk-tutorial/.

  13. http://weka.sourceforge.net/doc.stable/weka/classifiers/functions/LibLINEAR.html .

  14. https://osirim.irit.fr/ OSIRIM for Observatory of Systems Information Retrieval and Indexing of Multimedia contents is one of the IRIT platforms. It is a federative project mainly supported by the European Regional Development Fund (ERDF), the French Government, the Region Midi-Pyrénées and the National Center for the Scientific Research (CNRS).

  15. Synthetic Minority Over-sampling Technique: synthesises new minority instances by “taking each minority class sample and introducing synthetic examples along the line segments joining any/all of the k minority class nearest neighbors.” (Chawla et al. 2002).

  16. http://weka.sourceforge.net/doc.stable/weka/attributeSelection/PrincipalComponents.html.

References

  • Assaad W, Gomez JM (2011) Social network in marketing (social media marketing) opportunities and risks. Int J Manag Public Sector Inform Commun Technol 2:13

    Google Scholar 

  • Bulearca M, Bulearca S (2010) Twitter: a viable marketing tool for SMEs? Global Bus Manag Res 2(4):296

    Google Scholar 

  • Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) Smote: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357

    Article  Google Scholar 

  • Fortin D, Uncles M, Burton S, Soboleva A (2011) Interactive or reactive? Marketing with Twitter. J Consum Mark 28:491–499

    Article  Google Scholar 

  • Gensler S, Völckner F, Liu-Thompkins Y, Wiertz C (2013) Managing brands in the social media environment. J Interact Mark 27(4):242–256

    Article  Google Scholar 

  • Greer CF, Ferguson DA (2011) Using Twitter for promotion and branding: a content analysis of local television Twitter sites. J Broadcast Electron Med 55(2):198–214

    Article  Google Scholar 

  • Hoang TBN, Mothe J (2018a) Location extraction from tweets. Inform Process Manag 54(2):129–144

    Article  Google Scholar 

  • Hoang TBN, Mothe J (2018b) Predicting information diffusion on Twitter—analysis of predictive features. J Comput Sci 28:257–264

    Article  Google Scholar 

  • Hong L, Dan O, Davison BD (2011) Predicting popular messages in Twitter. In: Proceedings of the 20th international conference companion on World wide web. ACM, pp 57–58

  • Hu Y, Hu C, Fu S, Shi P, Ning B (2016) Predicting the popularity of viral topics based on time series forecasting. Neurocomputing 210:55–65

    Article  Google Scholar 

  • Krouska A, Troussas C, Virvou M (2017) Comparative evaluation of algorithms for sentiment analysis over social networking services. J UCS 23(8):755–768

    Google Scholar 

  • Kwak H, Lee C, Park H, Moon S (2010) What is Twitter, a social network or a news media? In: Proceedings of the 19th international conference on World wide web. ACM, pp 591–600

  • Laporte L, Flamary R, Canu S, Déjean S, Mothe J (2013) Non-convex regularizations for feature selection in ranking with sparse SVM. IEEE Trans Neural Netw Learn Syst 25(6):1118–1130

  • Le QV, Mikolov T (2014) Distributed representations of sentences and documents. ICML 14:1188–1196

    Google Scholar 

  • Lespagnol C, Mothe J, Ullah MZ (2019) Information nutritional label and word embedding to estimate information check-worthiness (short paper). In: ACM SIGIR, pp 941–944

  • Lingad J, Karimi S, Yin J (2013) Location extraction from disaster-related microblogs. In: Int. conf. on WWW. ACM, pp 1017–1020

  • Mangold WG, Faulds DJ (2009) Social media: the new hybrid element of the promotion mix. Bus Horiz 52(4):357–365

    Article  Google Scholar 

  • Mike Gotta PO (2006) Trends in social software. Collaboration and content strategies in-depth research overview

  • Mothe J, Ramiandrisoa F, Rasolomanana M (2018) Automatic keyphrase extraction using graph-based methods (short paper). In: ACM symposium on applied computing (SAC 2018). http://www.sigapp.org, https://www.irit.fr/publis/SIG/2018_SAC_MRR.pdf

  • Remy C, Pervin N, Toriumi F, Takeda H (2013) Information diffusion on Twitter: everyone has its chance, but all chances are not equal. In: Signal-image technology & internet-based systems (SITIS). IEEE, pp 483–490

  • Ritter A, Clark S, Etzioni O et al (2011) Named entity recognition in tweets: an experimental study. In: Proceedings of the conference on empirical methods in natural language processing. ACL, pp 1524–1534

  • Rogers M, Chapman C, Giotsas V (2012) Measuring the diffusion of marketing messages across a social network. J Direct Data Digit Mark Pract 14(2):97–130

    Article  Google Scholar 

  • Sabate F, Berbegal-Mirabent J, Cañabate A, Lebherz PR (2014) Factors influencing popularity of branded content in Facebook fan pages. Eur Manag J 32(6):1001–1011

    Article  Google Scholar 

  • Suh B, Hong L, Pirolli P, Chi EH (2010) Want to be retweeted? Large scale analytics on factors impacting retweet in Twitter network. In: 2010 IEEE second international conference on social computing (socialcom). IEEE, pp 177–184

  • Tamine L, Soulier L, Ben Jabeur L, Amblard F, Hanachi C, Hubert G, Roth C (2016) Social media-based collaborative information access: analysis of online crisis-related Twitter conversations. In: Proceedings of the 27th ACM conference on hypertext and social media. ACM, pp 159–168

  • Varshney D, Kumar S, Gupta V (2017) Predicting information diffusion probabilities in social networks: a Bayesian networks based approach. Knowl Based Syst 133:66–76

    Article  Google Scholar 

  • Xiong F, Liu Y, Zhang Z, Zhu J, Zhang Y (2012) An information diffusion model based on retweeting mechanism for online social media. Phys Lett A 376(30):2103–2108

    Article  Google Scholar 

  • Yang Z, Guo J, Cai K, Tang J, Li J, Zhang L, Su Z (2010) Understanding retweeting behaviors in social networks. In: Int. conf. on information and knowledge management. ACM, pp 1633–1636

  • Yu B, Chen M, Kwok L (2011) Toward predicting popularity of social marketing messages. In: International conference on social computing, behavioral-cultural modeling, and prediction. Springer, pp 317–324

  • Zhang J, Liu B, Tang J, Chen T, Li J (2013) Social influence locality for modeling retweeting behaviors. IJCAI 13:2761–2767

    Google Scholar 

Download references

Acknowledgements

This work has been partially funded by the European Union’s Horizon 2020 H2020-SU-SEC-2018 under the Grant Agreement n\(^{\circ }\)833115 (PREVISION Project). However, this paper presents the paper authors’ own views.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Thi Bich Ngoc Hoang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hoang, T.B.N., Mothe, J. Prediction of brand stories spreading on social networks. Adv Data Anal Classif 16, 559–591 (2022). https://doi.org/10.1007/s11634-021-00450-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11634-021-00450-x

Keywords

Mathematics Subject Classification

Navigation