Skip to main content

CasSampling: Exploring Efficient Cascade Graph Learning for Popularity Prediction

  • Conference paper
  • First Online:
Machine Learning and Knowledge Discovery in Databases: Research Track (ECML PKDD 2023)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14171))

  • 857 Accesses

Abstract

Predicting the growth size of an information cascade is one of the primary challenges in understanding the diffusion of information. Recent efforts focus on utilizing graph neural networks to capture graph structure. However, there is considerable variance in the information cascade size (from few to million). From the perspective of efficiency and performance, the method of modeling each node is inappropriate for graph neural networks. In this paper, we propose a novel deep learning framework for popularity prediction called CasSampling. Firstly, we exploit a heuristic algorithm to sample the critical part of cascade graph. For the loss of structure information due to sampling, we keep outdegree of sampled node in the global graph as part of the node feature into the graph attention networks. For the loss of temporal information due to sampling, we utilize the time series to learn the global propagation time flow. Then, we design an attention aggregator for node-level representation to better integrate local-level propagation into the global-level time flow. Experiments conducted on two benchmark datasets demonstrate that our method significantly outperforms the state-of-the-art methods for popularity prediction. Additionally, the computation cost is much less than the baselines. Code and (public) datasets are available at https://github.com/Gration-Cheng/CasSampling.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Mishra, S., Rizoiu, M.A., Xie, L.: Modeling popularity in asynchronous social media streams with recurrent neural networks. In: Twelfth International AAAI Conference on Web and Social Media (2018)

    Google Scholar 

  2. Li, G., Chen, S., Feng, J., Tan, K.l., Li, W.S.: Efficient location-aware influence maximization. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp. 87–98 (2014)

    Google Scholar 

  3. Cao, Q., Shen, H., Cen, K., Ouyang, W., Cheng, X.: Deephawkes: bridging the gap between prediction and understanding of information cascades. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp. 1149–1158 (2017)

    Google Scholar 

  4. Chen, X., Zhou, F., Zhang, K., Trajcevski, G., Zhong, T., Zhang, F.: Information diffusion prediction via recurrent cascades convolution. In: 2019 IEEE 35th International Conference on Data Engineering (ICDE), pp. 770–781. IEEE (2019)

    Google Scholar 

  5. Tang, X., Liao, D., Huang, W., Xu, J., Zhu, L., Shen, M.: Fully exploiting cascade graphs for real-time forwarding prediction. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 582–590 (2021)

    Google Scholar 

  6. Yuan, C., Li, J., Zhou, W., Lu, Y., Zhang, X., Hu, S.: DyHGCN: a dynamic heterogeneous graph convolutional network to learn users’ dynamic preferences for information diffusion prediction. In: Hutter, F., Kersting, K., Lijffijt, J., Valera, I. (eds.) ECML PKDD 2020. LNCS (LNAI), vol. 12459, pp. 347–363. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-67664-3_21

    Chapter  Google Scholar 

  7. Cao, Q., Shen, H., Gao, J., Wei, B., Cheng, X.: Popularity prediction on social platforms with coupled graph neural networks. In: Proceedings of the 13th International Conference on Web Search and Data Mining, pp. 70–78 (2020)

    Google Scholar 

  8. Wu, Z., Zhou, J., Liu, L., Li, C., Gu, F.: Deep popularity prediction in multi-source cascade with HERI-GCN. In: 2022 IEEE 38th International Conference on Data Engineering (ICDE) (2022)

    Google Scholar 

  9. Weng, L., Menczer, F., Ahn, Y.Y.: Virality prediction and community structure in social networks. Sci. Rep. 3(1), 1–6 (2013)

    Article  Google Scholar 

  10. Cui, P., Jin, S., Yu, L., Wang, F., Zhu, W., Yang, S.: Cascading outbreak prediction in networks: a data-driven approach. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 901–909 (2013)

    Google Scholar 

  11. Ma, Z., Sun, A., Cong, G.: On predicting the popularity of newly emerging hashtags in twitter. J. Am. Soc. Inform. Sci. Technol. 64(7), 1399–1410 (2013)

    Article  Google Scholar 

  12. Petrovic, S., Osborne, M., Lavrenko, V.: Rt to win! predicting message propagation in twitter. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 5, pp. 586–589 (2011)

    Google Scholar 

  13. Shulman, B., Sharma, A., Cosley, D.: Predictability of popularity: gaps between prediction and understanding. In: Tenth International Conference on Web and Social Media (2016)

    Google Scholar 

  14. Bao, P., Shen, H.W., Huang, J., Cheng, X.Q.: Popularity prediction in microblogging network: a case study on Sina Weibo. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 177–178 (2013)

    Google Scholar 

  15. Cheng, J., Adamic, L., Dow, P.A., Kleinberg, J.M., Leskovec, J.: Can cascades be predicted? In: Proceedings of the 23rd International Conference on World Wide Web, pp. 925–936 (2014)

    Google Scholar 

  16. Rizoiu, M.A., Xie, L., Sanner, S., Cebrian, M., Yu, H., Van Hentenryck, P.: Expecting to be hip: hawkes intensity processes for social media popularity. In: Proceedings of the 26th International Conference on World Wide Web, pp. 735–744 (2017)

    Google Scholar 

  17. Mishra, S., Rizoiu, M.A., Xie, L.: Feature driven and point process approaches for popularity prediction. In: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, pp. 1069–1078 (2016)

    Google Scholar 

  18. Shen, H., Wang, D., Song, C., Barabási, A.L.: Modeling and predicting popularity dynamics via reinforced poisson processes. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 28 (2014)

    Google Scholar 

  19. Yang, S.H., Zha, H.: Mixture of mutually exciting processes for viral diffusion. In: International Conference on Machine Learning, pp. 1–9. PMLR (2013)

    Google Scholar 

  20. Xu, X., Zhou, F., Zhang, K., Liu, S., Trajcevski, G.: Casflow: exploring hierarchical structures and propagation uncertainty for cascade prediction. IEEE Trans. Knowl. Data Eng. (2021)

    Google Scholar 

  21. Xu, Z., Qian, M., Huang, X., Meng, J.: CasGCN: predicting future cascade growth based on information diffusion graph. arXiv preprint arXiv:2009.05152 (2020)

  22. Liao, D., Xu, J., Li, G., Huang, W., Liu, W., Li, J.: Popularity prediction on online articles with deep fusion of temporal process and content features. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 200–207 (2019)

    Google Scholar 

  23. Veličković, P., Cucurull, G., Casanova, A., Romero, A., Liò, P., Bengio, Y.: Graph attention networks. In: International Conference on Learning Representations (2018, accepted as poster). https://openreview.net/forum?id=rJXMpikCZ

  24. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)

    Google Scholar 

  25. Chen, X., Zhang, F., Zhou, F., Bonsangue, M.: Multi-scale graph capsule with influence attention for information cascades prediction. Int. J. Intell. Syst. 37(3), 2584–2611 (2022)

    Article  Google Scholar 

Download references

Acknowledgments

The work was supported by National Natural Science Foundation of China (Grant Nos. 61966020, 61972186, U21B2027), Yunnan high-tech industry development project (Grant No. 201606), Yunnan provincial major science and technology special plan projects (Grant No. 202103AA080015, 202002AD080001-5), Yunnan Basic Research Project (Grant No. 202001AS070014), and Talents and Platform Program of Science and Technology of Yunnan (Grant No. 202105AC160018).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xin Yan .

Editor information

Editors and Affiliations

Ethics declarations

Ethical Statement

The purpose of this paper is to explore efficient and effective methods for learning cascade graphs for popularity prediction while adhering to academic integrity and research ethics requirements. We used publicly available data from social media datasets that have been authorized by Twitter and Weibo officials. To ensure the confidentiality of personal information, all data is anonymized and stored securely. We obtained approval and permission from the ethics committee of our institution to conduct this research.

The models and algorithms used in this study are based on publicly available data and previous research results, and we have thoroughly tested and verified them. We commit to conducting a transparent and fair evaluation of the algorithms and models used in this research, and we will present them fully in the paper.

Throughout this study, we will adhere to academic standards and ethical requirements, striving to avoid any behavior that may violate these requirements. We hope that this research will contribute to the development of cascade graph learning and popularity prediction, promoting further research in this area.

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Cheng, G., Yan, X., Gao, S., Xu, G., Miao, X. (2023). CasSampling: Exploring Efficient Cascade Graph Learning for Popularity Prediction. In: Koutra, D., Plant, C., Gomez Rodriguez, M., Baralis, E., Bonchi, F. (eds) Machine Learning and Knowledge Discovery in Databases: Research Track. ECML PKDD 2023. Lecture Notes in Computer Science(), vol 14171. Springer, Cham. https://doi.org/10.1007/978-3-031-43418-1_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-43418-1_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-43417-4

  • Online ISBN: 978-3-031-43418-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics