Abstract
Effectively modeling and predicting the size of information cascades is essential for downstream tasks such as rumor detection and epidemic prevention. Traditional methods normally rely on tedious hand-crafted feature engineering efforts, which is inefficient in complex diffusion processes such as social network (SN) cascades. In recent years, graph neural network methods have been successfully used in cascade prediction tasks. Most of these methods make use of the structural features of SNs, while the effect of textual user-generated content (UGC) is far from clear or fully utilized. In this paper, we focus on the questions of how the textual UGC affect user activation state and trigger their retweet behaviors. We propose a novel GNN-based model named TSGNN, which jointly model the textual content and structure features. It uses recurrent neural networks with attentions to learn content feature representations that potentially affect information propagation. We find that tweets of fewer high coherent topics are more likely to trigger user retweet behaviors, and we also design a gate mechanism to model the activation state of users under the combined influence of content, structure, and other self-activation. Experimental results demonstrate that TSGNN significantly outperforms all the state-of-the-art methods in multiple metrics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bian, T., et al.: Rumor detection on social media with bi-directional graph convolutional networks. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 549–556 (2020)
Cao, Q., Shen, H., Gao, J., Wei, B., Cheng, X.: Popularity prediction on social platforms with coupled graph neural networks. In: Proceedings of the 13th International Conference on Web Search and Data Mining, pp. 70–78 (2020)
Chen, X., Zhou, F., Zhang, K., Trajcevski, G., Zhong, T., Zhang, F.: Information diffusion prediction via recurrent cascades convolution. In: 2019 IEEE 35th International Conference on Data Engineering (ICDE), pp. 770–781. IEEE (2019)
Cheng, J., Adamic, L., Dow, P.A., Kleinberg, J.M., Leskovec, J.: Can cascades be predicted? In: Proceedings of the 23rd International Conference on World Wide Web, pp. 925–936 (2014)
Hong, L., Dan, O., Davison, B.D.: Predicting popular messages in Twitter. In: Proceedings of the 20th International Conference Companion on World Wide Web, pp. 57–58 (2011)
Huang, Z., Wang, Z., Zhang, R.: Cascade2vec: learning dynamic cascade representation by recurrent graph neural networks. IEEE Access 7, 144800–144812 (2019)
Jenders, M., Kasneci, G., Naumann, F.: Analyzing and predicting viral tweets. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 657–664 (2013)
Khabiri, E., Hsu, C.F., Caverlee, J.: Analyzing and predicting community preference of socially generated metadata: a case study on comments in the digg community. In: ICWSM (2009)
Leskovec, J., Adamic, L.A., Huberman, B.A.: The dynamics of viral marketing. ACM Trans. Web (TWEB) 1(1), 5-es (2007)
Li, C., Ma, J., Guo, X., Mei, Q.: DeepCas: an end-to-end predictor of information cascades. In: Proceedings of the 26th International Conference on World Wide Web, pp. 577–586 (2017)
Liao, D., Xu, J., Li, G., Huang, W., Liu, W., Li, J.: Popularity prediction on online articles with deep fusion of temporal process and content features. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 200–207 (2019)
Liu, Y., Wu, Y.F.B.: Early detection of fake news on social media through propagation path classification with recurrent and convolutional networks. In: Thirty-Second AAAI Conference on Artificial Intelligence (2018)
Ma, Z., Sun, A., Cong, G.: On predicting the popularity of newly emerging hashtags in Twitter. J. Am. Soc. Inform. Sci. Technol. 64(7), 1399–1410 (2013)
Naveed, N., Gottron, T., Kunegis, J., Alhadi, A.C.: Bad news travel fast: a content-based analysis of interestingness on Twitter. In: Proceedings of the 3rd International Web Science Conference, pp. 1–7 (2011)
Rong, Y., Huang, W., Xu, T., Huang, J.: Dropedge: towards deep graph convolutional networks on node classification. In: International Conference on Learning Representations (2019)
Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., Bengio, Y.: Graph attention networks. arXiv preprint arXiv:1710.10903 (2017)
Yan, X., Guo, J., Lan, Y., Cheng, X.: A biterm topic model for short texts. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 1445–1456 (2013)
Zhao, L., et al.: Online flu epidemiological deep modeling on disease contact network. GeoInformatica 24(2), 443–475 (2019). https://doi.org/10.1007/s10707-019-00376-9
Zhao, Q., Erdogdu, M.A., He, H.Y., Rajaraman, A., Leskovec, J.: Seismic: a self-exciting point process model for predicting tweet popularity. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1513–1522 (2015)
Zhao, Y., Wang, C., Chi, C.H., Lam, K.Y., Wang, S.: A comparative study of transactional and semantic approaches for predicting cascades on twitter. In: IJCAI, pp. 1212–1218 (2018)
Zhou, F., Xu, X., Trajcevski, G., Zhang, K.: A survey of information cascade analysis: models, predictions and recent advances. arXiv preprint arXiv:2005.11041 (2020)
Acknowledgement
This work is supported by the National Key R&D Program of China under Grants No. 2018YFC0831703.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Liu, Y., Zeng, K., Wang, H., Song, X., Zhou, B. (2021). Content Matters: A GNN-Based Model Combined with Text Semantics for Social Network Cascade Prediction. In: Karlapalem, K., et al. Advances in Knowledge Discovery and Data Mining. PAKDD 2021. Lecture Notes in Computer Science(), vol 12712. Springer, Cham. https://doi.org/10.1007/978-3-030-75762-5_57
Download citation
DOI: https://doi.org/10.1007/978-3-030-75762-5_57
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-75761-8
Online ISBN: 978-3-030-75762-5
eBook Packages: Computer ScienceComputer Science (R0)